Comment by skydhash
The analogies fail because the copyrighted material were not used for creating the copy machine, Illustration, or (maybe?) the keyboard suggestion engine. If LLMs were produced ethically, then the whole discussion is moot. But if the only way to produce copyrighted material requires being trained on copyrighted material, then...
Copyright law applies to distribution of output, not input.
An artist, writer, whoever, could read all the copyrighted material in the world, even pirated material, unless their output is a copy or copyrighted artifact, then there is no infringement.