Comment by Retric
> don’t you open the door to the same argument for copyright itself?
Yes, it comes down to intentional control of output. Copyright applies when someone uses a pen to make a drawing because of the degree of control.
On the flip side there are copyright free photos where an animal picked up a camera etc, the same applies to a great deal of automatically generated data. The output of an LLM is likely in the public domain unless it’s a derivative work of something in the training set.