Comment by TremendousJudge
Comment by TremendousJudge a day ago
No, the exception they are asking for (we can train on copyrighted material and the image produced is non-copyright infringing) is copyright infringing in the most basic sense.
I'll prove it by induction: Imagine that I have a service where I "train" a model on a single image of Indiana Jones. Now you prompt it, and my model "generates" the same image. I sell you this service, and no money goes to the copyright holder of the original image. This is obviously infringment.
There's no reason why training on a billion images is any different, besides the fact that the lines are blurred by the model weights not being parseable
>There's no reason why training on a billion images is any different
You gloss over this as if it's a given. I don't agree. I think you're doing a different thing when you're sampling billions of things equallly.