Comment by barrucadu

Comment by barrucadu 3 hours ago

5 replies

> I don’t think a country’s government can justify no commercial LLMs to its populace

They're not saying no LLMs, they're saying no LLMs using lyrics without a license. OpenAI simply need to pay for a license, or train an LLM without using lyrics.

akersten 12 minutes ago

Oi, you got a loisense to read those words and then repeat them back to me when asked?

Myrmornis 3 hours ago

But lyrics are just one example. Are you saying that training experiments must filter out all substrings from the training input that bear too close a resemblance to a substring of a copyrighted work?

  • barrucadu 2 hours ago

    Obviously there's a limit, reproducing a single sentence is unlikely to be copyright infringement just because there are only so many words in a language; but if reproducing some text would be copyright infringement if a human did it, I don't see why LLM companies should get a free pass.

    If it's really essential that they train their models on song lyrics, or books, or movie scripts, or articles, or whatever, they should pay license fees.

  • freejazz 15 minutes ago

    At some point, use of the lyrics becomes de minimis

luke5441 3 hours ago

This obviously applies to all copyrighted works. I could sue OpenAI when it reproduces my source code that I published on the Internet.

They already "filter" the code to prevent it from happening (reproducing exact works). My guess it is just superficially changing things around so it is harder to prove copyright violations.