Comment by esafak
That's got nothing to do with it. It's all about copyright. Can it reproduce its training data verbatim? If so, Meta is in hot water.
That's got nothing to do with it. It's all about copyright. Can it reproduce its training data verbatim? If so, Meta is in hot water.
I read harry potter, and you ask me about a page, and I can recite it verbatim, did I just commit copyright infringement?
I pay for a service. The service recites a novel to me. The service would need permission to do this or it is copyright infringement.
But if it's corpora do NOT include the Harry Potter books then Meta is NOT in hot water,! So take the Harry Potter books out of the corpora. What is lost? Nothing IMO useful other than the ability to discuss Harry Potter books. BFD.