Comment by mock-possum
Comment by mock-possum 3 hours ago
I don’t read this as “don’t show we broke the law,” I read it as “don’t give the user the false impression that there’s any legal issue with this generated content.”
There’s nothing law breaking about quoting publicly available information. Google isn’t breaking the law when it displays previews of indexed content returned by the search algorithm, and that’s clearly the approach being taken here.
Masked token prediction is reconstruction. It goes far beyond “quoting.”