Comment by Cthulhu_
Comment by Cthulhu_ 13 hours ago
You (and the article, etc) show what a lot of the "work" in AI is going into at the moment - creating guardrails against creating something that might get them in trouble, and / or customizing weights and prompts under water to generate stuff that isn't the obvious. I'm reminded of when Google's image generator came up and this customization bit them in the ass when they generated a black pope or asian vikings. AI tools don't do what you wish they did, they do what you tell them and what they are taught, and if 99% of their learning set associates Mario with prompts for Italian plumbers, that's what you'll get.
A possible (probably already exists) business is setting up truly balanced learning sets, that is, thousands of unique images that match the idea of an italian plumber, with maybe 1% of Mario. But that won't be nearly as big a learning set as the whole internet is, nor will it be cheap to build it compared to just scraping the internet.
I would love to know how YouTube does this for music. There's some holes obviously, like some cover artists will play the iconic riffs of a song and then stop somewhere. There's people who do reels or "commentary" of a movie scene and then put some horrible high pitched music to mask it from copyright.
There's probably even some rules around this to only detect just enough to take legal action. Like GP stumbled on a trademark landmine, but obviously just selling red shirts with a bird on it can't be a trademark violation; it needs to be a specific kind of red too.