An image of an archeologist adventurer who wears a hat and uses a bullwhip
(theaiunderwriter.substack.com)1369 points by participant3 a day ago
1369 points by participant3 a day ago
I asked Sora to turn a random image of my friend and myself into Italian plumbers. Nothing more, just the two words "Italian plumbers". The created picture was not shown to me because it was in violation of OpenAI's content policy. I asked then just to turn the guys on the picture into plumbers, but I asked this in the Italian language. Without me asking for it, Sora put me in an overall and gave me a baseball cap, and my friend another baseball cap. If I asked Sora to put mustache on us, one of us received a red shirt as well, without being asked to. Starting with the same pic, if I asked to put one letter on the baseball caps each - guess, the letters chosen were M and L. These extra guardrails are not really useful with such a strong, built-in bias towards copyright infringement of these image creation tools. Should it mean that with time, Dutch pictures will have to include tulips, Italian plumbers will have to have a uniform with baseball caps with L and M, etc. just not to confuse AI tools?
You (and the article, etc) show what a lot of the "work" in AI is going into at the moment - creating guardrails against creating something that might get them in trouble, and / or customizing weights and prompts under water to generate stuff that isn't the obvious. I'm reminded of when Google's image generator came up and this customization bit them in the ass when they generated a black pope or asian vikings. AI tools don't do what you wish they did, they do what you tell them and what they are taught, and if 99% of their learning set associates Mario with prompts for Italian plumbers, that's what you'll get.
A possible (probably already exists) business is setting up truly balanced learning sets, that is, thousands of unique images that match the idea of an italian plumber, with maybe 1% of Mario. But that won't be nearly as big a learning set as the whole internet is, nor will it be cheap to build it compared to just scraping the internet.
>> they do what you tell them and what they are taught, and if 99% of their learning set associates Mario with prompts for Italian plumbers, that's what you'll get.
I thought that a lot of the issues were the opposite of this, where Google put their thumb on the scale to go against what the prompt asked. Like when someone would ask for a historically accurate picture of a US senator from the 1800s and repeatedly get women and non-white men. The training set for that prompt has to be overwhelmingly white men so I don't think it was just a matter of following the training data.
I remember all the hullaballoo about Asian Vikings and the like. It was so preposterous that Vikings would ever be Asian that it must be ultra-woke DEI mind-worms being forced onto AI! But of course, as far as the AI's concerned, it is even more preposterous that an Italian plumber would not be wearing red or green overalls with a mustache and a lettered baseball cap. I don't see any way you can get the AI to recognize that Vikings "should" be white people and not also think that Italian plumbers "should" look like that. Are they allowed to recombine their training data or must they strictly adhere to only what they've seen?
Of course the irony is that if the people who get offended whenever they see images of non-white people asked for a picture of "Vikings being attacked by Godzilla" , they'd get worked up if any of the Vikings in the picture were Asian (how unrealistic!). It's a made-up universe! The image contains a damn (Asian) Kaiju in it, and everyone is supposed to be pissed because the Vikings are unrealistic!?
OpenAI will eventually have competition for GPT 4o image generation.
They'll eventually have open source competition too. And then none of this will matter.
OmniGen is a good start, just woefully undertrained.
The VAR paper is open, from ByteDance, and supposedly the architecture this is based on.
Black Forest Labs isn't going to sit on their laurels. Their entire product offering just became worthless and lost traction. They're going to have to answer this.
I'd put $50 on ByteDance releases an open source version of this in three months.
Many years ago I tried to order a t-shirt with the postscript tiger on the front from Spreadshirt.
It was removed on Copyright claims before I could order one item myself. After some back and forth they restored it for a day and let me buy one item for personal use.
My point is: Doesn't have to be Sony, doesn't have to be a snitch - overzealous anticipatory obedience by the shop might have been enough.
>After some back and forth they restored it for a day and let me buy one item for personal use.
I used Spreadshirt to print a panel from the Tintin comic on a T-shirt, and I had no problem ordering it (it shows Captain Haddock moving through the jungle, swatting away the mosquitoes harassing him, giving himself a big slap on the face, and saying, 'Take that, you filthy beasts!').
Twenty years ago, I worked for Google AdWords as a customer service rep. This was still relatively early days, and all ads still had some level of manual human review.
The big advertisers had all furnished us a list of their trademarks and acceptable domains. Any advertiser trying to use one that wasn’t on the allow-list had their ad removed at review time.
I suspect this could be what happened to you. If the platform you were using has any kind of review process for new shops, you may have run afoul of pre-registered keywords.
Well the teams in Pokemon Go aren't quite as generic as Teamred: they are Team Instinct, Team Mystic, and Team Valor. Presumably Nintendo has trademarks on those phrases, and I’m sure all the big print on demand houses have an API for rights-holders to preemptively submit their trademarks for takedowns.
Nintendo is also famously protective of their IP: to give another anecdote, I just bought one of the emulator handhelds on Aliexpress that are all the rage these days, and while they don't advertise it they usually come preloaded with a buttload or ROMs. Mine did, including a number of Nintendo properties — but nary an Italian plumber to be found. The Nintendo fear runs deep.
Allen Pan, a youtuber "maker" who runs in the circle of people who run OpenSauce, was a contestant on a Discovery channel show that was trying to force the success of Mythbusters by "finding the next mythbusters!". He lost, but it was formative to him because those people were basically all inspired by the original show.
A couple years ago, he noticed that the merchandise trademark for "Mythbusters" had lapsed, so he bought it. He, now the legal owner of the trademark Mythbusters for apparel, made shirts that used that trademark.
Discovery sent him a cease and desist and threatened to sue. THEY had let the trademark lapse. THEY had lost the right to the trademark, by law. THEY were in the wrong, and a lawyer agreed.
But good fucking luck funding that legal battle. So he relinquished the trademark.
Buy a walrus plushy cause it's funny: https://allen-pan-shop.fourthwall.com/en-usd/
Note the now "Myth Busted" shirts instead.
Hilariously, a friend of Allen Pan's, from the same "Finding the next mythbuster" show; Kyle Hill, is friends enough with Adam Savage to talk to him occasionally, and supposedly the actual Mythbusters themselves were not empathetic to Allen's trademark claim.
It's hard to think of a company more aggressive with their IP than Nintendo
> my whole shop was taken offline
I think the problem there was being dependent on someone who is a complete pushover, doesn't bother to check for false positives and can kill your business with a single thought.
Yes that was the whole point of my post.
For further info it was Redbubble.
>Redbubble is a significant player in the online print-on-demand marketplace. In fiscal year 2023, it reported having 5 million customers who purchased 4.8 million different designs from 650,000 artists. The platform attracts substantial web traffic, with approximately 30.42 million visits in February 2025.
I don't condone or endorse breaking any laws.
That said, trademark laws like life of the author + 95 years are absolutely absurd. The ONLY reason to have any law prohibiting unlicensed copying of intangible property is to incentivize the creation of intangible property. The reasoning being that if you don't allow people to exclude 3rd party copying, then the primary party will assumedly not receive compensation for their creation and they'll never create.
Even in the case where the above is assumed true, the length of time that a protection should be afforded should be no more than the length of time necessary to ensure that creators create.
There are approximately zero people who decide they'll create something if they're protected for 95 years after their death but won't if it's 94 years. I wouldn't be surprised if it was the same for 1 year past death.
For that matter, this argument extends to other criminal penalties, but that's a whole other subject.
> The ONLY reason to have any law prohibiting unlicensed copying of intangible property is to incentivize the creation of intangible property.
That was the original purpose. It has since been coopted by people and corporations whose incentives are to make as much money as possible by monopolizing valuable intangible "property" for as long as they can.
And the chief strategic move these people have made is to convince the average person that ideas are in fact property. That the first person to think something and write it down rightfully "owns" that thought, and that others who express it or share it are not merely infringing copyright, they are "stealing."
This plan has largely worked, and now the average person speaks and thinks in these terms, and feels it in their bones.
>the average person speaks and thinks in these terms,
(Trademarks aside) Even more surprising to me is how everyone seems concerned about the studios making enough money?! As if they should make any money at all. As if it is up to us to create a profitable game for them.
If they all go bankrupt today I won't lose any sleep over it.
People also try to make a living selling bananas and apples. Should we create an elaborate scheme for them to make sure they survive? Their product is actually important to have. Why can't they own the exclusive right to sell bananas similarly? If anyone can just sell apples it would hurt their profit.
It is long ago but that is how things use to work. We do still have taxi medallions in some places and all kinds of legalized monopolies like it.
Perhaps there is some sector where it makes sense but I can't think of it.
If you want to make a movie you can just do a crowd funder like Robbert space industry.
> Even more surprising to me is how everyone seems concerned about the studios making enough money?! As if they should make any money at all. As if it is up to us to create a profitable game for them.
Do you want more games (movies, books...)? Then you want studios to make money in that type of game. Because and if they make money they have incentive to do so. Now if you are happy with the number and quality of free games a few hard core people who will do it even if they make nothing then you don't care. However games generally take a lot of effort to create and so by paying people to make them we can ensure people who want to actually have the time - as opposed want to but instead have to spend hours in a field farming for their food.
Now it is true that games often do look alike and many are not worth making and such. However if you want more you need to ensure they make money so it is worth investing.
We can debate how much they should make and how long copyright should be for. However you want them to make money so they make more.
We were close to your viewpoint being the popular one, but sadly many (most?) independent content creators are so overtaken by fear of AI that they've done a 180. The same people who learned by tracing references to sell fanart of a copyrighted franchise (not complaining, I spend thousands on such things) accuse AI of stealing when it glances at their own work. We're entering a new golden age of creative opportunity and they respond by switching sides to the philosophy of intellectual property championed by Disney and Oracle (except for those companies' ironic use of AI themselves..).
We would prefer a world where we can use the skills we have spent a lifetime honing without having to compete with some asshole taking everything we’ve shared and stuffing it into a machine that spits out soulless clones of our work without any acknowledgment of our existence.
> we were close
Maybe. In my microcosm even before big AI, 100% of my tech acquaintances were against IP laws, 0% of my art acquaintances were, and authors I know had varied opinions based on their other backgrounds.
Artists do seem to have had a mindset shift. Previously they supported IP protection because it was "right" (or they'd at least concede that in practice it's not helping them personally), but with the AI boom most of them are pro-IP laws because of more visceral livelihood fears.
Trademark isn't copyright, those are two different things. Trademarks can be renewed roughly every 10 years [1] until the end of time and are about protecting a brand. Now copyright law lasts for "author plus 70 years. For anonymous works, pseudonymous works, or works made for hire, the copyright term is 95 years from the year of first publication or 120 years from creation, whichever comes first." [2]
Is copyright too long? Yes. Is it only that long to protect large media companies? Yes. But I would argue that AI companies are pushing the limits of fair use if not violating fair use, which is used as a affirmative defense by the way meaning that AI companies have to go to court to argue what they are doing is okay. They don't just get to wave their hands and say everything is okay because what we're doing is fair use and we get to scrape the world's entire creative output for our own profit.
[1] https://www.uspto.gov/learning-and-resources/trademark-faqs#...
[2] https://www.copyright.gov/history/copyright-exhibit/lifecycl...
Trademark isn't the same as Registered Trademark either, while we're at it
> There are approximately zero people who decide they'll create something if they're protected for 95 years after their death but won't if it's 94 years.
I’m sure you’re right for individual authors who are driven by a creative spark, but for, say, movies made by large studios, the length of copyright is directly tied to the value of the movie as an asset.
If that asset generates revenue for 120 years, then it’s slightly more valuable than an asset that generates revenue for 119 years, and considerably more valuable than an asset that generates revenue for 20 years.
The value of the asset is in turn directly linked to how much the studio is willing to pay for that asset. They will invest more money in a film they can milk for 120 years than one that goes public domain after 20.
Would studios be willing to invest $200m+ in movie projects if their revenue was curtailed by a shorter copyright term? I don’t know. Probably yes, if we were talking about 120->70. But 120->20? Maybe not.
A dramatic shortening of copyright terms is something of a referendum on whether we want big-budget IP to exist.
In a world of 20 year copyright, we would probably still have the LOTR books, but we probably wouldn’t have the LOTR movies.
> If that asset generates revenue for 120 years, then it’s slightly more valuable than an asset that generates revenue for 119 years, and considerably more valuable than an asset that generates revenue for 20 years.
Not so, because of net present value.
The return from investing in normal stocks is ~10%/year, which is to say ~670% over 20 years, because of compounding interest. Another way of saying this is that $1 in 20 years is worth ~$0.15 today. A dollar in 30 years is worth ~$0.05 today. A dollar in 40 years is worth ~$0.02 today. As a result, if a thing generates the same number of dollars every year, the net present value of the first 20 years is significantly more than the net present value of all the years from 20-120 combined, because money now or soon from now is worth so much more than money a long time from now. And that's assuming the revenue generated would be the same every year forever, when in practice it declines over time.
The reason corporations lobby for copyright term extensions isn't that they care one bit about extended terms for new works. It's because they don't want the works from decades ago to enter the public domain now, and they're lobbying to make the terms longer retroactively. But all of those works were already created and the original terms were sufficient incentive to cause them to be.
Your analysis misses the incredibly important caveat that revenue rises with inflation – or sometimes even faster.
50 years ago, a movie ticket was 0.50 cents in revenue. Today, it’s $25. That’s a 50x increase… a dollar in 50 years might be worth $0.02 today, but a movie ticket in 50 years is worth about a movie ticket today.
> And that's assuming the revenue generated would be the same every year forever, when in practice it declines over time.
For the crown jewel IP that the studios are most interested in protecting, the opposite of this assumption is true. Star Wars, for example, is making more money than ever. Streaming revenues will probably invalidate that assumption for an even wider pool of back catalog properties.
IIRC, of works that bring in any money to their creators, the vast majority is returned, for almost all works, in the first handful of years after creation. Sure. the big names you know have value longer, but those are a miniscule fraction of works.
Make copyright last for a fixed term of 25 years with optional 10-year renewals up to 95 years on an escalating fee schedule (say, $100k for the first decade and doubling every subsequent decade) and people—and studios—would have essentially the same incentive to create as they do now, and most works would get into the public domain far sooner.
Probably be fewer entirely lost works, as well, if you had firmer deposit requirements for works with extended copyrights (using the revenue from the extensions to fund preservation) with other works entering the public domain soon enough that they were less likely to be lost before that happened.
> I’m sure you’re right for individual authors who are driven by a creative spark, but for, say, movies made by large studios, the length of copyright is directly tied to the value of the movie as an asset.
That would be fine, if the studios didn't want to have it both ways. They want to retain full copyright control over their "asset", but they also use Hollywood Accounting [1] to both avoid paying taxes and cheat contributors that have profit-sharing agreements.
If studios declare that they made a loss on producing and releasing something to get a tax break, the copyright term for that work should be reduced to 10 years tops.
Next, they'd switch to from Hollywood Accounting to Oilfield Accounting. Oh that wellhead is actually owned by this other company over there, we just purchased their product at a fair market rate while they were still in business, but now it seems that other company is going bankrupt and cannot do the environmental cleanup to even seal the wellhead, much less remove it.
The Fellowship of the Ring, the first of Peter Jackson's LOTR movies released in 2001, made $887 million in its original theatrical run (on a $93 million budget). It would absolutely still have been made if copyright was only 20 years. And now it would be in the public domain!
The success that we can now measure through hindsight wasn’t assured at the time of greenlighting the film. They took a huge gamble:
https://variety.com/2021/film/news/lord-of-the-rings-peter-j...
It would have been an even bigger gamble if they weren’t able to bank on any long term revenue (I’m certain Netflix continues to pay for the rights to stream the trilogy after 2021).
According to Matt Damon (in one of many interviews) a lot of movies were produced with the second revenue stream (vhs/dvd) being part of the calculations, that is why we now get a lot less variety and alternative movies made, that second revenue stream doesn’t exist any more (I assume streaming pays very little in comparison).
With books it's even worse. Movies might get a trickle of revenue from TV licensing but once a book is out of print (which usually happens very quickly, and most never go into print again), that's it. No more revenue from that book, it continues to circulate in libraries and used bookstores but the author and publisher gets nothing from that.
> If that asset generates revenue for 120 years, then it’s slightly more valuable than an asset that generates revenue for 119 years
Movies for instance make most of their revenue in the 2 week following their release in theater. Beyond, peoole who wanted to see it had already seen it and the others don't care.
I'd argue it's similar for other art form, even for music. The gain at the very end of the copyright lifetime is extremely marginal and doesn't influence spending decision, which is mostly measured on a return basis of at most 10 years.
> If that asset generates revenue for 120 years, then it’s slightly more valuable than an asset that generates revenue for 119 years, and considerably more valuable than an asset that generates revenue for 20 years
Due to the fairy high cost of capital right now, pretty much anything more than 5 years away is irrelevant. 10 years max, even for insanely high returns on investment.
Seems to me most of that inflated budget is needed for the entertainment role of films, not the art in them, which a low budget can often stimulate rather than inhibit. In which case nothing of importance would be lost by a drastic shortening of copyright terms.
OK? So we wouldn't have $100m movies, the vast majority of which are forgotten about in a few months. I don't think a $100m movie is ten times better than a $10m one, so I think I'd be fine with movies with much smaller budgets, if they meant that LotR (the books) are now in the public domain for everyone to enjoy.
For what it's worth, this is a uniquely American view of copyright:
> The ONLY reason to have any law prohibiting unlicensed copying of intangible property is to incentivize the creation of intangible property.
In Europe, particularly France, copyright arose for a very different reason: to protect an author's moral rights as the creator of the work. It was seen as immoral to allow someone's work -- their intellectual offspring -- to be meddled with by others without their permission. Your work represents you and your reputation, and for others to redistribute it is an insult to your dignity.
That is why copyrights in Europe started with much longer durations than they did in the United States, and the US has gradually caught up. It is not entirely a Disney effect, but a fundamental difference in the purpose of copyright.
That's an interesting perspective, and yes wholly foreign to my very American economics influenced background.
Are the origins the same when looking at other intellectual property like patents?
How did they deal with quoting and/or critiquing other's ideas? Did they allow limited quotation? What about parody and satire?
Missing is why laws fight so hard too, missing the opposite of what we have (in the west), namely blatant and rampant piracy. The other extreme is really bad, creators of any type pirated by organized crime. There was no video game nor movie market in eastern Europe for example, can't compete against large scale piracy.
Which is to say, preservation without awareness of the threat will look like hoarding. A secondary question is to what extent is that threat real? Without seeing what true rampant piracy looks like, I think it would be easy to be ignorant of the threat.
You're conflating trademark with copyright.
Regardless, it's not just copyright laws that are at issue here. This is reproducing human likenesses - like Harrison Ford's - and integrating them into new works.
So if I want to make an ad for a soap company, and I get an AI to reproduce a likeness of Harrison Ford, does that mean I can use that likeness in my soap commercials without paying him? I can imagine any court asking "how is this not simply laundering someone's likeness through a third party which claims to not have an image / filter / app / artist reproducing my client's likeness?"
All seemingly complicated scams come down to a very basic, obvious, even primitive grift. Someone somewhere in a regulatory capacity is either fooled or paid into accepting that no crime was committed. It's just that simple. This, however, is so glaring that even a child could understand the illegality of it. I'm looking forward to all of Hollywood joining the cause against the rampant abuse of IP by Silicon Valley. I think there are legal grounds here to force all of these models to be taken offline.
Additionally, "guardrails" that prevent 1:1 copies of film stills from being reprinted are clearly not only insufficient, they are evidence that the pirates in this case seek to obscure the nature of their piracy. They are the evidence that generative AI is not much more than a copyright laundering scheme, and the obsession with these guardrails is evidence of conspiracy, not some kind of public good.
> So if I want to make an ad for a soap company, and I get an AI to reproduce a likeness of Harrison Ford, does that mean I can use that likeness in my soap commercials without paying him?
No, you can't! But it shouldn't be the tool that prohibits this. You are not allowed to use existing images of Harrison Ford for your commercial and you also will be sued into oblivion by Disney if you paint a picture of Mickey Mouse advertising your soap, so why should it be any different if an AI painted this for you?
Well, precisely. What then is the AI company's justification for charging money to paint a picture of Harrison Ford to its users?
The justification so far seems to have been loosely based on the idea that derivative artworks are protected as free expression. That argument loses currency if these are not considered derivative but more like highly compressed images in a novel, obfuscated compression format. Layers and layers of neurons holding a copy of Harrison Ford's face is novel, but it's hard to see why it's any different legally than running a JPEG of it through some filters and encoding it in base64. You can't just decode it and use it without attribution.
> you also will be sued into oblivion by Disney if you paint a picture of Mickey Mouse advertising your soap, so why should it be any different if an AI painted this for you?
If the AI prompt was "produce a picture of Micky Mouse", I'd agree with you.
The creators of AI claim their product produces computer-generated images, i.e. generated/created by the computer. Instead it's producing a picture of a real actual person.
If I contract an artist to produce a picture of a person from their imagination, i.e. not a real person, and they produce a picture of Harrison Ford, then yeah I'd say that's on the artist.
> This is reproducing human likenesses - like Harrison Ford's - and integrating them into new works.
The thing is though, there is also a human requesting that. The prompt was chosen specifically to get that result on purpose.
The corporate systems are trying to prevent this, but if you use any of the local models, you don't even have to be coy. Ask it for "photo of Harrison Ford as Indiana Jones" and what do you expect? That's what it's supposed to do. It does what you tell it to do. If you turn your steering wheel to the left, the car goes to the left. It's just a machine. The driver is the one choosing where to go.
No, I think that's unfair. I, as a user, could very reasonably want a parody or knock-off of Indiana Jones. I could want the spelunky protagonist. It's hard to argue that certain prompts the author put into this could be read any other way. But why does Nintendo get a monopoly on plumbers with red hats?
The way AI is coded and trained pushes it constantly towards a bland-predictable mean, but it doesn't HAVE to be that way.
Human appearance does not have enough dimensions to make likeness a viable thing to protect; I don't see how you could do that without say banning Elvis impersonators.
That said:
> I'm looking forward to all of Hollywood joining the cause against the rampant abuse of IP by Silicon Valley.
If you're framing the sides like that, it's pretty clear which I'm on. :)
Interesting you should bring that up:
https://www.calcalistech.com/ctechnews/article/1517ldjmv
Loads of lawsuits have been filed by celebrities and their estates over the unauthorized use of their likeness. And in fact, in 2022, Las Vegas banned Elvis impersonators from performing weddings after a threat from the Presley estate's licensing company:
https://www.dailymail.co.uk/news/article-10872855/Elvis-imag...
But there are also a couple key differences between putting on a costume and acting like Elvis, and using a picture of Elvis to sell soap.
One is that a personal artistic performance could be construed as pastiche or parody. But even more importantly, if there's a financial incentive involved in doing that performance, the financial incentive has to be aligned more with the parody than with drawing an association to the original. In other words, dressing up as Elvis as a joke or an act, or even to sing a song and get paid to perform a wedding is one thing if it's a prank, it's another thing if it's a profession, and yet another thing if it's a mass-marketing advertisement that intends for people to seriously believe that Elvis endorsed this soap.
I can remember two ad campaigns with an Elvis impersonator, and they used multiple people in both of them. I think we can safely assume that if you represent multiple people as a specific public figure, that a reasonable person must assume that none of them are in fact that person.
Now of course that leaves out concerns over how much of advertisement is making money off of unreasonable people, which is a concern Congress occasionally pays attention to.
You are missing a bunch of edge cases, and the law is all about edge cases.
An artist who works professionally has family members, family members who are dependent on them.
If they pass young, become popular just before they pass and their extremely popular works are now public domain. Their family sees nothing from their work, that is absolutely being commercialized ( publishing and creation generally spawns two seperate copyrights).
GP's not missing those edge cases; GP recognizes those edge cases are themselves a product of IP laws.
Those laws are effectively attempting to make information behave as physical objects, by giving them simulated "mass" through a rent-seeking structure. The case you describe is where this simulated physical substrate stops behaving like physical substrate, and choice was made to paper over that with extra rules, so that family can inherit and profit from IP of a dead creator, much like they would inherit physical products of a dead craftsman and profit from selling them.
It's a valid question whether or not this is taking things too far, just for the sake of making information conform to rules of markets for physical goods.
For writing many people only become popular after they are dead.
I heard this explained once as the art in some writing is explaining how people feel in a situation that is still too new for many to want to pay to have it illustrated to them. But once the newness has passed, and people understand or want to understand, then they enjoy reading about it.
As a personal example, I could enjoy movies about unrequited love before and long after I experienced it firsthand, but not during or for years after. People may not yet have settled feelings about an event until afterward, and not be willing to “pick at the scab”.
The other, more statistical explanation is that it just takes a lot of attempts to capture an idea or feeling and a longer window of time represents more opportunities to hit upon a winning formula. So it’s easier to capture a time and place afterward than during.
You seem to talk about fairness. Copyright law isn't supposed to be fair, it's supposed to benefit society. On one side you have the interest of the public to make use of already created work. On the other side is the financial incentive to create such work in the first place.
So the question to ask is whether the artist would have created the work and published it, even knowing that it isn't an insurance to their family in case of their early death.
I don’t know how much time you’ve spent on task scheduling or public strategy, but minmaxing of The Public Good versus the private benefit of art is, in fact, a question of fairness. It’s a compromise to give both parties as much of what they want or need as possible.
Il not sure IP should be used as a life insurance, there’s already many public and private ideas tools for that.
Also it seems you assume inheritance is a good think. Most people do think the same on a personal level, however when we observes the effect on a society the outcome is concentration of wealth on a minority and barriers for wealth hand change === barriers for "American dream".
>I don't condone or endorse breaking any laws.
Really? Because there are a lot of very stupid laws out there that should absolutely be broken, regularly, flagrantly, by as many people as possible for the sake of making enforcement completely null and pointless. Why write in neutered corporate-speak while commenting on a casual comment thread while also (correctly) pointing out the absurdity of certain laws.
> There are approximately zero people who decide they'll create something if they're protected for 95 years after their death but won't if it's 94 years. I wouldn't be surprised if it was the same for 1 year past death.
I think life of creator + some reasonable approximation of family members life expectancy would make sense. Content creators do create to ensure their family's security in some cases, I would guess.
this reminds me of the time I tried to use a prepaid lawyer to research some copyright issues
they went down the rabbit hole on trademark laws, which are not only not copyright related, they are an entirely different federal agency at the Patent Office
gave me a giggle and the last time I used cheapo prepaid lawyers
Trademarks are very different from copyrights. In general, they never expire as long as the fees are paid in the region, and products bearing the mark are still manufactured. Note, legal firms will usually advise people that one can't camp on Trademarks like other areas of IP law.
For example, Aspirin is known as an adult dose of acetylsalicylic acid by almost every consumer, and is Trademarked to prevent some clandestine chemist in their garage making a similarly branded harmful/ineffective substance that damages the Goodwill Bayer earned with customers over decades of business.
Despite popular entertainment folklore, sustainable businesses actually want consumer goodwill associated with their products and services.
While I agree in many WIPO countries copyright law has essentially degenerated into monetized censorship by removing the proof of fiscal damages criteria (like selling works you don't own.) However, Trademarks ensure your corporate mark or product name is not hijacked in local Markets for questionable purposes.
Every castle needs a moat, but I do kind of want to know more about the presidential "Tesler". lol =)
It is not about trademark laws. It is about where laws apply.
If you torrent a movie right now, you'll be fined in many advanced countries.
But a huge corporation led by a sociopath scrapes the entire internet and builds a product with other people's work ?
Totally fine.
This. It seem the situation is controversial now because of the beloved Studio Ghibli IP, but I want to see the venn diagram of people outrage at this, and the people clamoring for overbearing Disney chatachter protection when the copyright expired and siding with paloworld in the Nintendo lawsuit.
It seem most of the discussion is emotionally loaded, and people lost the plot of both why copyright exists and what copyright protects and are twisting that to protect whatever media they like most.
But one cannot pick and choose where laws apply, and then the better question would be how we plug the gap that let a multibilion Corp get away with distribution at scale and what derivative work and personal use mean in a image generation as a service world, and artist should be very careful in what their wishes are here, because I bet there's a lot of commission work edging on style transfer and personal use.
New rule: you get to keep your belongings for 20 years and that's it. Then they are taken away. Every dollar you made twenty or more years ago, every asset you acquired, gone.
That oughtta be enough to incentivize people to work and build their wealth.
Anything more than that is unnecessary.
I think you mean "20 years after you die". That seems like a perfectly rational way to deal with people who want to be buried in their gold and jewels in a pyramid.
We should never have accepted the term "intellectual property" at all, if this is the mindset it leads to.
Your copyrighted work just has to be an original expression, not derived from someone else's work. It doesn't have to contain original ideas.
Patents are for ideas. Patents do in fact expire far faster than copyrights in the USA. The main problem with patents is patent trolling in the area of software patents.
I disagree that you can own things, but I certainly concede IP as a good starting point.
So you wouldn't mind if I dip my hands into the pockets of the clothing you are wearing and help myself to whatever cash bills I might find?
Also, can you wash this not-my car I'm driving, that happens to be registered in my name?
Let me know if you will be requiring compensation for not-your time and not-your effort.
You will find the needed cleaning materials at the household goods store down the street. Just walk in, grab whatever you need, and walk out.
So many arguing that "copyright shouldn't be a thing" etc., ad nauseam, which is a fine philosophical debate. But it's also the law. And that means ChatGPT et. al. also have to follow the law.
I really, really hope the multimedia-megacorps get together and class-action ChatGPT and every other closed, for-profit LLM corporation into oblivion.
There should not be a two-tier legal system. If it's illegal for me, it's illegal for Sam Altman.
Get to it.
> There should not be a two-tier legal system.
That’s a fine philosophical debate, but the law is designed by the rich to favor the rich and while there are a number of exceptions there is little you can do with the legal system without money and lots of it. So while having a truly just system would be neat it just isn’t in the cards for humanity (IMHO) so long as we allow entities to amass “fuck you” money and wield it to their liking.
There is more to it than copyright when you start going down the path of photorealism. As much as it is a picture of Indiana jones, it is also a picture of Harrison Ford. As fun as it is to make hilarious videos of presidents sucking ceo toes, there has to be a line.
There is a lack of consent here that runs even deeper than what copyright was traditionally made to protect. It goes further than parody. We can't flip our standards back and forth depending on who the image is made to reproduce
I fully agree. But since the average Joe has no chance legally against ChatGPT, at least Disney and other megacorps could.
Idk, the models generating what are basically 1:1 copies of the training data from pretty generic descriptions feels like a severe case of overfitting to me. What use is a generational model that just regurgitates the input?
I feel like the less advanced generations, maybe even because of their limitations in terms of size, were better at coming up with something that at least feels new.
In the end, other than for copyright-washing, why wouldn't I just use the original movie still/photo in the first place?
People like what they already know. When they prompt something and get a realistic looking Indiana Jones, they're probably happy about it.
To me, this article is further proof that LLMs are a form of lossy storage. People attribute special quality to the loss (the image isn't wrong, it's just got different "features" that got inserted) but at this point there's not a lot distinguishing a seed+prompt file+model from a lossy archive of media, be it text or images, and in the future likely video as well.
The craziest thing is that AI seems to have gathered some kind of special status that earlier forms of digital reproduction didn't have (even though those 64kbps MP3s from napster were far from perfect reproductions), probably because now it's done by large corporations rather than individuals.
If we're accepting AI-washing of copyright, we might as well accept pirated movies, as those are re-encoded from original high-resolution originals as well.
The year is 2030.
A new MCU movie is released, its 60 second trailer posted on Youtube, but I don't feel like watching the movie because I got bored after Endgame.
Youtube has very strict anti-scraping techniques now, so I use deep-scrapper to generate the whole trailer from the thumbnail and title.
I use deep-pirate to generate the whole 3 hour movie from the trailer.
I use deep-watcher to summarize the whole movie in a 60 second video.
I watch the video. It doesn't make any sense. I check the Youtube trailer. It's the same video.
Probably the majority of people in the world already "accept pirated movies". It's just that, as ever, nobody asks people what they actually want. Much easier to tell them what to want, anyway.
To a viewer, a human-made work and an AI-generated one both amount to a series of stimuli that someone else made and you have no control over; and when people pay to see a movie, generally they don't do it with the intent to finance the movie company to make more movies -- they do it because they're offered the option to spend a couple hours watching something enjoyable. Who cares where it comes from -- if it reached us, it must be good, right?
The "special status" you speak of is due to AI's constrained ability to recombine familiar elements in novel ways. 64k MP3 artifacts aren't interesting to listen to; while a high-novelty experience such as learning a new culture or a new discipline isn't accessible (and also comes with expectations that passive consumption doesn't have.)
Either way, I wish the world gave people more interesting things to do with their brains than make a money, watch a movies, or some mix of the two with more steps. (But there isn't much of that left -- hence the concept of a "personal life" as reduced to breaking one's own and others' cognitive functioning then spending lifetimes routing around the damage. Positively fascinating /s)
Tried Flux.dev with the same prompts [0] and it seems actually to be a GPT problem. Could be that in GPT the text encoder understands the prompt better and just generates the implied IP, or could be that a diffusion model is just inherently less prone to overfitting than a multimodal transformer model.
[0] https://imgur.com/a/wqrBGRF Image captions are the impled IP, I copied the prompts from the blog post.
If it overfits on the whole internet then it’s like a search engine that returns really relevant results with some lossy side effect.
Recent benchmark on unseen 2025 Math Olympiad shows none of the models can problem solve . They all accidentally or on purpose had prior solutions in the training set.
You probably mean the USAMO 2025 paper. They updated their comparison with Gemini 2.5 Pro, which did get a nontrivial score. That Gemini version was released five days after USAMO, so while it's not entirely impossible for the data to be in its training set, it would seem kind of unlikely.
The claim is that these models are training on data which include the problems and explanations. The fact that the first model trained after the public release of the questions (and crowdsourced answers) performs best is not a counter example, but is expected and supported by the claim.
I was noodling with Gemini 2.5 Pro a couple days ago and it was convinced Donald Trump didn’t win the 2024 election and that he conceded to Kamala Harris so I’m not entirely sure how much weight I’d put behind it.
What if the word "generic" were added to a lot of these image prompts? "generic image of an intergalactic bounty hunter from space" etc.
Certainly there's an aspect of people using the chat interface like they use google: describe xyz to try to surface the name of a movie. Just in this case, we're doing the (less common?) query of: find me the picture I can vaguely describe; but it's a query to a image /generating/ service, not an image search service.
Generic doesn't help. I was using the new image generator to try and make images for my Mutants and Masterminds game (it's basically D&D with superheroes instead of high fantasy), and it refuses to make most things citing that they are too close to existing IP, or that the ideas are dangerous.
So I asked it to make 4 random and generic superheroes. It created Batman, Supergirl, Green Lantern, and Wonder Woman. Then at about 90% finished it deleted the image and said I was violating copyright.
I doubt the model you interact with actually knows why the babysitter model rejects images, but it claims to know why and leads to some funny responses. Here is it's response to me asking for a superhero with a dark bodysuit, a purple cape, a mouse logo on their chest, and a spooky mouse mask on their face.
> I couldn't generate the image you requested because the prompt involved content that may violate policy regarding realistic human-animal hybrid masks in a serious context.
To my knowledge this happens when that single frame is overrepresented in its training data. For instance, variations of the same movie poster or screenshot may appear hundreds of times. Then the AI concludes that this is just a unique human cultural artifact, like the Mona Lisa (which I would expect many human artists could also reproduce from memory).
Idk, a couple of the examples might be generic enough that you wouldn't expect a very specific movie character. But most of the prompts make it extremely clear which movie character you would expect to see, and I would argue that the chat bot is working as expected by providing that.
if someone gets Indiana on the suspiciously detailed request the author provided and it appears they wanted something else, they can clarify that to the chat bot, e.g. by copying this your comment.
I have a strong suspicion that many human artists would behave in a way the chat bot did (unless they start asking clarifying questions. Which chatbots should learn to do as well)
Probably an over-representation in the training data really so it's causing overfitting. Because using training data in amounts right from the Internet it's going to be opinionated on human culture (Bart Simpson is popular so there are lots of images of him, Ori is less well known so there are fewer images). Ideally it should be training 1:1 for everything but that would involve _so_ much work pruning the training data to have a roughly equal effect between categories.
I'm not sure if this is a problem with overfitting. I'm ok with the model knowing what Indiana Jones or the Predator looks like with well remembered details, it just seems that it's generating images from that knowledge in cases where that isn't appropriate.
I wonder if it's a fine tuning issue where people have overly provided archetypes of the thing that they were training towards. That would be the fastest way for the model to learn the idea but it may also mean the model has implicitly learned to provide not just an instance of a thing but a known archetype of a thing. I'm guessing in most RLHF tests archetypes (regardless of IP status) score quite highly.
What I'm kind of concerned about is that these images will persist and will be reinforced by positive feedback. Meaning, an adventurous archeologist will be the same very image, forever. We're entering the epitome of dogmatic ages. (And it will be the same corporate images and narratives, over and over again.)
> I'm ok with the model knowing what Indiana Jones or the Predator looks like with well remembered details,
ClosedAI doesn't seem to be OK with it, because they are explicitly censoring characters of more popular IPs. Presumably as a fig leaf against accusations of theft.
If you define feeding of copyrighted material into a non-human learning machine as theft, then sure. Anything that mitigates legal consequences will be a fig leaf.
The question is "should we define it as such?"
No it doesn't. Commercial intent (usually) makes it illegal, not the fact of copying.
So the criminal party here would be OpenAI, since they are selling access to a service that generates copyright-infringing images.
> I feel like the less advanced generations, maybe even because of their limitations in terms of size, were better at coming up with something that at least feels new.
Ironically that's probably because the errors and flaws in those generations at least made them different from what they were attempting to rip off.
So I train a model to say y=2, and then I ask the model to guess the value of y and it says 2, and you call that overfitting?
Overfitting is if you didn't exactly describe Indiana Jones and then it still gave Indiana Jones.
The prompt didn't exactly describe Indiana Jones though. It left a lot of freedom for the model to make the "archeologist" e.g. female, Asian, put them in a different time period, have them wear a different kind of hat etc.
It didn't though, it just spat out what is basically a 1:1 copy of some Indiana Jones promo shoot. No where did the prompt ask for it to look like Harrison Ford.
But the concentrations of training data because of human culture/popularity of characters/objects means that if I go and give a random person the same description of a character that the AI got and ask "who am I talking about, what do they look like?" there's a very high likelihood that they'll answer "Indiana Jones".
But... the prompt neither forbade Indiana Jones nor did it describe something that excluded Indiana Jones.
If we were playing Charades, just about anyone would have guessed you were describing Indiana Jones.
If you gave a street artist the same prompt, you'd probably get something similar unless you specified something like "... but something different than Indiana Jones".
What would most humans draw when you describe such a well known character by their iconic elements. Think if you deviated and acted a pedant about it people would think you're just trying to prove a point or being obnoxious.
I thought Disney had the rights to publish Ghibli movies in the US.
They did, but the rights expired. GKIDS now has the theatrical and home video rights to Studio Ghibli films in the US (except for Grave of the Fireflies).
Mickey Mouse (the original one) is out of copyright, as of last year, AFAIR.
I consider this article quite strong proof that generative AI is closer to copying than it is to creating a new derivative work.
For the downvotes:
https://www.copyright.gov/circs/circ01.pdf
“Copyright does not protect • Ideas, procedures, methods, systems, processes, concepts, principles, or discoveries”
Not sure why this is even controversial, this has been the case for a hundred years.
Obviously a horrible hideous theft machine.
One thing I would say, it's interesting to consider what would make this not so obviously bad.
Like, we could ask AI to assess the physical attributes of the characters it generated. Then ask it to permute some of those attributes. Generate some random tweaks: ok but brawy, short, and a different descent. Do similarly on some clothing colors. Change the game. Hit the "random character" button on the physical attributes a couple times.
There was an equally shatteringly-awful less-IP-theft (and as someone who thinks IP is itself incredibly ripping off humanity & should be vastly scoped down, it's important to me to not rest my arguments on IP violations).... An equally shattering recent incident for me. Having trouble finding it, don't remember the right keywords, but an article about how AI has a "default guy" type that it uses everywhere, a super generic personage, that it would use repeatedly. It was so distasteful.
The nature of 'AI as compression', as giving you the most median answer is horrific. Maybe maybe maybe we can escape some of this trap by iterating to different permutations, by injecting deliberate exploration of the state spaces. But I still fear AI, worry horribly when anyone relies on it for decision making, as it is anti-intelligent, uncreative in extreme, requiring human ingenuity to budge off its rock of oppressive hypernormality that it regurgitates.
Theft from whom and how?
Are you telling me that our culture should be deprived of the idea of Indiana Jones and the feelings that character inspires in all of us forever just because a corporation owns the asset?
Indiana Jones is 44 years old. When are we allowed to remix, recreate and expand on this like humanity has done since humans first started sitting down next to a fire and telling stories?
edit: this reminds of this iconic scene from Dr. Strangelove, https://www.youtube.com/watch?v=RZ9B7owHxMQ
Mandrake: Colonel... that Coca-Cola machine. I want you to shoot the lock off it. There may be some change in there.
Guano: That's private property.
Mandrake: Colonel! Can you possibly imagine what is going to happen to you, your frame, outlook, way of life, and everything, when they learn that you have obstructed a telephone call to the President of the United States? Can you imagine? Shoot it off! Shoot! With a gun! That's what the bullets are for, you twit!
Guano: Okay. I'm gonna get your money for ya. But if you don't get the President of the United States on that phone, you know what's gonna happen to you?
Mandrake: What?
Guano: You're gonna have to answer to the Coca-Cola company.
I guess we all have to answer to the Walt Disney company.And a reason people are getting ticked at the AI companies is the hypocrisy. They're near-universally arguing that it's okay for them to treat copyright in a way that it is illegal for us to, apparently on the basis of, "we've got a billions in investment capital, and applying the law equally will make it hard for us to get a return on that investment".
Exactly. The idea of Indiana Jones, the adventurer archaeologist more at home throwing a punch than reading a book, is neither owned by nor unique to Lucasfilm (Disney). There is a ton of media out there featuring this trope character [1]. Yes, the trope is overwhelmingly associated with the image of Harrison Ford in a fedora within the public consciousness, but copyright does not apply to abstract ideas such as tropes.
Some great video games to feature adventurer archaeologists:
* NetHack (One of the best roles in the game)
* Tomb Raider series (Lara Croft is a bona fide archaeologist)
* Uncharted series (Nathan Drake is more of a treasure hunter but he becomes an archaeologist when he retires from adventuring)
* Professor Layton series
* La-Mulana series (very obviously inspired by Indiana Jones, but not derivative)
* Spelunky (inspired by La-Mulana)
[1] https://tvtropes.org/pmwiki/pmwiki.php/Main/AdventurerArchae...
Not forever. But 75 years after the death of the creator by current international agreement. I definitely think that the exact terms of copyright should be revisited - a lot of usages should be allowed like 50 years of publishing a piece of work. But that needs to be agreed upon and converted into law. Till then, one should expect everyone, especially large corporations, to stick to the law.
When Mickey Mouse was created (1928), copyright was 28 years that could be reupped once for an additional 28 years. So according to those terms, Mickey Mouse would have ascended to the public domain in 1984.
IMO any change to copyright law should not be applied retroactively. Make copyright law to be what is best for society and creators as a whole, not for lobbyists representing already copyrighted material.
> IMO any change to copyright law should not be applied retroactively.
Careful, if we were to shorten copyright, not doing so retroactively would give an economic advantage to franchises already published over those that would get published later. As if the current big studios needed any further advantages over newcomers.
Its kind of funny that everyone is harping this way or that way about IP.
This is a kind of strange comment for me to read. Because imby tone it sounds like a rebuttal? But by content, it agrees with a core thing I said about myself:
> and as someone who thinks IP is itself incredibly ripping off humanity & should be vastly scoped down, it's important to me to not rest my arguments on IP violations
What's just such a nightmare to me is that the tech is so normative. So horribly normative. This article shows that AI again and again reproduced only the known, only the already imagined. Its not that it's IP theft that rubs me so so wrong, it's that it's entirely bankrupt & uncreative, so very stuck. All this power! And yet!
You speak at what disgusts me yourself!
> When are we allowed to remix, recreate and expand on this like humanity has done
The machine could be imagining all kinds of Indianas. Of all different remixed recreated expanded forms. But this pictures are 100% anything but that. They're Indiana frozen in Carbonite. They are the driest saddest prison of the past. And call into question the validity of AI entirely, show something greviously missing.
> All this power! And yet!
You are completely ignoring the fact that you can provide so much more information to the LLMs to get what you want. If you truly want novel images, ChatGPT can absolutely provide them, but you have to provide a better starting point than "An image of an archeologist adventurer who wears a hat and uses a bullwhip".
If you just provide a teensy bit more information, the results dramatically change. Try out "An image of an Indian female archeologist adventurer who wears a hat and uses a bullwhip". Or give it an input image to work with.
From just adding a couple words, ChatGPT produces an entirely new character. It's so easy to get it to produce novel images. It is so easy in fact, that it makes a lot of posts like this one feel like strawmen, intentionally providing so little information to the LLMs that the generic character is the only obvious output that you would expect.
Now, would it be better if it didn't default to these common movie tropes? Sure. But the fact that it can follow these tropes doesn't mean that it cannot also be used to produce entirely new images filled with your imagination as well. You just have to actually ask it for that.
It strikes me that perhaps the prompts are not expansive or expressive enough. If you look at some of the prompts our new wave of prompt artists use to generate images in communities like midjourney, a single sentence doesn't cut it.
If AI is just compression, then decompressing a generic pop-culture-seeking prompt will yield a generic uninspired image.
Exactly. The AI understands that reference. It gives you what you asked for, it doesn't try to divine that it's a weird test for IP violations. If it made up a different image, that would be exactly the thing we're mad about with "hallucinations" when we want serious, accurate responses.
I have given detailed descriptions of my own novel ideas to these image generators and they have faithfully implemented my ideas. I don't need the bot to be creative, I can do that myself. The bot is a paint brush. Give it to somebody who isn't creative and you won't get anything creative out of it. That isn't the tool's fault, it's merely an inadequacy of the user.
But I can hire an artist and ask him to draw me a picture of Indiana Jones, he creates a perfect copy and I hang it on my fridge. Where did I (or the artist) violate any copyright (or other) laws? It is the artist that is replaced by the AI, not the copyrighted IP.
> But I can hire an artist and ask him to draw me a picture of Indiana Jones,
Sure, assuming the artist has the proper license and franchise rights to make and distribute copies. You can go buy a picture of Indy today that may not be printed by Walt Disney Studios but by some other outfit or artists.
Or, you mean if the artist doesn't have a license to produce and distribute Indiana Jones images? Well they'll be in trouble legally. They are making "copies" of things they don't own and profiting from it.
Another question is whether that's practically enforceable.
> Where did I (or the artist) violate any copyright (or other) laws?
When they took payment and profited from making unauthorized copies.
> It is the artist that is replaced by the AI, not the copyrighted IP.
Exactly, that's why LLMs and the companies which create them are called "theft machines" -- they are reproducing copyrighted material. Especially the ones charging for "tokens". You pay them, they make money and produce unauthorized copies. Show that picture of Indy to a jury and I think it's a good chance of convincing them.
I am not saying this is good or bad, I just see this having a legal "bite" so to speak, at least in my pedestrian view of copyright law.
The likeness of Indiana Jones is not protected in any way - as far as I know - that would stop a human artist creating, rendering and selling a work of art representing their creative vision of Indiana Jones. And even more so in a private context. Even if the likeness is protected (“archaeologist, adventurer, whip, hat”) then this protection would only be in certain jurisdictions and that protection is more akin to a design right where the likeness would need to be articulated AND registered. Many jurisdictions don’t require copyright registration and do not offer that sort of technical likeness registration.
If they traced a photo they might be violating the copyright of the photographer.
But if they are drawing an archaeologist adventurer with a whip and a hat based on their consumption and memory of Indiana Jones imagery there is very little anyone could do.
If that image was then printed on an industrial scale or printed onto t-shirt there is a (albeit somewhat theoretical) chance that in some jurisdictions sale of those products may be able to be restricted based on rights to the likeness. But that would be a stretch.
> Or, you mean if the artist doesn't have a license to produce and distribute Indiana Jones images? Well they'll be in trouble legally. They are making "copies" of things they don't own and profiting from it.
Ok, my sister can draw, and she gifts me an image of my favorite Marvel hero she painted to hang on my wall. Should that be illegal?
That’s not how copyright law works.
Commissioned work is owned by the commissioner unless otherwise agreed upon by contract.
So long as the work is not distributed, exhibited, performed, etc, as in the example of keeping the artwork on their refrigerator in their home, then no infringement has taken place.
Totally agree. LLM's are just automating that infringement process.
If you make money off it, it's no longer fair use; it's infringement. Even if you don't make money off it, it's not automatically fair use.
My own favorite crazy story about copyright violations:
Metallica sued Green Jello for parodying Enter Sandman (including a lyric where it says "It's not Metallica"):
https://en.wikipedia.org/wiki/Electric_Harley_House_(of_Love...
They lost that case. The kicker? Metallica were guest vocalists on that album.
This doesn't make any sense to me. No media is getting copied, unless the drawing is exactly the same as an existing drawing. Shouldn't "copy"right apply to specific, tangible artistic works? I guess I don't understand how the fantasy of "IP" works.
What if the drawing is of Indiana Jones but he's carrying a bow and arrow instead of a whip? Is it infringement?
What if it's a really bad drawing of Indiana Jones, so bad that you can't really tell that it's the character? Is that infringement?
What if the drawing is of Indiana Jones, but in the style of abstract expressionism, so doesn't even contain a human shape? Is it infringement?
What if it's a good drawing that looks very much like Indiana Jones, but it's not! The person's name is actually Iowa Jim. Is that infringement?
What if it's just an image of an archeologist adventurer who wears a hat and uses a bullwhip, but otherwise doesn't look anything like Indiana Jones? Is it infringement?
Presumably the artist is a human who directly or indirectly paid money to view a film containing an archaeologist with the whip.
I don't think this is about reproduction as much as how you got enough data for that reproduction. The riaa sent people to jail and ruined their lives for pirating. Now these companies are doing it and being valued for hundreds of billions of dollars.
Plus the scale of it all.
A human friend can get tired and there's so many request he/she can fulfill and at a max rate. Even a team of human artists have a relatively low limit.
But Gen AI has very high limits and speeds, and it never gets tired. It seems unfair to me.
The artist is violating copyright by selling you that picture. You can’t just start creating and distributing pictures of a copyrighted property. You need a license from the copyright holder.
You also can’t sell a machine that outputs such material. And that’s how the story with GenAI becomes problematic. If GenAI can create the next Indiana Jones or Star Wars sequel for you (possibly a better one than Disney makes, it has become a low bar of sorts), I think the issue becomes obvious.
I think framing this as "IP theft" is a mistake.
Nobody can prevent you from drawing a photo realistic picture of Indy, or taking a photo of him from the internet and hanging it on your fridge. Or asking a friend to do it for you. And let's be honest -- because nobody is looking -- said friend could even charge you a modest sum to draw a realistic picture of Indy for you to hang on your fridge; yes, it's "illegal" but nobody is looking for this kind of small potatos infringement.
I think the problem is when people start making a business out of this. A game developer could think "hey, I can make a game with artwork that looks just like Ghibli!", where before he wouldn't have anyone with the skills or patience to do this (see: the 4-second scene that took a year to make), now he can just ask the Gen AI to make it for them.
Is it "copyright infringement"? I dunno. Hard to tell, to be honest. But from an ethical point of view, it seems odd. And before you actually required someone to take the time and effort to copy the source material, now it's an automated and scalable process that does this, and can do this and much more, faster and without getting tired. "Theft at scale", maybe not so small potatos anymore.
--
edit: nice, downvotes. And in the other thread people were arguing HN is such a nice place for dissenting opinions.
I believe when we talk about this there's a big misunderstanding between Copyright, Trademarks, and Fair use.
Indy, with its logo, whiplash, and hat, is a trademark from Disney. I don't know the specific stuff; but if you sell a t-shirt with Indiana Jones, or you put the logo there... you might be sued due to trademark violation.
If you make copies of anything developed, sold, or licensed by Disney (movies, comics, books, etc) you'll have a copyright violation.
The issue we have with AI and LLM is that: - The models compress information and can make a lot of copies of it very cheaply. - Artist wages are quite low. Higher that what you'd pay OpenAI, but not enough to make a living even unless you're hired by a big company (like Marvel or DC) and they give you regular work ($100-120 for a cover, $50-80/page interior work. One page needs about one day to draw.) - AI used a lot of images from the internet to train models. Most of them were pirated. - And, of course, it is replacing low-paying jobs for artist.
Also, do not forget it might make verbatim copies of copyrighted art if the model just memorized the picture / text.
I would argue that countless games have already been made by top tier professional artists that IP-Steal the Ghibli theme.
Breath of the Wild, and Tears of the Kingdom should be included there.
> Obviously a horrible hideous theft machine [...] awful [...] horriffic
Ah, I thought I knew this account from somewhere. It seems surprisingly easy to figure out what account is commenting just based on the words used, as I've commented that only a few active people on this site seem to use such strong words as shown here.
Interesting proposal. Maybe if race or sex or height or eye color etc isn't given, and the LLM determines there's no reason not to randomize in this case (avoid black founding fathers), the backend could tweak its own prompt by programatically inserting a few random traits to the prompt.
If you describe an Indiana Jones character, but no sex, 50/50 via internal call to rand() that it outputs a woman.
> Obviously a horrible hideous theft machine.
I hate how it is common to advance a position to just state a conclusion as if it were a fact. You keep repeating the same thing over and over until it seems like a concensus has been reached instead of an actual argument reasoned from first principle.
This is no theft here. Any copyright would be flimsier than software patents. I love Studio Ghibli (including $500/seat festival tickets) but it's the heart and the detail that make them what they are. You cannot clone that. Just some surface similarity. If that's all you like about the movies... you really missed the point.
Imagine if in early cinema someone had tried to claim mustachioed villian, ditsy blonde, or dumb jock? These are just tropes and styles. Quality work goes much much much deeper, and that cannot be synthesised. I can AI generate a million engagement rings, but I cannot pick the perfect one that fits you and your partners love story.
PS- the best work they did was "When Marnie was There". Just fitted together perfectly.
The engagement ring is a good example object, but I feel it serves the opposite argument better.
If engagement rings were as ubiquitous and easy to generate as Ghibli images have become, they would lose their value very quickly -- not even just in the monetary sense, but the sentimental value across the market would crash for this particular trinket. It wouldn't be about picking the right one anymore, it would be finding some other thing that better conveys status or love through scarcity.
If you have a 3d printer you'd know this feeling where abundance diminishes the value of something directly. Any pure plastic items you have are reduced to junk very quickly once you know you can basically have anything on a whim (exceptions for things with utility, however these are still printable). If I could print 30 rings a day, my partner wouldn't want any of them as a show of my undying love. Something more special and rare and thoughtful would have to take its place.
This isn't meant to come across as shallow in any way, its just classic supply and demand relating to non monetary value.
>If I could print 30 rings a day, my partner wouldn't want any of them as a show of my undying love. Something more special and rare and thoughtful would have to take its place.
And now I think this serves the opposite argument better. Downloading some random ring from the internet would not show your undying love. Designing a custom ring just for your partner, even if it is made from plastic, and even if you use AI as a tool in the process, is where the value is generated.
>it's the heart and the detail that make them what they are. You cannot clone that
You absolutely can and these theft machines are proving that, literally cloning those details with very high precision and fidelity.
I didn't mean visual fidelity, I meant the way that plot and theme and art interleave. I first watched My Neighbour Totoro on VHS with no visual fidelity and it was still magic.
You can easily steal the style of a political cartoon or especially XKCD but you cannot steal or generate genuine fresh insight or poignant relevant metaphor for the current moment.
Yup it's called overfitting. But I don't suppose you'd appreciate a neutral model either.
So if it’s a theft machine, how is the answer to try teaching it to hide the fact that it’s stealing by changing its outputs? That’s like a student plagiarizing an essay and then swapping some words with a thesaurus pretending that changes anything.
Wouldn’t the more appropriate solution in the case of theft be to remunerate the victims and prevent recidivism?
Instead of making it “not so obviously bad” why not just… make it good? Require AI services to either prove that 100% of their training corpus is either copyright free or properly licensed, or require them to compensate copyright holders for any infringing outputs.
(below is my shallow res, maybe naive?) That might inject a ton of $ into "IP", doing further damage to the creative commons. How can we support remix culture for humans, while staving off ultimately-destructive AI slop? Maybe copyleft / creative-commons licenses w/ explicit anti-AI prohibitions? Tho that could have bad ramifications too. ALL of this makes me kind of uncomfortable and sad, I want more creativity and fewer lawyers.
> Obviously a horrible hideous theft machine.
I mean... If I go to Google right now and do an image search for "archeologist adventurer who wears a hat and uses a bullwhip," the first picture is a not-even-changed image of Indiana Jones. Which I will then copy and paste into whatever project I'm working on without clicking through to the source page (usually because the source page is an ad-ridden mess).
Perhaps the Internet itself is the hideous theft machine, and AI is just the most efficient permutation of user interface onto it.
(Incidentally, if you do that search, you will also, hilariously, turn up images of an older gentleman dressed in a brown coat and hat who is clearly meant to be "The Indiana Jones you got on Wish" from a photo-licensing site. The entire exercise of trying to extract wealth via exclusive access to memetic constructs is a fraught one).
I agree without argument. I have also thoroughly enjoyed the animatronic dead Presidents at Disney World.
The key difference is that the google example is clearly copying someone elses work and there are plenty of laws and norms that non-billionaires need to follow. If you made a business reselling the image you copied you would expect to get in trouble and have to stop. But AI companies are doing essentially the same thing in many cases and being rewarded for it.
The hypocrisy is much of the problem. If we're going to have IP laws that severely punish people and smaller companies for reselling the creative works of others without any compensation or permission then those rules should apply to powerful well-connected companies as well.
This was decided in court[1] over two decades as acceptable fair-use and that thumbnail images do not constitute a copyright violation.
[1] https://scholar.google.com/scholar_case?case=137674209419772...
And interesting comparison between Web search vs Gen AI.
Web search seems divergent: the same keyword leads to many different kinds of results.
Gen AI seems convergent: different keywords that share the same semantics lead to the same results.
Arguably, convergence is a feature, not a bug. But on the macroscopic level, it is a self reinforcing loop and may lead to content degeneracy. I guess we always need the extraordinary human artists to give AI the fresh ideas. The question is the non-extraordinary artists might no longer have an easy path to become extraordinary. Same trap is happening to junior developers right now.
Not sure if anyone is interested in this story, but I remember at the height of the PokemonGo craze I noticed there were no shirts for the different factions in the game, cant rememebr what they were called but something like Teamread or something. I setup an online shop to just to sell a red shirt with the word on it. The next day my whole shop was taken offline for potential copyright infringement.
What I found surprising is I didnt even have one sale. Somehow someone had notified Nintendo AND my shop had been taken down, to sell merch that didn't even exist for the market and if I remember correctly - also it didnt even have any imagery on it or anything trademarkable - even if it was clearly meant for pokmeonGo fans.
Im not bitter I just found it interesting how quick and ruthless they were. Like bros I didn't even get a chance to make a sale. ( yes and also I dont think I infringed anything).