

Fair enough, I see what you’re saying.
I’ll go ahead and share the quote from the court’s decision for context:
We affirm the denial of Dr. Thaler’s copyright application. The Creativity Machine cannot be the recognized author of a copyrighted work because the Copyright Act of 1976 requires all eligible work to be authored in the first instance by a human being. Given that holding, we need not address the Copyright Office’s argument that the Constitution itself requires human authorship of all copyrighted material. Nor do we reach Dr. Thaler’s argument that he is the work’s author by virtue of making and using the Creativity Machine because that argument was waived before the agency.
I’m a little bit uncertain based on this summary of the judgement by the Stanford library on copyright and fair use:
Dr. Thaler sought review of the Copyright Office’s decision in the United States District Court for the District of Columbia. The district court affirmed the Copyright Office’s denial, holding that human authorship is a fundamental requirement under the Copyright Act of 1976. The court also rejected Dr. Thaler’s argument that he should own the copyright under the work-made-for-hire doctrine, as the work was never eligible for copyright protection in the first place
Why are they saying that “the work was never eligible for copyright in the first place”? Because Thaler claimed that the AI itself made the work? This all feels a bit like Schroedinger’s Copyrighted Work to me… the work exists, so who made it?
Generative AI fans would have you believe that they are the author and copyright holder, because they wrote a prompt.
AI companies might want to argue, like Thaler, that they made the AI, so they are the author and copyright holder.
My personal opinion is that the prompt and code are both relatively insignificant in comparison to the training data from which the probabilistic machine learning model is derived. The prompt would do nothing without the model, and OpenAI themselves said they quiet part out loud when they argued in court that the creation of a model such as theirs would be “impossible” to achieve without training off of vast amounts of copyrighted works.
“It would be impossible to train today’s leading AI models without using copyrighted materials … Limiting training data to public domain books and drawings created more than a century ago might yield an interesting experiment, but would not provide AI systems that meet the needs of today’s citizens,”
Clearly the training data itself is the most important piece of the system, which makes a lot of sense to those of us who understand how machine learning and “AI” training actually works on a technical level. They’ve admitted in plain English that their entire product and for-profit business model relies on the use of other people’s work as training data. Sounds to me like they have derived considerable value from other people’s work without any sort of license or compensation…
By that logic alone, I would argue that the real copyright holders of generative AI works ought to be, at least in part, the people who provided (wittingly or unwittingly) the training data. They are the ones who made this whole social experiment possible, after all. Data is the new code, so I’m not sure why people expect to be able to use it for free in an unrestricted way.
This is a false equivalency that equates natural learning and human agency with “machine learning”, when in that they are not remotely the same. This is a common and extremely flawed personification of a mathematical system that simply does not “learn” in the same way that a human being does.
Contrary to what seems to be a popular belief today, the creative insight of a human artist is not simply a combination of all of the other works of art that they have seen (akin to training data superimposed into a model). A human artist has the x-factors of personal agency, taste, and the constant sensory barrage of simply living as a huge part of their creative development. For every painting that a human artist sees, they see an unknowable score of other things that influence their perception of the world and art.
This is very much not a legal point that you’re arguing here, by the way, it’s a technical and practical one.
“Style” is not what’s in question. It never was, and it wasn’t a word that I used in my example.
ML models are not trained on “style”. They are trained on actual works.
And in many cases (including in OpenAI’s case) trained on an unimaginable amount of full copyrighted works, in their entirety, without license or consent from the copyright holders, often times pirated with DRM circumvented.
It’s a simple fact of the technology that OpenAI’s Ghibli filter could not have been made without training off of a large amount (probably every frame of every film, if I had to make an educated guess) of their actual artistic work. OpenAI have admitted that much themselves in court.
You seem to have forgotten that this is a social media website comments section discussion, not a court of law.
I’m sharing my personal opinion, with a background in art, music, and programming, not law.
I’m entitled to do so, and I won’t stop because it should go without saying that the copyright system matters a great deal to people who actually make things.
If you think you’re above that then I’m not sure why you’re even here, frankly. Are you here to argue that any of this is fair use? I don’t see you making that case… (Maybe slightly timidly making that case, but not really going for it.)
In the end this topic is central to human culture and society, it’s not some kind of intellectual exercise for only people in blue suits to muse about.
Welcome to “the court of public opinion”, where Texan judges and Roman popes alike can be wrong.