Japan determines copyright doesn't apply to LLM/ML training data

ericjmorey@programming.dev · 6 months ago

Japan determines copyright doesn't apply to LLM/ML training data

Bitflip@lemmy.ml · 6 months ago

Nice, time to train one with all the Nintendo leaks and generate some Zelda art and a new Mario title!

ZickZack@fedia.io · 6 months ago

train one with all the Nintendo leaks

This is fine

generate some Zelda art and a new Mario title

This is copyright infringement.

The ruling in japan (and as I predict also in other countries) is that the act of training a model (which is just a statistical estimator) is not copyrightable, so cannot be copyright infringement. This is already standard practice for everything else: You cannot copyright a mathematical function, regardless of how much data you use to fit to it (that is sensible: CERN has fit physics models to petabytes worth of data, that doesn’t mean they hold a copyright on laws of nature, they just hold the copyright on the data itself). However, if you generate something that is copyrighted, that item is still copyrighted: It doesn’t matter whether you used an AI image generator, photoshop, or a tattoo gun.

NotMyOldRedditName@lemmy.world · 6 months ago

But you could ask it to generate something in the style of Zelda, without being Zelda.

ZickZack@fedia.io · 6 months ago

And that would be completely legal, just like any random guy on deviantart can draw something in the style of e.g. Picasso without getting into trouble (unless of course they claim it was painted by picasso, but that should be obvious).

FidiFadi@lemmy.world · 6 months ago

Nintendo would have coup the government if the decision made this scenario actually possible.

yamanii@lemmy.world · 6 months ago

The japanese artists will be so happy /s

Gutless2615@ttrpg.network · 6 months ago

Correct ruling.

yamanii@lemmy.world · 6 months ago

Why can’t I pirate stuff without risking jail like AI companies?

Camelbeard@lemmy.world · 6 months ago

If you read a book you can talk about it, quote it, draw characters from that book, write your own ending, etc.

Isn’t that kind of the same? Let’s say some day we have an AI with near human intelligence, why can’t the AI be trained on copyright works, just like humans, all our school books are copyrighted works?

Geobloke@lemmy.world · 6 months ago

Because you paid for that book?

Camelbeard@lemmy.world · 6 months ago

So if AI companies pay for a book or music (like a consumer) it’s no problem? Because I don’t think this is about paying for content, it’s that content holders refuse to work with AI companies.

Mustard@lemmy.blahaj.zone · 6 months ago

Unironically yes, if AI companies paid for training data everyone would be much happier.

I sincerely doubt that NOBODY is willing to sell data to them. It’s far more likely that they have not offered anyone a fair price yet, which makes sense because that would set a precedent.

Even then, if people don’t want to sell them their copyrighted work then tough. You can’t compel people to take customers they don’t want.

Armok: God of Blood@lemmy.dbzer0.com · 6 months ago

So if I go on a free website that hosts art (ArtStation, DeviantArt, etc.) and get training data that I could have legally accessed for free…

Mustard@lemmy.blahaj.zone · 6 months ago

They’ve all already done that haha. You could argue that a human has only one life in which to remix that art but an AI is theoretically immortal, so it’s a different category of customer.

At any rate, it’s clear that AI should not have free access to copyrighted works, like news articles, academic papers, stock images, and various kinds of non deviantart art.

Euphoma@lemmy.ml · edit-2 6 months ago

I’m pretty sure its technically copyright infringement to draw the characters (if they have a design in the book in images) or write fanfic, but no one cares. The only fan stuff that actually get taken down is nintendo fan games and in the past, videos on nintendo games without permission.

General_Effort@lemmy.world · 6 months ago

Why do you think you can’t? What do you think was illegal?

lloram239@feddit.de · 6 months ago

You can. Distributing copies is illegal, not downloading them. That’s why torrents are bad and streaming sites are fine. (Some exceptions might apply depending on your country).

Raymonf@lemmy.uhhoh.com · edit-2 6 months ago

From the “source” (the Japanese one, not the broken link at the bottom):

AIによる解析・学習についての現段階の日本の著作権法上の見解を確認、権利侵害の懸念も政府に訴えた一方、生成・出力段階での出力自体の著作権の扱いや元データの著作権の扱いが未確認ですので今後質疑等で確認する予定です。

生成系AIの活用、著作権者を守るための新たな規制が必要だ

It’s an analysis of the current copyright law in Japan. It does not mean that they won’t update the law eventually. What a terrible article.

ericjmorey@programming.dev · 6 months ago

Yeah i wish I had run the source article through Google Translate first.

https://programming.dev/comment/6159535

Japan determines copyright doesn't apply to LLM/ML training data

Japan determines copyright doesn't apply to LLM/ML training data

Taggart :donor: (@mttaggart)