lemm.ee

Local All Communities Log in Sign up

Local All Communities

771

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series(www.businessinsider.com)

posted 1 year ago

by

L4sBot@lemmy.worldMB

in

technology@lemmy.world

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling’s Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

Sort:

Hot Top Controversial New Old

You are viewing a single thread.

View all comments

[ +- ]

stappern@lemmy.one

44 points

1 year ago

*

Deleted by creator

report

reply

[ +- ]

OkToBeTakei@lemm.ee

14 points

1 year ago

*

Deleted by creator

report

reply

[ +- ]

🇰 🌀 🇱 🇦 🇳 🇦 🇰 ℹ️@yiffit.net

7 points

1 year ago

AI and your brain are very different things

How do you know that guy isn’t an AI?

report

reply

[ +- ]

OkToBeTakei@lemm.ee

5 points

1 year ago

*

Deleted by creator

report

reply

[ +- ]

kmkz_ninja@lemmy.world

2 points

1 year ago

His point is equally valid. Can an artist be compelled to show the methods of their art? Is it as right to force an artist to give up methods if another artist thinks they are using AI to derive copyrighted work? Haven’t we already seen that LLMs are really poor at evaluating whether or not something was created by an LLM? Wouldn’t making strong laws on such an already opaque and difficult-to-prove issue be more of a burden on smaller artists vs. large studios with lawyers-in-tow.

report

reply

[ +- ]

OkToBeTakei@lemm.ee

0 points

1 year ago

*

Deleted by creator

report

reply

[ +- ]

Siliconic@discuss.online

1 point

1 year ago

Irrelevant. You will be assimilated. Resistance is futile.

report

reply

Show more comments

Show more comments

Show more comments

[ +- ]

Asuka@sh.itjust.works

3 points

1 year ago

If I read Harry Potter and wrote a novel of my own, no doubt ideas from it could consciously or subconsciously influence it and be incorporated into it. Hey is that any different from what an LLM does?

report

reply

[ +- ]

stappern@lemmy.one

1 point

1 year ago

*

Deleted by creator

report

reply

[ +- ]

newIdentity@sh.itjust.works

9 points

1 year ago

Your brain isn’t an AI model

OR IS IT?

report

reply

[ +- ]

TwilightVulpine@lemmy.world

13 points

1 year ago

You joke but AI advocates seem to forget that people have fundamentally different rights than tools and objects. A photocopier doesn’t get the right to “memorize” and “learn” from a text that a human being does. As much as people may argue that AIs work different, AIs are still not people.

And if they ever become people, the situation will be much more complicated than whether they can imitate some writer. But we aren’t there yet, even their advocates just uses them as tools.

report

reply

[ +- ]

Even_Adder@lemmy.dbzer0.com

3 points

1 year ago

You should read this article by Kit Walsh, who’s a senior staff attorney at the EFF too. The EFF is a digital rights group who most recently won a historic case: border guards now need a warrant to search your phone.

report

reply

[ +- ]

TwilightVulpine@lemmy.world

9 points

1 year ago

*

But this falls exactly under what I just said. To say that using Machine Learning to imitate an artist without permission is fine, because humans are allowed to learn to each other, is making the mistake of assigning personhood to the system, that it ought to have the same rights that human beings do. There is a distinction between the rights of humans as opposed to tools, so to say that an AI can’t be trained on someone’s works to replicate their style doesn’t need to apply to people.

Even if you support that reasoning, that still doesn’t help the writers and artists whose job is threatened by AI models based on their work. That it isn’t an exact reproduction doesn’t change that it relied on using their works to begin with, and it doesn’t change that it serves as a way to undercut them, providing a cheaper replacement for their work. Copyright law as it was, wasn’t envisioned for a world where Machine Learning exists. It doesn’t really solve the problem to say that technically it’s not supposed to cover ideas and styles. The creators will be struggling just the same.

Either the law will need to emphasize the value of human autorship first, or we will need to go through drastic socioeconomic changes to ensure that these creators will be able to keep creating despite losing market to AI. Otherwise, to simply say that AI gets to do this and change nothing else, will cause enormous damage to all sort of creative careers and wider culture. Even AI will become more limited with less fresh new creators to learn elements from.

report

reply

[ +- ]

Even_Adder@lemmy.dbzer0.com

2 points

1 year ago

The system doesn’t get personhood, it is your tool, and as said in the article:

Fair use protects reverse engineering, indexing for search engines, and other forms of analysis that create new knowledge about works or bodies of works. Here, the fact that the model is used to create new works weighs in favor of fair use as does the fact that the model consists of original analysis of the training images in comparison with one another.

It is your right, not the system’s you’re upholding.

report

reply

[ +- ]

TwilightVulpine@lemmy.world

4 points

1 year ago

There is a difference between “analyzing” and derivating. The authorship of AI-created works is also not the user’s, it takes more than a prompt for that, and that seems to be the conclusion courts are leaning towards.

Still, even if that turns out to be technically correct, it still doesn’t help the creators getting undercut who might be driven out of their careers by AI.

report

reply

Show more comments

Show more comments

Show more comments

Show more comments

Show more comments

[ +- ]

kmkz_ninja@lemmy.world

3 points

1 year ago

How do you see that as a difference? Tools are extensions of ourselves.

Restricting the use of LLMs is only restricting people.

report

reply

[ +- ]

TwilightVulpine@lemmy.world

4 points

1 year ago

When we get to the realm of automation and AI, calling tools just an “extension of ourselves” doesn’t make sense.

Especially not when the people being “extended” by Machine Learning models did not want to be “extended” to begin with.

report

reply

Show more comments

[ +- ]

TropicalDingdong@lemmy.world

7 points

1 year ago

Exactly. If I write some Loony toons fan fiction, Warner doesn’t own that. This ridiculous view of copyright (that’s not being challenged in the public discourse) needs to be confronted.

report

reply

[ +- ]

OkToBeTakei@lemm.ee

8 points

1 year ago

*

Deleted by creator

report

reply

[ +- ]

wmassingham@lemmy.world

4 points

1 year ago

*

They can own it, actually. If you use the characters of Bugs Bunny, etc., or the setting (do they have a canonical setting?) then Warner does own the rights to the material you’re using.

For example, see how the original Winnie the Pooh material just entered public domain, but the subsequent Disney versions have not. You can use the original stuff (see the recent horror movie for an example of legal use) but not the later material like Tigger or Pooh in a red shirt.

Now if your work is satire or parody, then you can argue that it’s fair use. But generally, most companies don’t care about fan fiction because it doesn’t compete with their sales. If you publish your Harry Potter fan fiction on Livejournal, it wouldn’t be worth the money to pay the lawyers to take it down. But if you publish your Larry Cotter and the Wizard’s Rock story on Amazon, they’ll take it down because now it’s a competing product.

report

reply

[ +- ]

joxese3341@sh.itjust.works

2 points

1 year ago

*

Deleted by creator

report

reply

[ +- ]

Sethayy@sh.itjust.works

0 points

1 year ago

I think its more like writing a loony toons fanfic based only on pirated material

report

reply

[ +- ]

stappern@lemmy.one

1 point

1 year ago

*

Deleted by creator

report

reply

[ +- ]

Sethayy@sh.itjust.works

1 point

1 year ago

Can’t but theyre pretty open on how they trained the model, so like almost admitted guilt (though they werent hosting the pirated content, its still out there and would be trained on). Cause unless they trained it on a paid Netflix account, there’s no way to get it legally.

Idk where this lands legally, but I’d assume not in their favour

report

reply

Show more comments

[ +- ]

CoderKat@lemm.ee

6 points

1 year ago

*

It’s honestly a good question. It’s perfectly legal for you to memorize a copyrighted work. In some contexts, you can recite it, too (particularly the perilous fair use). And even if you don’t recite a copyrighted work directly, you are most certainly allowed to learn to write from reading copyrighted books, then try to come up with your own writing based off what you’ve read. You’ll probably try your best to avoid copying anyone, but you might still make mistakes, simply by forgetting that some idea isn’t your own.

But can AI? If we want to view AI as basically an artificial brain, then shouldn’t it be able to do what humans can do? Though at the same time, it’s not actually a brain nor is it a human. Humans are pretty limited in what they can remember, whereas an AI could be virtually boundless.

If we’re looking at intent, the AI companies certainly aren’t trying to recreate copyrighted works. They’ve actively tried to stop it as we can see. And LLMs don’t directly store the copyrighted works, either. They’re basically just storing super hard to understand sets of weights, which are a challenge even for experienced researchers to explain. They’re not denying that they read copyrighted works (like all of us do), but arguably they aren’t trying to write copyrighted works.

report

reply

[ +- ]

SubArcticTundra@lemmy.ml

1 point

1 year ago

No, because you paid for a single viewing of that content with your cinema ticket. And frankly, I think that the price of a cinema ticket (= a single viewing, which it was) should be what OpenAI should be made to pay.

report

reply

[ +- ]

stappern@lemmy.one

3 points

1 year ago

*

Deleted by creator

report

reply

Technology

!technology@lemmy.world

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

@L4s@lemmy.world
@autotldr@lemmings.world
@PipedLinkBot@feddit.rocks
@wikibot@lemmy.world

Community stats

17K
Monthly active users
12K
Posts
555K
Comments

Community moderators

L3s@lemmy.world
L3s@fry.gs
L4sBot@fry.gsB
L4sBot@lemmy.worldB
enu@lemmy.world

modlog legal instances join-lemmy.org

lemmy-ui-next v0.11.0 (github)lemmy v0.19.5 (github)