You are viewing a single thread.
View all comments
17 points

I can’t wait for redditors to see the opportunity to poison ChatGTP hard.

permalink
report
reply
15 points

They might not even have to. I bet there are bots already having entire discussions by themselves on there.

Anti Commercial-AI license

permalink
report
parent
reply
10 points

/r/subredditsimulator

permalink
report
parent
reply
2 points

The license does not apply to posts and replies in Reddit, right? Thank god I created a blog to post about any stuff that I want, without license or restrictions from Reddit. Before the AI breakthrough and what happened to Reddit. But even if so, do AI tools understand such a license text and evaluate if they can or cannot use the material?

permalink
report
parent
reply
7 points
*

No, that user has the license on all of their comments

permalink
report
parent
reply
5 points

From what I understand LLMs are just large heuristic machines. They gather a lot of statistics on token order and return an answer to that with something that statistically should higher than other options. There’s no “understanding”. So to answer your question, no, they don’t understand the license.

Content is most likely scraped wholesale from websites, possibly run through some clean up to possibly filter out absolute garbage, and fed into an LLM to train it. An LLM can be tricked to reveal its training data (e.g repeat “fruit” forever). It’s in those cases where copyright infringement is detected and if action can and has be taken. There are court cases currently in review, the most popular being the one against Github Copilot for infringing on the license of sourcecode it ingested.

Anti Commercial-AI license

permalink
report
parent
reply
4 points

do AI tools understand such a license text and evaluate if they can or cannot use the material?

So, this is the fun part: AI tools don’t auto-ingest material to process it. The developers choose the materials to feed into the models.

And while the tech bros can understand your licenses, they don’t give a flying fuck, because they think they’ll be billionaires beyond consequences by the time anyone discovers that their work in particular has been ripped off.

permalink
report
parent
reply

Technology

!technology@beehaw.org

Create post

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

Community stats

  • 2.6K

    Monthly active users

  • 3.4K

    Posts

  • 82K

    Comments