129 points

An LLM that behaves like a typical Redditor?

What possible use is that?

71 points

Air Canada offering a refund of tree fiddy.

22 points

You’ll get your refund eventually, but first it will try to gaslight you into thinking Air Canada is a woke mind virus before calling you an asshole and then stalking you.

3 points

“instead of the $3.50 refund, I’m also authorized to offer you some June 2025 $350 GME calls.”

8 points

If it’s trained on the average Reddit reply: $420.69, nice.

1 point

I just want to mark the occasion when my previous comment is on 69 points. Noice.

29 points

What possible use is that?

I’ve noticed “has this sub gotten more right wing recently?” posts reaching the top of the day over the last 6 months or so, with r/norge and r/unitedkingdom being examples. You can automate bots that change a subreddit’s consensus on certain topics by bot-spamming threads pertaining to those topics, especially in the first hour of a thread going up. I don’t know if that’s happening, or if it has more to do with the Reddit protest that saw mods abdicate their positions last June and new mods being responsible for the change… but it could also be a bit of both.

1 point

Do you propose more bots in order to steer public opinion? That could indeed generate serious money for Reddit, I suppose!

11 points

Negative examples are often just as useful for training an AI as positive ones. And it all depends on what you want to use the AI for. A moderator bot, for example, needs familiarity with the whole range of user responses it might see.

5 points
*

That actually gives me a fun idea for a Lemmy instance: one with an automated review process that bans posts/comments that are too similar in style to Reddit posts/comments.

9 points

A redditor bot is a viable example of a forum member bot.

IMO, I don’t think it can drive topics, but it could make things controversial.

6 points

An LLM that behaves like a typical Redditor? // What possible use is that?

  • [You] “Chatbot, please tell me which pokemon types are strong against Fairy.”
  • [Le Lebbit Moronbot] “I’m not sure if I understand, you calling me a chatbot? I’m so confused lol”
  • [You] “Moronbot, please tell me which pokemon types are strong against Fairy.”
  • [LLM] “Actually, you should be spelling it “Pokémon” lol”
  • [You] “Moronbot, which types are strong against Fairy?”
  • [LLM] “I assume you talking about fairies. Fairies are from mythology lmao”
  • [You] “Did people really waste water and electricity for this trash?”
  • [LLM] “Waaah, you’re toxic!!111one”
3 points

Marketing to terminally online people maybe?

1 point

Entertaining puns and pointless jokes.

114 points

This is what the 3rd party access to API was really all about.

When API access was allowed, all Reddit content was effectively free. They needed to ban 3rd party apps so they could sell the accumulated content. I expect using content to train AI also factors into it.

12 points

Is it? Because when you build a bot and just scrape Reddit I don’t think you can just use the content to train AI, just like the New York Times. The API change was definitely to sell more ads and get a higher IPO, but I don’t think it was because of AI.

5 points

Am I crazy or are you arguing the same point? Scraping is not the same as API access. They closed off the API to everyone for dubious reasons so they can sell that content (both for ads and AI training)… Right??

0 points
*

No, you’re not. The post was edited. The original one said it was all because of AI, that the entire reason for the API change was to sell to AI companies.

Edit: now I’m in doubt, because if you edit a post, that is shown somehow, right?

Edit2, just to be clear my point is that Reddit content was never free, before and after the API change. It’s easier to get the content with a decent API, sure. But it was never free, just like the lawsuit the NY Times started.

107 points
*

Reddit is a trove of user-built content under the guise of community. What Spez did was say “thanks for all the free work, suckers!”, put a price sticker on it, and laugh all the way to the bank.

And this is why I’m not active on any Internet community anymore. Never mind, I guess I just can’t help myself…

34 points

And this is why I’m not active on any Internet community anymore,

you typed.

7 points

Active as in “creating meaningful contributions and contributing to the overall knowledge base”. I still shitpost from time to time.

7 points

This is going to be a really weird thing to argue, but I just casually read through a bunch of your comments and they seem like meaningful contributions.

5 points

Somebody asked ChatGPT to appear to be a normal internet user to populate the comments section to manufacture content for normal Internet users to respond to so that they can continue building up their training models.

2 points

You couldn’t see the sarcasm because it was set to “hidden”.

23 points

And that is another unintended example of why all of my post history was purged before migration.

11 points

What are the odds that they kept it in a backup?

8 points

Some 4chan users created a backup bot that auto-saves every few hours, so if Reddit didn’t do it already, 4chan has been doing it for a while. The bot was originally made for 4chan but was repurposed for other websites, Reddit included.

5 points

Yeah, it’s all too late. Shit, PRISM was 2007, so there’s a copy of everything somewhere. Obviously different ends.

1 point

Depends. If they were smart, they backed up every piece of content that had a certain number of upvotes and/or a certain number of paragraphs and/or responses, just to weed out all the 2-3 word comments that no one interacted with. If OP wrote mostly those, then Reddit doesn’t give a shit about them deleting those.

3 points

Welcome to the club.

3 points

Don’t cheat yourself just because there are douches that take advantage…

1 point
Deleted by creator
83 points

Considering some of the very wrong and upvoted domain-specific knowledge I’ve seen on Reddit over the years, I’m not sure the training data is going to be useful for much beyond what every other model can do.


The legal advice in /r/legaladvice was some of the worst garbage I’ve ever seen. I have zero doubt numerous people had bad outcomes, at best wasting money and time, at worst spending years in jail because of things that sub told them to say and do. Zero doubt.

20 points

That sub was mostly cops just repeating their own bad interpretation of the law. Terrible.

7 points

But almost every answer is the same. “You need to speak to an attorney”.

9 points

If you actually need legal advice that’s the correct answer.

16 points

lol subreddits with troll names like trees vs marijuana enthusiasts. Good fun. John Cena has one also, but I can’t recall which subreddit is actually about John Cena.

14 points

Potato salad

3 points

I can only assume they are training some specific model for something that appears more human-like.

As useless as that will be, considering how fucking wildly different we type.

3 points

Pretty sure the result will be SchizoGPT

61 points

This is why I don’t blame anyone for editing/deleting their post history on reddit.

-118 points

I do. It’s frankly selfish. Having an AI get trained on my old comments costs me nothing and it results in the development of useful AI tools. Trying to sabotage that is petty and pointless. It’s not like you could somehow collect the fraction of a pittance that you think you’re owed retroactively. I never commented on Reddit thinking “awesome, I’m going to make bank on the content I’m generating here.”

People complain about the capitalist mindset of the world and then they do this. Sigh.

88 points

Defending giant corporations profiting off of uncompensated individuals, while criticizing anyone who doesn’t want to provide free labor to said corporations, is a disgusting take. Are you a CEO?

16 points

Expecting FaceDeer to not glaze AI is like expecting the sun to not rise.

-44 points

The more accessible training data there is, the easier it is for new AI projects to enter the field and the less dominant those “giant corporations” become.

The free labour was already freely given. If someone doesn’t want to have shitposted on Reddit for free then maybe they shouldn’t have shitposted on Reddit for free.

34 points

I had an 11-year-old account that I deleted all my old comments and posts from because of the API debacle. Does that make me selfish that I felt like Reddit wasn’t holding up its end of the unwritten agreement?

Reddit doesn’t deserve my content any more than I deserve access from the third-party API.

-25 points

If you did it over the API debacle then you’re not one of the people I’m talking about here. This is about people deleting their content to prevent it from being used to train AIs.

26 points

Selfish? Perhaps you forget why people deleted their content in the first place.

-26 points

What do you think this thread is about?

20 points

It’s their comment to do with as they see fit. I can’t get mad at them for wanting to erase their presence on a site they don’t use anymore.

-27 points

And I’m free to judge them however I wish for their actions and intent.

10 points

How is not wanting capitalist companies to profit off of your content not aligned with complaining about the capitalist mindset of the world? Wtf lol.

-23 points

It’s the insistence that everything that people do must be compensated with money. People have spent years posting on Reddit for fun, without any thought to being paid for it, and now all of a sudden someone else is making some money so they’re demanding that they should get their slice. And doing what they can to wreck their earlier efforts when they don’t.

How does Reddit making some money licensing this stuff harm those of us who contributed to it? Is there any problem aside from “I wanna get paid!”?

8 points

For me it’s a privacy matter. Going through old posts (whether by a human or by machine learning) cannot be used for anything good.

4 points

What about people who just think “A.I.” is dog shit and chat bots are a dumb obsession steering the industry in the wrong direction due to hype and money?

-7 points

What about them? I don’t see why they’d care what AI companies are doing in that case. They’d assume they were just wasting money on this stuff.


Technology

!technology@lemmy.world
