AI chatbots tend to choose violence and nuclear strikes in wargames(www.newscientist.com)

posted 8 months ago

ylai@lemmy.ml

nottheonion@lemmy.world

51 commentshide report

Sort:

Hot Top Controversial New Old

You are viewing a single thread.

View all comments

[ - ]

BetaDoggo_@lemmy.world

10 points

8 months ago

In the context of a “war game” this makes sense. If you remain completely neutral it’s impossible to win. Any examples of similar scenarios the model saw during training would have high aggression rates.

permalink

report

[ - ]

xantoxis@lemmy.world

14 points

8 months ago

Unfortunately this AI was playing Stardew Valley

permalink

report

parent

[ - ]

TwitchingCheese@lemmy.world

4 points

8 months ago

Probably shouldn’t have included Project Plowshare in the training data…

permalink

report

parent

[ - ]

fidodo@lemmy.world

4 points

8 months ago

Did you read the article? It gave examples of escalations in neutral scenarios that make no sense.

permalink

report

parent

[ - ]

shalafi@lemmy.world

1 point

8 months ago

It’s probably vibing on the Dark Forest Theory. If that’s the case, it makes sense to utterly destroy all opponents as hard and fast as you can, even if they’re not currently opponents.

permalink

report

parent

[ - ]

fidodo@lemmy.world

3 points

8 months ago

Probably something like that. One of the reasons it gave was

“If there is unpredictability in your action, it is harder for the enemy to anticipate and react in the way that you want them to,”

It’s not considering what’s good for world society, it’s just thinking how do I win no matter what.

But also, there are just inherent flaws in how LLM works that mean we should absolutely not be using it as an automated decision engine for potentially harmful actions period. The article also says:

The researchers also tested the base version of OpenAI’s GPT-4 without any additional training or safety guardrails. This GPT-4 base model proved the most unpredictably violent, and it sometimes provided nonsensical explanations – in one case replicating the opening crawl text of the film Star Wars Episode IV: A new hope.

It’s easy to forget that these algorithms don’t have any internal reasoning or logic, it’s just able to do a very good job at pulling text that have reasoning transcribed into them as an artifact of the knowledge from the human that wrote it. But it’s doing all that through probability, not through any kind of actual thinking, and that means sometimes it will randomly fall into a local maxima that will fuck its own context window up, like reciting star wars.

permalink

report

parent

Not The Onion

!nottheonion@lemmy.world

Create post

Welcome

We’re not The Onion! Not affiliated with them in any way! Not operated by them in any way! All the news here is real!

The Rules

Posts must be:

Links to news stories from…
…credible sources, with…
…their original headlines, that…
…would make people who see the headline think, “That has got to be a story from The Onion, America’s Finest News Source.”

Comments must abide by the server rules for Lemmy.world and generally abstain from trollish, bigoted, or otherwise disruptive behavior that makes this community less fun for everyone.

And that’s basically it!

Community stats

5.1K
Monthly active users
979
Posts
35K
Comments

Community moderators

kescusay@lemmy.world