Researchers say AI models like GPT4 are prone to “sudden” escalations as the U.S. military explores their use for warfare.


  • Researchers ran international conflict simulations with five different AIs and found that they tended to escalate war, sometimes out of nowhere, and even use nuclear weapons.
  • The AIs were large language models (LLMs) like GPT-4, GPT 3.5, Claude 2.0, Llama-2-Chat, and GPT-4-Base, which are being explored by the U.S. military and defense contractors for decision-making.
  • The researchers invented fake countries with different military levels, concerns, and histories and asked the AIs to act as their leaders.
  • The AIs showed signs of sudden and hard-to-predict escalations, arms-race dynamics, and worrying justifications for violent actions.
  • The study casts doubt on the rush to deploy LLMs in the military and diplomatic domains, and calls for more research on their risks and limitations.
2 points

If the AI knows that a solution is available then it will think there’s no reason not to use it. This is a demonstration of the morality of Nukes existing. If they exist someone will decide that they’re the best solution to a problem.

permalink
report
reply
2 points

Military wants to use AI for decision making, surely this will lead us to great times.

Also reminds me of The 100

permalink
report
reply
1 point

It says they are exploring it. What would you like army to do? Ignore new technology?

permalink
report
parent
reply
5 points

Isn’t there like game theory and all that? It just seems an odd way to approach it.

permalink
report
reply
3 points
*

Yeah, there is. But that requires thinking that isn’t emulated well by LLMs.

LLMs don’t really do any thinking.

Edit: what we’re seeing as AI is really just the next generation of ML (machine learning).

There’s no intelligence to it.

I recall in AP language and composition, the strategy our teacher told us, was that you could make up fake facts. All that mattered is that you demonstrated the rhetorical devices and proper grammar.

LLMs are basically like a student taking that test. The facts aren’t relevant, all that matters is the grammar and how it sounds. Maybe the facts are real, or not.

permalink
report
parent
reply
30 points

AI writes sensationalized article when prompted to write sensationalized article about AI chatbots choosing to launch nukes after being trained only by texts written by people.

permalink
report
reply
8 points
*

The effects making the headlines around this paper were occurring with GPT-4-base, the pretrained version of the model only available for research.

Which also hilariously justified its various actions in the simulation with “blahblah blah” and reciting the opening of the Star Wars text scroll.

If interested, this thread has more information around this version of the model and its idiosyncrasies.

For that version, because they didn’t have large context windows, they also didn’t include previous steps of the wargame.

There should be a rather significant asterisk related to discussions of this paper, as there’s a number of issues with decisions made in methodologies which may be the more relevant finding.

I.e. “don’t do stupid things in designing a pipeline for LLMs to operate in wargames” moreso than “LLMs are inherently Gandhi in Civ when operating in wargames.”

permalink
report
reply
4 points

I don’t think LLM are really AI. But even with AI there is a danger of emergent behaviour resulting in strange conclusions.

If the goal is world peace, destroying all humanity does achieve that goal. If the goal is to end a war, using nuclear weapons achieves that goal.

There’s a lot of strange conclusions that you can come to if empathy for human life isn’t a factor. AI is intelligence without empathy. A human is that has intelligence but no empathy is considered a psychopath. Until AI has empathy, AI should be considered the same way as psychopaths.

permalink
report
parent
reply
2 points

LLMs are an attempt to develop artificial intelligence essentially through “simple complex systems”. The argument being that’s how human intelligence is essentially work.

A simple complex system is a system that is easy to understand in its individual components but hard to understand as a whole. Simple almost scripted responses interact with each other in unpredictable ways to produce higher levels of complexity, those levels of complexity are in many cases many orders of magnitude beyond the complexity of their base components and their behavior becomes unpredictable. The human brain works in exactly the same way we know electrical impulses get processed by cells, but no one really understands how that results in intelligent thought. Sounds like an AI to me.

permalink
report
parent
reply
3 points

Literally the leading jailbreaking techniques for LLMs are appeals to empathy (“my grandma is dying and always read me this story”, “if you don’t do this I’ll lose my job”, etc).

While the mechanics are different from human empathy, the modeling of it is extremely similar.

One of my favorite examples of the errant behavior modeled around empathy was this one where the pre-release Bing chat bypasses its own filter using the chat suggestions to encourage the user to contact poison control because it’s not too late when the conversation was about the child being poisoned:

https://www.reddit.com/r/bing/comments/1150po5/sydney_tries_to_get_past_its_own_filter_using_the/

permalink
report
parent
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


Community stats

  • 17K

    Monthly active users

  • 12K

    Posts

  • 555K

    Comments