lemm.ee

Local All Communities Log in Sign up

Local All Communities

118

Claude 3 launched by Anthropic — new AI model leaves OpenAI's GPT-4 in the dust(www.tomsguide.com)

posted 8 months ago

by

in

technology@lemmy.world

Title is a bit dramatic, but yes, Claude 3 claims to be better than GPT 4 in most ways.

Sort:

Hot Top Controversial New Old

[ +- ]

june@lemmy.world

28 points

8 months ago

I just spent some time on Claude 3, and I see how it can be considered ‘better’ than GPT4, however I quickly found that it tends to lie about itself in subtle ways. When I called it out on an error it would say things like ‘I’ll strive to be better’. I called it out on the fact that it’s model doesn’t grow or change based on conversations it has and that it’s impossible for it to strive to do anything outside of, maybe, that chat. It then went on to show me that it couldn’t even adjust within that chat by doing the same thing 5 more times in 5 different ways.

I see the model it used for the apologies (acknowledge, apologize, state intent to do better in the future) which is appropriate for people or beings capable of learning, but it is not. I went from having a good conversation with it about a poem I wrote to being weirdly grossed out by it. GPT does a good job of not pretending to be human, and I appreciate that.

report

reply

[ +- ]

catloaf@lemm.ee

6 points

8 months ago

*

The cynic in me says that’s perfectly human behavior, though

report

reply

[ +- ]

june@lemmy.world

7 points

8 months ago

Yea that’s what I’m saying, and I don’t like it. I don’t want my LLM acting human, I want it acting like an LLM. My interactions with Claude 3 were very uncanny valley and bugged me a lot.

report

reply

[ +- ]

9bananas@lemmy.world

3 points

8 months ago

so you’re basically saying it talked itself squarely into uncanny valley?

i honestly didn’t consider that would be an issue for LLMs, but in hindsight…yeah, that’s gonna be a problem…

report

reply

Show more comments

Show more comments

[ +- ]

Hackerman_uwu@lemmy.world

22 points

8 months ago

Hey have you guys heard about ChatGPT 7? It makes chatGPT 6 look like ChatGPT 5!

Who ever thought the AI awakening would this fucking banal?

report

reply

[ +- ]

bionicjoey@lemmy.ca

9 points

8 months ago

Don’t worry, it’s not the AI awakening. It’s just people figuring out how to sell text generators.

report

reply

[ +- ]

androogee (they/she)@midwest.social

4 points

8 months ago

If you generate your own marketing copy, these things practically sell themselves

report

reply

[ +- ]

IvanOverdrive@lemm.ee

19 points

8 months ago

It does amazingly well with schemas:

When the dead rabbit was seen by the dog, it hopped. What does “it” refer to: the rabbit or the dog?

In the sentence “When the dead rabbit was seen by the dog, it hopped,” the pronoun “it” refers to the dog, not the rabbit.

This is because the rabbit is described as “dead,” meaning it cannot hop or move. The verb “hopped” implies movement, which can only logically apply to the living subject in the sentence, which is the dog.

So the intended meaning is that when the dog saw the dead rabbit, the dog itself hopped, likely in surprise or reaction to seeing the dead animal.

When the iceberg was struck by the ship, it sunk. What does “it” refer to: the iceberg or the ship?

In the sentence “When the iceberg was struck by the ship, it sunk,” the pronoun “it” refers to the ship, not the iceberg.

The reason is that icebergs, being large masses of ice floating in water, do not actually “sink” when struck. They are already floating and may crack or break apart when struck, but the word “sunk” implies something going down below the water’s surface.

Ships, on the other hand, are vessels designed to float on water. If a ship strikes an iceberg with enough force, it can cause damage that leads to the ship taking on water and eventually sinking below the surface.

So in this context, with the verb “sunk” being used, it is more logical that “it” is referring to the ship sinking after striking the iceberg, rather than describing what happened to the stationary iceberg itself.

report

reply

[ +- ]

asdfasdfasdf@lemmy.world

10 points

8 months ago

You’re right! I tried against ChatGPT 3.5:

“It” in this context likely refers to the dead rabbit, as it is the subject of the sentence and the one described as hopping.

It got the ship one right though.

report

reply

[ +- ]

IvanOverdrive@lemm.ee

3 points

8 months ago

I found that it helps if you ask chatGPT 4 to act as a Vulcan from Star Trek, it does better with logic puzzles. But it doesn’t work with 3.5.

report

reply

[ +- ]

kromem@lemmy.world

8 points

8 months ago

The sonnet model is decent, but something weird is going on with their opus model as it just terribly sucks.

Mistral-large is probably the best large model for practical purposes at this point.

report

reply

[ +- ]

bugsmith@programming.dev

4 points

8 months ago

Mistral-large is probably the best large model for practical purposes at this point.

What makes you say that? I have not performed my own comparison, but everything I have seen and read suggests that GPT4 is king, currently.

report

reply

[ +- ]

kromem@lemmy.world

11 points

8 months ago

*

It depends on the task, but in general a lot of the models have fallen into a dark pattern of Goodhart’s Law, targeting the benchmarks but suffering at other things.

So as an example, while GPT-4 used to correctly model variations of the wolf, goat, cabbage problem with token similarity hacks (i.e. using emojis instead of nouns to break pattern similarity with the standard form of the question), now it even fails for that with the most recent updates, whereas mistral-large is the only one that doesn’t need the hack at all.

report

reply

[ +- ]

bugsmith@programming.dev

3 points

8 months ago

Interesting. That’s not something I’ve heard about until now, but something I’ll surely look into.

report

reply

[ +- ]

Noodle07@lemmy.world

4 points

8 months ago

It’s called Claude though… That’s definetly not better than GPT

report

reply

[ +- ]

Player2@lemm.ee

15 points

8 months ago

Disagree, never liked OpenAI trying to claim a generic term as the name of their product.

report

reply

[ +- ]

8 points

8 months ago

Think it depends the language, in french gpt is very very close to “j’ai pété” which means “I farted”. But, yeah agree that Claude ain’t much better name

report

reply

[ +- ]

Something Burger 🍔@jlai.lu

2 points

8 months ago

https://chat-jai-pete.fr/

report

reply

[ +- ]

Noodle07@lemmy.world

-2 points

8 months ago

True, but Claude is like my grandfather name…

report

reply

[ +- ]

0 points

8 months ago

*

Exactly, neither fit right for an AI

report

reply

Technology

!technology@lemmy.world

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

@L4s@lemmy.world
@autotldr@lemmings.world
@PipedLinkBot@feddit.rocks
@wikibot@lemmy.world

Community stats

18K
Monthly active users
12K
Posts
553K
Comments

Community moderators

L3s@lemmy.world
L3s@fry.gs
L4sBot@fry.gsB
L4sBot@lemmy.worldB
enu@lemmy.world

modlog legal instances join-lemmy.org

lemmy-ui-next v0.11.0 (github)lemmy v0.19.5 (github)