Over just a few months, ChatGPT went from correctly answering a simple math problem 98% of the time to just 2%, study finds(finance.yahoo.com)

posted 1 year ago

leninmummy@lemmy.ml

technology@lemmy.ml

71 commentshide report

Sort:

Hot Top Controversial New Old

[ - ]

Sagrotan@lemmy.world

5 points

1 year ago

It learns to be more human. More human than human, that’s our motto here at Tyrell.

permalink

report

[ - ]

Fisk400@lemmy.world

-2 points

1 year ago

Stop making a language model do math? We have already have calculators.

permalink

report

[ - ]

ThreeHalflings@sh.itjust.works

5 points

1 year ago

Do you think maybe it’s a simple and interesring way of discussing changes in the inner workings of the model, and that maybe people know that we already have calculators?

permalink

report

parent

[ - ]

Fisk400@lemmy.world

8 points

1 year ago

I think it’s a lazy way of doing it. OpenAI has clearly stated that math isn’t something that they are even trying to make it good at. It’s like testing how fast Usain bolt is by having him bake a cake.

If chatgpt is getting worse at math it might just be a side effect of them making it better at reading comprehension or something they want it to be good at there is no way to know that.

Measure something it is supposed to be good at.

permalink

report

parent

[ - ]

ThreeHalflings@sh.itjust.works

3 points

1 year ago

All the things it’s supported to be good at are completely subjectively judged.

That’s why, u less you have a panel of experts in your back pocket, you need something with a yes or no answer to have an interesting discussion.

If people were discussing ChatGPT’s code writing ability, you’d complain that it wasn’t designed to do that either. The problem is that it was designed to transform inputs tk relatively beliveable outputs, representative of its training set. Great. That’s not super useful. It’s actual utility comes from its emergent behaviours.

Lemme know when you make a post detailing the opinions of some university “Transform inputs to outputs” professors. Until then, well ocmrinue to discuss its behaviour in observable, verifiable and useful areas.

report

[ - ]

3 points

1 year ago

Nah, asking it to do math is perfect. People are looking for emergent qualities and things it can do that they never expected it to be able to do. The fact that it could do somewhat successful math before despite not being a calculator was fascinating, and the fact that it can’t now is interesting.

Let the devs worry about how good it is at what it is supposed to do. I want to hear about stuff like this.

permalink

report

parent

[ - ]

atomdmac@lemmy.world

1 point

1 year ago

Has it gotten better at other stuff? Are you posing a possible scenario or asserting a fact? Would be curious about specific measurements if the later.

permalink

report

parent

Show more comments

[ - ]

Send_me_nude_girls@feddit.de

44 points

1 year ago

Must be because of all the censoring. The more they try to prevent DAN jailbreaking and controversial replies, the worse it got.

permalink

report

[ - ]

condenser@lemdro.id

6 points

1 year ago

Deleted by creator

permalink

report

parent

[ - ]

Scooter411@lemmy.ml

2 points

1 year ago

It’s also terrible at 20 questions.

permalink

report

[ - ]

SokathHisEyesOpen@lemmy.ml

2 points

1 year ago

Is it really? It seems like it would be excellent at that. I have a little hand held device from the 1990’s that can play 20 questions and is almost always right. It seems that if that little device can win, ChatGPT most certainly should be able to.

Edit: I just played and it guessed what I was thinking of in 13 questions. But then it kept asking questions. I asked why it was asking questions still since it already guessed it and it said “oh, you are absolutely correct, I did guess it correctly!”. Lol, ChatGPT is funny sometimes.

permalink

report

parent

[ - ]

Scooter411@lemmy.ml

2 points

1 year ago

It always asks me if it’s sporting equipment, and when I say no, it asks me if it’s sporting equipment for inside or outside - I then have to remind it that it’s not sporting equipment and that’s not a yes or no question.

permalink

report

parent

[ - ]

Reddit_was_fun@lemmy.world

9 points

1 year ago

Guess they shouldn’t have trained it on Common Core… /s

I will see myself out.

permalink

report

Technology

!technology@lemmy.ml

Create post

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.

Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.

Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

Community stats

4K
Monthly active users
2.7K
Posts
44K
Comments

Community moderators

MinutePhrase@lemmy.ml