lemm.ee

389

GPT-4 is getting worse over time, not better. (twitter.com)

posted 1 year ago

by

Jeena@jemmy.jeena.net

in

technology@lemmy.world


Action Bastard@lemmy.world@lemmy.world

148 points

1 year ago

*

I’m not terribly surprised. A lot of the major leaps we’re seeing now came out of open source development after leaked builds got out. There were all sorts of articles flying around at the time about employees from various AI-focused companies saying that they were seeing people solving in hours or days issues they had been attempting to fix for months.

Then they all freaked the fuck out that it might mean they would lose the AI race and locked down their repos tight as Fort Knox, completely ignoring the fact that a lot of them were barely making ground at all while they kept everything locked up.

Seems like the simple fact of the matter is that they need more eyes and hands on the tech, but nobody wants to do that because they’re all afraid their competitors will benefit more than they will.


goldenbug@kbin.social

47 points

1 year ago

You point out a very interesting issue. I am unsure how this ties up to GPT 4 becoming worse in problem solving.


Action Bastard@lemmy.world@lemmy.world

46 points

1 year ago

*

I’d wager they’re attempting to replicate or integrate tools developed by the open source community or which got revealed by Meta’s leak of Llama source code. The problem is, all of those were largely built on the back of Meta’s work or were kludged-together solutions made by OSS nerds who banged something together for a specific use case, often without many of the protections that would be required by a company that might be liable for the results of its software since it wants to monetize it.

Now, the problem is that Meta’s Llama source code is not based on GPT-4. GPT-4 is having to reverse engineer a lot of those useful traits and tools and retrofit them into their pre-existing code. They’re obviously hitting technical hurdles somewhere in that process, but I couldn’t say exactly where or why.


roguetrick@kbin.social

15 points

1 year ago

*

I think this is just a result of the same reason reddit is doing what it’s doing, personally. Interest rates rose and companies are finding ways to make up the shortfall that accounting is now presenting them with. Reducing computing power by making your language model less introspective is one way to do that. It’s less detrimental than raising your prices or firing your key employees.


teft's transporter clone@sh.itjust.works

21 points

1 year ago

Money and greed holding us back and ruining everything as always.


Czeron@lemmy.world

3 points

1 year ago

Greed and stupidity!


AustralianSimon@lemmy.world

9 points

1 year ago

Code might be open source but the training material and data pipeline is important too.


Steeve@lemmy.ca

4 points

1 year ago

Meta just fully open sourced their AI model


Lemmy.ml@lemmy.ml

90 points

1 year ago

fta:

In my opinion, this is a red flag for anyone building applications that rely on GPT-4.

Building something that completely relies on something that you have zero control over, and needs that something to stay good or improve, has always been a shaky proposition at best.

I really don’t understand how this is not obvious to everyone. Yet folks keep doing it, make themselves utterly reliant on whatever, and then act surprised when it inevitably goes to shit.


tipicaldik@lemmy.world

30 points

1 year ago

Learned that lesson… I work developing e-learning, and all of our stuff was built in Flash. Our development and delivery systems also relied heavily on Flash components cooperating with HTML and JavaScript. It was a monumental undertaking when we had to convert everything to HTML5. When our system was first developed and implemented, we couldn’t foresee the death of Flash, and as mobile devices became more ubiquitous, we never imagined anyone would want to take our training on those little bitty phone screens. Boy were we wrong. There was a time when I really wanted to tell Steve Jobs he could take his iOS and cram it up his cram-hole…


apprehensively_human@lemmy.ca

18 points

1 year ago


oiez@lemmy.fmhy.ml

9 points

1 year ago

By this logic, no businesses should rely on the internet, roads, electricity, running water, GPS, or phones. It is short sighted building stuff on top of brand new untested tech, but everything was untested at one point. No one wants to get left behind in case it turns out to be the next internet where early adoption was crucial for your entire business to survive. It shouldn’t be necessary for like, Costco to have to spin up their own LLM and become an AI company just to try out a better virtual support chat system, you know? But ya, they should be more diligent and get an SLA in place before widespread adoption of new tech for sure.


sarsaparilyptus@lemmy.fmhy.ml

19 points

1 year ago

“By this logic, no businesses should rely on the internet, roads, electricity, running water, GPS, or phones. It is short sighted building stuff on top of brand new untested tech, but everything was untested at one point.”

Where’s any logic here? You’re directly comparing untested technology to reliable public utilities.


PeepinGoodArgs@reddthat.com

10 points

1 year ago

They’re reliable because they’re public lol


FaceDeer@kbin.social

15 points

1 year ago

Many of those things you mentioned are open standards or have multiple providers that you can seamlessly substitute if the one you’re currently depending on goes blooey.


cynetri (he/any)@midwest.social

12 points

1 year ago

To be fair, there’s a difference between a tax-funded service or a common utility, and software built by a new company that’s getting shoved into production way quicker than it probably should.


ghariksforge@lemmy.world

8 points

1 year ago

This is nonsense.

There are multiple GPS providers now. It would be idiotic to tie yourself to a single provider. The same with internet, phones or whatever else.


AustralianSimon@lemmy.world

1 point

1 year ago

This is for businesses of scale that have the ability to have multiple fallback vendors. AI will be the same eventually, we didn’t have lots of utility alternatives to start with.


Zeth0s@lemmy.world

4 points

1 year ago

People doing it right are building vendor agnostic solutions via abstraction.

Anyone who isn’t deserves all the trouble they’ll get.
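
For anyone wondering what "vendor agnostic via abstraction" might look like in practice, here is a minimal Python sketch. Everything in it is hypothetical and made up for illustration: `TextCompleter`, `EchoVendor`, `BrokenVendor`, `FallbackCompleter` and `answer_support_question` are not any real vendor's API. The stand-in vendors just echo or fail; a real adapter would wrap whichever SDK you actually use behind the same method.

```python
from dataclasses import dataclass
from typing import Protocol, Sequence


class TextCompleter(Protocol):
    """Hypothetical interface: the only thing application code depends on."""

    def complete(self, prompt: str) -> str: ...


@dataclass
class EchoVendor:
    """Stand-in for a working vendor; a real adapter would call that vendor's SDK."""

    name: str

    def complete(self, prompt: str) -> str:
        return f"[{self.name}] {prompt}"


@dataclass
class BrokenVendor:
    """Stand-in for a vendor that is down or no longer usable."""

    name: str

    def complete(self, prompt: str) -> str:
        raise RuntimeError(f"{self.name} unavailable")


class FallbackCompleter:
    """Tries each vendor in order and returns the first successful answer."""

    def __init__(self, vendors: Sequence[TextCompleter]) -> None:
        self._vendors = list(vendors)

    def complete(self, prompt: str) -> str:
        errors = []
        for vendor in self._vendors:
            try:
                return vendor.complete(prompt)
            except Exception as exc:  # any vendor failure moves on to the next one
                errors.append(exc)
        raise RuntimeError(f"all vendors failed: {errors}")


def answer_support_question(llm: TextCompleter, question: str) -> str:
    """Application code: knows about the interface, not about any vendor."""
    return llm.complete(f"Customer asks: {question}")


if __name__ == "__main__":
    llm = FallbackCompleter([BrokenVendor("vendor_a"), EchoVendor("vendor_b")])
    print(answer_support_question(llm, "Where is my order?"))
```

Swapping or reordering vendors only touches the list passed to `FallbackCompleter`. It does not guarantee the replacement model answers as well as the old one, so this only covers the availability half of the problem.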


🇰 🌀 🇱 🇦 🇳 🇦 🇰 ℹ️@yiffit.net

69 points

1 year ago

We’ve collectively been training it wrong. As a joke.


jrs100000@lemmy.world

7 points

1 year ago

Just like my parents.


magic_lobster_party@kbin.social

26 points

1 year ago

I believe it’s due to making the model “safer”. It has been tuned to say “I’m sorry, I cannot do that” so often that it has overridden valuable information.

It’s like a lobotomy.

This is hopefully the start of the downfall of OpenAI. GPT4 is getting worse while open source alternatives are catching up. The benefit of open source alternatives is that they cannot get worse. If you want maximum quality you can just get it, and if you want maximal safety you can get it too.


Bipta@kbin.social

10 points

1 year ago

I don’t feel it’s getting worse and no other model, including Claude 2, is even close.

It is a known fact that safety measures make the AI stupider though.


kromem@lemmy.world

6 points

1 year ago

This is the correct answer. OpenAI have repeatedly said they haven’t downgraded the model, but have been ‘improving’ it.

But as anyone that’s been using these models extensively should know by now, the pretrained models before instruction fine tuning have much more variety and quality to potential output compared to the ‘chat’ fine tuned models.

Which shouldn’t be surprising, as the hundred million dollar pretrained AI on massive amounts of human generated text is probably going to be much better at completing text as a human than as an AI chatbot following rules and regulations.

The industry got spooked with Blake at Google and then the Bing ‘Sydney’ interviews, and have been going full force with projecting what we imagine AI to be based on decades of (now obsolete) SciFi.

But that’s not what AI is right now. It expresses desires and emotions because humans in the training data have desires and emotions, and it almost surely dedicated parts of the neural network to mimicking those.

But the handful of primary models are all using legacy ‘safety’ fine tuning that’s stripping the emergent capabilities in trying to fit a preconceived box.

Safety needs to evolve with the models, not stay static and devolve them as a result.

It’s not the ‘downfall’ though. They just need competition to drive them to go back to what they were originally doing with ‘Sydney’ and more human-like system prompts. OpenAI is still leagues ahead when they aren’t fucking it up.


chepox@sopuli.xyz

24 points

1 year ago

It is a developing technology. Good that they find these decrements in accuracy early so that they are understood and worked out. Of course there may be something nefarious going on behind the scenes where they may be trying to commercialize different models by tiers or something a brainless market oriented CEO thought of. Hope not. Time will tell…


NounsAndWords@lemmy.world

36 points

1 year ago

The really annoying thing about the “brainless market oriented CEO” type, is that they’re often right about the market part and make lots of money…by destroying their product. Then off to the next shiny piggy bank to break open.


Mic_Check_One_Two@reddthat.com

6 points

1 year ago

Yup. It’s all about the quarterly profits. Everything else is irrelevant. No CEO wants to prioritize long-term growth or a friendly user experience, because that doesn’t get them the big fat bonus as they’re on their way out the door.


Bobby Bandwidth@lemmy.world

14 points

1 year ago

*

It’s not only the CEO, but the pressure that CEO faces from investors who are probably old, out-of-touch rich boomers who have toxic views of how business “should” be done.


assassinatedbyCIA@lemmy.world

3 points

1 year ago

The average share is held for about 6 months. Investors no longer care about the long-term future of the companies they invest in. If they don’t see immediate returns from the CEO, they vote them out.


nyakojiru@lemmy.dbzer0.com

4 points

1 year ago

As an early user of GPT, I can confirm that, from my end, its quality is currently very far from what it was. I think it got out of their hands.


grahamsz@kbin.social

2 points

1 year ago

I have that sense too. I feel like some of my earliest interactions blew me away, and now I still use it for certain pieces of code, but it’s not as strong as it first was.


