lemm.ee

Local All Communities Log in Sign up

Local All Communities

275

The world's most-powerful AI model suddenly got 'lazier' and 'dumber.' A radical redesign of OpenAI's GPT-4 could be behind the decline in performance.(www.businessinsider.com)

posted 1 year ago

by

L4sBot@lemmy.worldMB

in

technology@lemmy.world

Users of OpenAI’s GPT-4 are complaining that the AI model is performing worse lately. Industry insiders say a redesign of GPT-4 could be to blame.

Sort:

Hot Top Controversial New Old

You are viewing a single thread.

View all comments

[ +- ]

nbailey@lemmy.ca

86 points

1 year ago

*

The model has become inbred because it’s now impossible to scrape the web without AI content getting ingested, which is full of “hallucinations” and other weird artifacts. The last opportunity to get “uncontaminated” training data was sometime in mid 2022.

Not to say that it’s causing this particular problem, but this issue will emerge eventually. Garbage in = garbage out. Eventually GPT-19 will grow a mighty Habsburg chin.

report

reply

[ +- ]

TheGod@lemmy.world

21 points

1 year ago

Removed by mod

report

reply

[ +- ]

Chaotic Entropy@feddit.uk

15 points

1 year ago

All the articles with very specific titles, but then incredibly generic content, piss me off to no end.

report

reply

[ +- ]

NotYourSocialWorker@feddit.nu

6 points

1 year ago

Part of the reason why debugging windows is such a pain. Another part is the so called experts in the forums.

report

reply

[ +- ]

Chaotic Entropy@feddit.uk

7 points

1 year ago

Make sure your drivers are up to date! Another job well done.

report

reply

Show more comments

[ +- ]

Ultraviolet@lemmy.world

6 points

1 year ago

Also the articles that are plagiarized but run through a thesaurus bot to bypass search engine penalties for being plagiarized, often to the point of incomprehensibility. Yes, I’d love to read an article about my favorite vagabondlike, Deceased Cells.

report

reply

[ +- ]

jantin@lemmy.world

28 points

1 year ago

*

Maybe not yet, but…

Spez will turn Reddit into a bot farm and sell this as training data
Musk turns Twitter into a bigoted cesspool and will sell this as training data, which will subsequently be flagged for low quality (also: a botfarm)
Threads is a corporate ad dashboard (and we already know how easy it is to GPT copy) and Zuck will sell this as training data
Facebook is either dead or only good for boomers and Poles
blogs are dead
Fediverse is out there waiting to be scraped but possibly too small to sustain a big model

We’te getting there, hopefully.

report

reply

[ +- ]

cyberpunk007@lemmy.world

7 points

1 year ago

Scrapped?.. Or scraped?

report

reply

[ +- ]

jantin@lemmy.world

8 points

1 year ago

absolutely scraped, fixed

report

reply

[ +- ]

damnYouSun@sh.itjust.works

10 points

1 year ago

Also We'te, which I believe is a Klingon name.

report

reply

Show more comments

[ +- ]

cybersandwich@lemmy.world

3 points

1 year ago

…is Facebook popular with Polish people? Or was this a weird polish joke I don’t get?

report

reply

[ +- ]

theIdeaOfNorth@szmer.info

1 point

1 year ago

Very. Twitter never took off among general population (only politicians, journalists, botfarms and people who troll politicians and journalists), tiktok is for kids, Instagram is popular but again, rather among influencers and people who need to show off pictures not as a default SM app. I don’t really know where did Americans and west Europeans move from Facebook.

report

reply

[ +- ]

TitanLaGrange@lemmy.world

4 points

1 year ago

I suspect future models are going to have to put some more focus on learning using techniques more like what humans use, and on cognition.

Like, compared to a human these language models need very large quantities of text input. When humans are first learning language they get lots of visual input along with language input, and can test their understanding with trial-and-error feedback from other intelligent actors. I wonder if perhaps those factors greatly increase the rate at which understanding develops.

Also, humans tend to cogitate on inputs while ingesting them during learning. So if the information in new inputs disagrees with current understanding, those inputs are less likely to affect current understanding (there’s a whole ‘how to change your mind’ thing here that is necessary for people to use, but if we’re training a model on curated data that’s probably less important for early model training).

I don’t know details of how model training works, but it would be interesting to know if anyone is using a progressive learning technique where the model that is being trained is used to judge new training data before it is used as a training input to update the model’s weights. That would be kind of like how children learn by starting with very simple words and syntax and building up conceptual understanding gradually. I’d assume so, since it’s an obvious idea, but I haven’t heard about it.

report

reply

[ +- ]

TitanLaGrange@lemmy.world

2 points

1 year ago

For fun I asked ChatGPT about that progressive learning approach, and it seems to like the idea.

I wish I had more time to undertake some experiments in model training, this seems like it would be a really fun research direction.

Sorry for the ‘wall of AI text’:

The idea you’re describing seems to be a form of curriculum learning. In curriculum learning, models are trained in a progressive manner, starting with simple concepts (or in this case, simpler text data) and gradually moving to more complex ones. This approach is motivated by the way human learning often works, where we learn easier topics before moving on to harder ones.

The strategy you’ve suggested, where the model itself determines the complexity or understandability of the next round of training inputs, is interesting. While the traditional approach to curriculum learning is to manually design the learning progression based on human knowledge and intuition, the approach you’re suggesting is more dynamic and autonomous, letting the model guide its own learning process.

As of my last update in September 2021, I can’t confirm any specific projects or papers that have employed exactly this strategy in the domain of large language models. However, there are some related works in the field. For example, some reinforcement learning and computer vision projects have used a similar idea where models adaptively choose their next training samples based on their current understanding. This has been referred to as active learning, and is a promising avenue for improving the efficiency of model training.

However, there would be several technical challenges in applying this approach to large language models. For one, there would be the challenge of how to assess the “understandability” of a text sample in a reliable and useful way. This would likely require significant work to define and might also have to involve some form of reinforcement learning or other feedback mechanisms. Nonetheless, it’s a fascinating idea and could potentially be an interesting direction for future research in machine learning.

report

reply

[ +- ]

minorninth@lemmy.world

4 points

1 year ago

That hasn’t happened yet. Most likely they quantized GPT-4 more. It’s still based on the same training data.

report

reply

[ +- ]

AccidentalLemming@lemmy.world

5 points

1 year ago

*

Deleted by creator

report

reply

[ +- ]

RIotingPacifist@lemmy.world

6 points

1 year ago

Nah GPT makes it a lot easier, it’s the thing it’s actually good at.

Before they were autogenerated with bad English, GPT can generate good English that is equally devoid of content

report

reply

Technology

!technology@lemmy.world

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

@L4s@lemmy.world
@autotldr@lemmings.world
@PipedLinkBot@feddit.rocks
@wikibot@lemmy.world

Community stats

17K
Monthly active users
12K
Posts
554K
Comments

Community moderators

L3s@lemmy.world
L3s@fry.gs
L4sBot@fry.gsB
L4sBot@lemmy.worldB
enu@lemmy.world

modlog legal instances join-lemmy.org

lemmy-ui-next v0.11.0 (github)lemmy v0.19.5 (github)