LLMs still don't understand the word "no", much like their creators (www.quantamagazine.org)

posted 5 months ago

David Gerard@awful.systems

techtakes@awful.systems

31 comments

BradleyUffner@lemmy.world

53 points

5 months ago

LLMs don’t understand any words.

flere-imsaho@awful.systems

17 points

5 months ago

yes. and you wouldn’t believe¹ what’s in the replies when you make this simple and obvious statement.

¹ who am i kidding. of course you know.

MojoMcJojo@lemmy.world

4 points

5 months ago

I both agree and disagree. I think of them as golems. They do understand how to respond, but that’s as deep as it goes. It’s simulated understanding, but a very very good simulation… Okay maybe I do agree.

BradleyUffner@lemmy.world

19 points

5 months ago

I think that at best you could say that they understand the relationship between tokens. But even that requires a really generous definition of the word “understand”.

Jimmyeatsausage@lemmy.world

11 points

5 months ago

There’s a saying… “Knowledge is knowing a tomato is a fruit. Wisdom is knowing not to put it in fruit salad.”

Meanwhile, LLMs are telling us to put glue on pizza so the cheese sticks. Even if the technology could eventually deliver on the promise, by the time we get there, nobody intelligent will trust it because the tech bros are, again, throwing half-baked garbage out into the world to try and be first to market.

froztbyte@awful.systems

38 points

5 months ago

it’s almost like this thing has no internal conceptual representation! I know this can’t possibly be, millions of promptfans and promptfondlers have told me it can’t be so, but it sure does look that way! wild!

Kogasa@programming.dev

-4 points

5 months ago

It must have some internal models of some things, or else it wouldn’t be possible to consistently make coherent and mostly reasonable statements. But the fact that it has a reasonable model of things like grammar and conversation doesn’t imply that it has a good model of literally anything else, which is unlike a human for whom a basic set of cognitive skills is presumably transferable. Still, the success of LLMs in their actual language-modeling objective is a promising indication that it’s feasible for an ML model to learn complex abstractions.

sc_griffith@awful.systems

26 points

5 months ago

if I copy a coherent sentence into my clipboard, my clipboard becomes capable of consistently making coherent statements

Kogasa@programming.dev

-5 points

5 months ago

Yes, but that’s not how LLMs work. My statement depends heavily on the fact that an LLM like GPT is coaxed into coherence by unsupervised or semi-supervised training. That the training process works is the evidence of an internal model (of language/related concepts), not just the fact that something outputs coherent statements.

flere-imsaho@awful.systems

16 points

5 months ago

it doesn’t. that’s why we’re calling it “spicy autocompletion”.
