Comment by skibidi@lemmy.world on Amazon cloud boss echoes NVIDIA CEO on coding being dead in the water: "If you go forward 24 months from now, it's possible that most developers are not coding"

An inherent flaw in transformer architecture (what all LLMs use under the hood) is the quadratic memory cost to context. The model needs 4 times as much memory to remember its last 1000 output tokens as it needed to remember the last 500. When coding anything complex, the amount of code one has to consider quickly grows beyond these limits. At least, if you want it to work.

This is a fundamental flaw with transformer - based LLMs, an inherent limit on the complexity of task they can ‘understand’. It isn’t feasible to just keep throwing memory at the problem, a fundamental change in the underlying model structure is required. This is a subject of intense research, but nothing has emerged yet.

Transformers themselves were old hat and well studied long before these models broke into the mainstream with DallE and ChatGPT.

Our Rules

Follow the lemmy.world rules.

Only tech related content.

Be excellent to each another!

Mod approved content bots can post up to 10 articles per day.

Threads asking for personal tech support may be deleted.

Politics threads may be removed.

No memes allowed as posts, OK to post as comments.

Only approved bots from the list below, to ask if your bot can be added please contact us.

Check for duplicates before posting, duplicates may be removed

Amazon cloud boss echoes NVIDIA CEO on coding being dead in the water: "If you go forward 24 months from now, it's possible that most developers are not coding"(www.windowscentral.com)

Technology

!technology@lemmy.world

Our Rules

Approved Bots

Community stats

Community moderators