64 points

AI models are often multiple gigabytes. Tbh, the size is a good sign that it’s not “AI” marketing bullshit (less of a risk with open source projects anyway). I’m pretty wary of “AI” audio software that’s only a few megabytes.

1 point

Why are they that big? Is it more than code? How could you get to gigabytes of code?

10 points

It’s basically a huge graph/flowchart.

-6 points

It’s really nothing of the sort.

6 points

The current wave of AI is built around Large Language Models, or LLMs. A model is basically a metric fuckton of numbers, produced by running a load of input data through it in different ways. Given that the input is often things like text, pictures or audio that have been distilled down into numbers, you can imagine we’re talking about a lot of data.

(This is massively simplified, by someone who doesn’t entirely understand it themselves)

7 points

They’re composed of many big matrices, whose size scales quadratically with their dimension. A 32x32 matrix is 4x the size of a 16x16 matrix.
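
To make the quadratic scaling concrete, here is a rough Python sketch. The helper name `matrix_bytes` and the 2 bytes per value (16-bit numbers) are assumptions for illustration, not something a particular model format dictates.

```python
def matrix_bytes(n: int, bytes_per_value: int = 2) -> int:
    """Storage needed for an n x n matrix (assumes 2 bytes per value)."""
    return n * n * bytes_per_value


small = matrix_bytes(16)  # 16 * 16 * 2 bytes
large = matrix_bytes(32)  # 32 * 32 * 2 bytes

# Doubling the side length quadruples the storage.
ratio = large // small
print(small, large, ratio)
```

Doubling the dimension again (64x64) would quadruple the size once more, which is how a few hundred layers of wide matrices end up in the gigabytes.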

51 points

Currently, AI means Artificial Neural Network (ANN). That’s only one specific approach. What an ANN boils down to is one huge system of equations.

The file stores the parameters of these equations. It’s what’s called a matrix in math. A parameter is simply a number by which something is multiplied. Colloquially, such a file of parameters is called an AI model.

2 GB is probably an AI model with 1 billion parameters at 16-bit precision: 16 bits is 2 bytes per parameter, so 1 billion parameters take about 2 GB. Precision is how many digits you have. The more digits you have, the more precisely you can give a value.
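
The arithmetic behind that estimate, as a small Python sketch. The function name `model_size_gb` is made up for this example, and it counts 1 GB as 10^9 bytes and ignores any file overhead, so treat the results as ballpark figures.

```python
def model_size_gb(num_parameters: int, bits_per_parameter: int) -> float:
    """Approximate file size in GB (10**9 bytes), ignoring format overhead."""
    total_bytes = num_parameters * bits_per_parameter // 8
    return total_bytes / 1e9


# 1 billion parameters at 16 bits (2 bytes) each
size_16_bit = model_size_gb(1_000_000_000, 16)

# The same model at 32 bits per parameter is twice as large
size_32_bit = model_size_gb(1_000_000_000, 32)

print(size_16_bit, size_32_bit)
```

The same formula explains why lower-precision versions of a model are smaller: fewer bits per parameter, same number of parameters.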

When people talk about training an AI, they mean finding the right parameters, so that the equations compute the right thing. The bigger the model, the smarter it can be.
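
As a toy illustration of "finding the right parameters", here is a sketch that trains a single parameter `w` so that `w * x` matches some example data. The function `train`, the data and the learning rate are all assumptions for this example. Real models do the same kind of thing with billions of parameters and far more elaborate equations.

```python
def train(inputs, targets, steps=200, learning_rate=0.01):
    """Fit one parameter w so that w * x approximates the targets."""
    w = 0.0  # start with a wrong guess
    for _ in range(steps):
        # Gradient of the mean squared error with respect to w
        grad = sum(2 * (w * x - y) * x for x, y in zip(inputs, targets)) / len(inputs)
        # Nudge w in the direction that reduces the error
        w -= learning_rate * grad
    return w


xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0 * x for x in xs]  # the "right" parameter here is 3

w = train(xs, ys)
print(w)  # ends up very close to 3
```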

Does that answer the question? It’s probably missing a lot.

10 points

TensorFlow Lite models are tiny, but they’re potentially as much of an audio revolution as synthesizers were in the 70s. It’s hard to tell if that’s what we’re looking at here.
