58 points
*

[Edit: indeed, its actually good that it’s 2gb]

2gb plugin??!

Btw, does it work with tenacity?

permalink
report
reply
64 points

AI models are often multiple gigabytes, tbh it’s a good sign that it’s not “AI” marketing bullshit (less of a risk with open source projects anyway). I’m pretty wary of “AI” audio software that’s only a few megabytes.

permalink
report
parent
reply
10 points

Tensorflowlite models are tiny, but they’re potentially as much an audio revolution as synthetizer were in the 70s. It’s hard to tell if that’s what we’re looking at here.

permalink
report
parent
reply
1 point

Why are they that big? Is it more than code? How could you get to gigabytes of code?

permalink
report
parent
reply
51 points

Currently, AI means Artificial Neural Network (ANN). That’s only one specific approach. What ANN boils down to is one huge system of equations.

The file stores the parameters of these equations. It’s what’s called a matrix in math. A parameter is simply a number by which something is multiplied. Colloquially, such a file of parameters is called an AI model.

2 GB is probably an AI model with 1 billion parameters with 16 bit precision. Precision is how many digits you have. The more digits you have, the more precise you can give a value.

When people talk about training an AI, they mean finding the right parameters, so that the equations compute the right thing. The bigger the model, the smarter it can be.

Does that answer the question? It’s probably missing a lot.

permalink
report
parent
reply
10 points
*

It’s basically a huge graph/flowchart.

permalink
report
parent
reply
7 points

They’re composed of many big matrices, which scale quadratically in size. A 32x32 matrix is 4x the size of a 16x16 matrix.

permalink
report
parent
reply
6 points
*

The current wave of AI is around Large Language Models or LLMs. These are basically the result of a metric fuckton of calculation results generated from running a load of input data in, in different ways. Given these are often the result of things like text, pictures or audio that have been distilled down into numbers, you can imagine we’re talking a lot of data.

(This is massively simplified, by someone who doesn’t entirely understand it themselves)

permalink
report
parent
reply
34 points

It seems reasonable given it includes multiple AI models.

permalink
report
parent
reply
7 points

2gb is pretty normal for an AI model. I have some small LLM models on my PC and they’re about 7-10gb big. The big ones take up even more space.

permalink
report
parent
reply
-8 points

Isn’t tenacity a joke project made by 4channers

permalink
report
parent
reply
15 points

That fork is sneedacity, which is very dead.

permalink
report
parent
reply
3 points

Gotcha, thank you for the info. Gotta admit their made-up words are pretty funny

permalink
report
parent
reply
9 points

Tenacity is a Audacity fork without telemetry

permalink
report
parent
reply
15 points

Isn’t the telemetry in Audacity opt-in anyway?

permalink
report
parent
reply
51 points

I thought audacity was tarnished with spyware or something these days. Is it safe again?

permalink
report
reply
88 points

after looking into it:
it’s not and it never was.
a) it’s open source, so nobody’s putting that shit in there without getting caught
b) it had an opt-in error reporting feature that would send data back… that was the entire thing…

permalink
report
parent
reply
22 points

What? You must be joking. Really? The entire thing was about opt-in error reporting?

… seriously, that can’t be it, can it?

permalink
report
parent
reply
32 points
*

Not really that simple, it was an apparent change to the privacy policy that vaguely anticipated collection of arbitrary user data, which shook the confidence of the open source community on the project. The fact this happened right after audacity was sold was the cherry on top.

https://github.com/audacity/audacity/issues/1213

Changes were eventually reverted or revised.

permalink
report
parent
reply
10 points

yep… really just that…

i’ve used it forever with a very restrictive firewall and i’ve never seen it do anything unexpected… or any phoning home at all…

permalink
report
parent
reply
-7 points

in 2021 Audacity was acquired by a company called MuseGroup who added unnecessary telemetry and they admit that they do provide the data the collect to third parties. It’s spyware as far as I’m concerned.

permalink
report
parent
reply
6 points

Point a has always me me wonder, is that accurate? Are there actually people going through the code to make sure open source isn’t malicious? I can barely read my coworkers code… Let alone a strangers.

permalink
report
parent
reply
6 points

people are definitely going through the code on a project as popular as audacity…
less well known stuff is much less scrutinized, of course

permalink
report
parent
reply
3 points

Its way less work than going through the code to check for telemetry unless it is an intentionally hidden attack- just use Wireshark and check if there is network traffic other than checking for an update on program start.

permalink
report
parent
reply
2 points

If a project is popular people will make changes to it every day. But you can look at the repo and judge for yourself.

permalink
report
parent
reply
-7 points

That’s not entirely true, Audacity was acquired by a company called MuseGroup who added unnecessary telemetry and they admit that they do provide the data the collect to third parties. It’s spyware as far as I’m concerned.

permalink
report
parent
reply
5 points

i don’t believe you

permalink
report
parent
reply
74 points

It was a pull request to add opt-out analytics that got blown out of proportion, where the real issue was the EULA and how tonedeaf of a move it was considering the community around Audacity. IIRC, they ended up replacing it with opt-in analytics.

permalink
report
parent
reply
12 points

Not really, but there is a fork called tenacity which fixes this

permalink
report
parent
reply
40 points

I’ve been using the OpenVINO plugins for a few weeks and it’s genuinely impressive. Noise cancelling is one thing, but the transcription tool is amazing. I can create subtitles from conference recordings in minutes and create transcripts of recorded zoom calls, etc. and it does it for multiple languages.

That’s the kind of shit I like using AI for.

permalink
report
reply
4 points

music generation and remixing

any insight as to what this is?

permalink
report
parent
reply
34 points

The music separation and speech transcription plug-ins actually sound nice. Obviously that will depend on how reliable they actually are.

permalink
report
reply
14 points
*

I just tried the OpenVINO transcription on a random speech-over-music mp3 I happened to have: it works great, FAR better quality than I expected (I think I was expecting Youtube quality, but this is much cleaner and clearer). Perfect capitalization, good sentence breaks, adequate punctuation (commas, periods, question marks).

Only problem is that I can’t figure out how to copy the transcription so I can paste it outside Audacity: the transcriptions show up attached to specific portions of sound, like track labels. While it will save me the trouble of having to actually transcribe audio manually, to get them out of Audacity and into a word processor it looks like I may still be stuck copying each “label” individually unless I can find a way to copy or export them.

EDITED TO ADD: I just answered my own question, lol. File -> Export Other -> Export Labels -> .txt file

permalink
report
parent
reply
32 points

I read “Audacity ads” and thought for a moment they had gone to the dark side

permalink
report
reply
27 points

We already had a scare with them, but turns out it was very unfair overreaction to the project.

In this case I’m happy as long as it’s hardware platform independent and uses open source released models.

AI music art has been for a long time in the hands of industry moguls and us peasants have had nothing. So I’m happy with anything that puts this power in the hands of the everyman.

permalink
report
parent
reply
15 points
*

Was it unfair? I haven’t been following since they got bought out by spyware?

EDIT: Audacity was acquired by a company called MuseGroup in 2021 who added unnecessary telemetry and they admit that they do provide the data the collect to third parties. Some claim the changes were reverted but I haven’t confirmed that myself so until I see there is no telemetry it’s spyware as far as I’m concerned.

permalink
report
parent
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


Community stats

  • 17K

    Monthly active users

  • 12K

    Posts

  • 554K

    Comments