I feel like the amount of training data required for these AIs serves as a pretty compelling argument as to why AI is clearly nowhere near human intelligence. It shouldn’t take thousands of human lifetimes of data to train an AI if it’s truly near human-level intelligence. In fact, I think it’s an argument for them not being intelligent whatsoever. With that much training data, everything that could be asked of them should be in the training data. And yet they still fail at any task not in their data.
Put simply; a human needs less than 1 lifetime of training data to be more intelligent than AI. If it hasn’t already solved it, I don’t think throwing more training data/compute at the problem will solve this.
There is no “intelligence”, ai is a pr word. Just a language model that feeds on a lot of data.
Oh yeah we’re 100% agreed on that. I’m thinking of the AI evangelicals who will argue tooth and nail that LLMs have “emergent properties” of intelligence, and that it’s simply an issue of training data/compute power before we’ll get some digital god being. Unfortunately these people exist, and they’re depressingly common. They’ve definitely reduced in numbers since AI hype has died down though.
Definitely not the same thing. Just because you can make use of the end result of major efforts does not somehow magically give you access to all the knowledge from those major efforts.
You can use a smart phone easily, but that doesn’t mean you magically know how to make one.
You’ve had the entire history of evolution to get the instinct you have today.
Nature Vs Nurture is a huge ongoing debate.
Just because it takes longer to train doesn’t mean it’s not intelligent, kids develop slower than chimps.
Also intelligent doesn’t really mean anything, I personally think Intelligence is the ability to distillate unusable amounts of raw data and intuit a result beneficial to one’s self. But very few people agree with me.
I see intelligence as filling areas of concept space within an econiche in a way that proves functional for actions within that space. I think we are discovering more that “nature” has little commitment, and is just optimizing preparedness for expected levels of entropy within the functional eco-niche.
Most people haven’t even started paying attention to distributed systems building shared enactive models, but they are already capable of things that should be considered groundbreaking considering the time and finances of development.
That being said, localized narrow generative models are just building large individual models of predictive process that doesn’t by default actively update information.
People who attack AI for just being prediction machines really need to look into predictive processing, or learn how much we organics just guess and confabulate ontop of vestigial social priors.
But no, corpos are using it so computer bad human good, even though the main issue here is the humans that have unlimited power and are encouraged into bad actions due to flawed social posturing systems and the confabulating of wealth with competency.
A human lifetime worth of video is not anywhere close to equalling a human lifetime of actual corporeal existence, even in the perfect scenario where the AI is as capable as a human brain.
Strange to equate the other senses to performance in intellectual tasks but sure. Do you think feeding data from smells, touch, taste, etc. into an AI along with the video will suddenly make it intelligent? No, it will just make it more likely to guess what something smells like. I think it’s very clear that our current approach to AI is missing something much more fundamental to thought than that, it’s not just a dataset problem.
Humans don’t live that long. That’s only about 1.5 million 30 min videos, which isn’t a huge amount for a whole day’s worth of scraping.
I would be lucky if I get to watch more than 10000 videos in my entire lifetime.
Properly following licensing, right?
No, see, because it’s “learning like a human”, and everybody knows that you’re allowed to bypass any licensing for learning. /s
But seriously I don’t know how they make the jump to these conclusions either.
This is a massive strawman argument. No one is saying you shouldn’t have a license to view the content in order to train an AI on it. Most of the information used to train these models is publicly available and licensed for public viewing.
Just because something is available for public viewing does not mean it’s licensed for anything except personal use.
The strawman here is that since physical people benefit from personal use exceptions in the law, machine learning software should too. But why should they? Since when is a piece of software ran by a corporation equivalent to an individual person?
instead of focusing on their products and improving them for everyone, some shitty ceo is pushing their shitty ai agenda down everyones throat.
Nvidia’s biggest product is absolutely AI by a massive landslide, I’m pretty sure I read that the point of them downloading these videos and doing the training is to build a pipeline for their AI users to do the same with their own shit. (Can’t be bothered to double-check cuz I really don’t care)
So they aren’t downloading all this video to make a crazy AI model. They’re downloading all this video to make a tool for their AI customers to use, you may not agree but improving their product is exactly what they’re doing.
Can’t be bothered to double-check cuz I really don’t care
For FUCK SAKE, why do you even bother posting your garbage opinions then? and with such authority too!
So they use VMs to simulate user accounts, in future this will be blocked and whatever new AI startup is there won’t have the option to do so. Competition blocked. Forever.