You realize that if cases like this are won then only the “giant fucking corporations” are going to be able to afford the datasets to train AI with?
Harvesting the dataset isn’t the problem. Using copyrighted work in a paid product is the problem. Individuals could still train their own models for personal use
I don’t think you’re familiar with the sort of resources necessary to train a useful LLM up from scratch. Individuals won’t have access to that for personal use.
I’m not familiar with the exact amount of resources, but I know it takes a lot. My point was about what specifically is in contention here.
Also, you were the one pointing out that this case could entrench “giant fucking corporations” in the space. But if they’re the only ones who can afford the resources to train them, then this case won’t have an effect on that entrenchment