You are viewing a single thread.
View all comments View context
7 points

You realize that if cases like this are won then only the “giant fucking corporations” are going to be able to afford the datasets to train AI with?

permalink
report
parent
reply
9 points

Harvesting the dataset isn’t the problem. Using copyrighted work in a paid product is the problem. Individuals could still train their own models for personal use

permalink
report
parent
reply
8 points
*

I don’t think you’re familiar with the sort of resources necessary to train a useful LLM up from scratch. Individuals won’t have access to that for personal use.

permalink
report
parent
reply
3 points

I’m not familiar with the exact amount of resources, but I know it takes a lot. My point was about what specifically is in contention here.

Also, you were the one pointing out that this case could entrench “giant fucking corporations” in the space. But if they’re the only ones who can afford the resources to train them, then this case won’t have an effect on that entrenchment

permalink
report
parent
reply
2 points

So we shouldn’t do anything about it, and just let big corps scoop up all the data they want, regardless of ownership?

permalink
report
parent
reply

Technology

!technology@beehaw.org

Create post

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:


This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

Community stats

  • 2.7K

    Monthly active users

  • 3.4K

    Posts

  • 81K

    Comments