6 points

So I guess there are two sources of training data: companies selling it explicitly, and companies just scraping publicly accessible data. Not that either is “good”, but at least with public data, only the AI company is profiting.

6 points

Yep. That’s why two of the three things I say Automattic MUST do to make things right are about proper consent controls for Automattic’s use of data and its sale to AI vendors, while the third is a proposed proactive defense against scrapers.

5 points

Making the web un-scrapable to prevent AI is a terrible idea that won’t even work. You’re talking about DRM against the user’s browser… to read publicly-available text… as if the LLM genie can get shoved back in its bottle.

2 points

No? That’s not what NightShade is. NightShade isn’t DRM.

https://nightshade.cs.uchicago.edu/whatis.html


Furry Technologists

!tech@pawb.social

Science, Technology, and pawbs
