Need to let loose a primal scream without collecting footnotes first? Have a sneer percolating in your system but not enough time/energy to make a whole post about it? Go forth and be mid: Welcome to the Stubsack, your first port of call for learning fresh Awful you’ll near-instantly regret.
Any awful.systems sub may be subsneered in this subthread, techtakes or no.
If your sneer seems higher quality than you thought, feel free to cut’n’paste it into its own post — there’s no quota for posting and the bar really isn’t that high.
The post-Xitter web has spawned so many “esoteric” right wing freaks, but there’s no appropriate sneer-space for them. I’m talking redscare-ish, reality challenged “culture critics” who write about everything but understand nothing. I’m talking about reply-guys who make the same 6 tweets about the same 3 subjects. They’re inescapable at this point, yet I don’t see them mocked (as much as they should be)
Like, there was one dude a while back who insisted that women couldn’t be surgeons because they didn’t believe in the moon or in stars? I think each and every one of these guys is uniquely fucked up and if I can’t escape them, I would love to sneer at them.
(Semi-obligatory thanks to @dgerard for starting this)
has the era of active sabotage of the autoplag inputs begun? let’s hope so
It would be funny if someone was literally beating up servers with a wooden shoe.
Considering Glaze and Nightshade have been around for a while, and I talked about sabotaging scrapers back in July, arguably, it already has.
Hell, I ran across a much smaller scale case of this a couple days ago:
Not sure how effective it is, but if Elon’s stealing your data for his autoplag no matter what, you might as well try to force-feed it as much poison as you can.
I saw people say they would overlay their images with a 10%-opacity layer of the photo of Musk with Epstein’s accomplice (whose name I forgot for a second and am too lazy to look up). Would be nice if there was a tool to do so automatically. (Not that I post on twitter anymore.)
tbh that sounds like a pretty easy script to write! Too bad I am not near a computer rn
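It really is a short script. Here’s a minimal sketch using Pillow; the function name `overlay_poison` and the 10% default opacity are assumptions pulled from the comment above, not an existing tool:

```python
from PIL import Image

def overlay_poison(base_path, overlay_path, out_path, opacity=0.1):
    """Blend an overlay image onto a base image at low opacity.

    opacity=0.1 matches the "10% opaque layer" idea from the thread.
    The overlay is resized to match the base image's dimensions.
    """
    base = Image.open(base_path).convert("RGBA")
    overlay = Image.open(overlay_path).convert("RGBA").resize(base.size)
    # Image.blend computes base*(1-opacity) + overlay*opacity per channel.
    poisoned = Image.blend(base, overlay, opacity)
    poisoned.convert("RGB").save(out_path)
```

Point it at a folder of images before uploading and you’re done; whether it actually degrades training data is a separate question (see below).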
It’s almost completely ineffective, sorry. It’s certainly not as effective as exfiltrating weights via neighborly means.
On Glaze and Nightshade, my prior rant hasn’t yet been invalidated and there’s no upcoming mathematics which tilt the scales in favor of anti-training techniques. In general, scrapers for training sets are now augmented with alignment models, which test inputs to see how well the tags line up; your example might be rejected as insufficiently normal-cat-like.
I think that “force-feeding” is probably not the right metaphor. At scale, more effort goes into cleaning and tagging than into scraping; most of that “forced” input is destined to be discarded or retagged.
yeah this is the thing I’ve been thinking a lot about
fucking reCaptcha is literally mass-weaponising users for data filtration, and there is no good counter besides just not using reCaptcha (which is something one can’t easily pull off without things like regulatory action, massive reputational problems that make people gtfo, etc)
I have similar worries about cloudflare being such a massive chokepoint and using that position to enable “ai bot filter” services. feels extremely monopolistic, but ianal and I’m not entirely sure what the case grounds/structure on that would be (if any)
the only other viable strategy at the moment is fully breaking contact with any potential bad traffic systems, and that’s extremely fucking dire because that’s yet another nail in the coffin of the increasingly less open internet
I thought they were gonna do that themselves by feeding on their own outputs littered all over the www. Maybe they can use some help.
Forget Gladwell
All nonfiction writers can end up writing incorrect or controversial things, but why does every Gladwell book push half-formed and inaccurate theories? For years, my loose feeling about Gladwell was that he writes like someone who doesn’t care about being correct, which is not a way I would describe any other author I’ve encountered. There is something uniquely odd about his work.
The Bookseller: Penguin Random House underscores copyright protection in AI rebuff
Penguin Random House (PRH) has amended its copyright wording across all imprints globally, confirming it will appear “in imprint pages across our markets”. The new wording states: “No part of this book may be used or reproduced in any manner for the purpose of training artificial intelligence technologies or systems”, and will be included in all new titles and any backlist titles that are reprinted.
Now that the content mafia has realized GenAI isn’t gonna let them get rid of all the expensive and troublesome human talent, it’s time to give Big AI a wedgie.
Considering the massive(ly inflated) valuations running around Big AI and the massive amounts of stolen work that powers the likes of CrAIyon, ChatGPT, DALL-E and others, I suspect the content mafia is likely gonna try and squeeze every last red cent they can out of the AI industry.
At some point, something is going to reveal that all the money in AI has gone into power costs for datacenters and NVidia chips and that the AI companies themselves aren’t doing so hot. I hope it’s the discovery process for some of the inevitable lawsuits.
It’s weird how rarely I see people point this out, but in theory this kind of boilerplate should be technically meaningless. If copyright protections include the privilege to use the work for training a machine learning algorithm, you need explicit permission anyway. OTOH if it’s fair use or otherwise not something copyright law is concerned with, the copyright holder’s objection doesn’t matter.
For the record, I think AI models are derivative works and thus they’re not only infringing on typical “all rights reserved” works, but also things such as Free software whose license terms require attribution if used in derivative work, and especially share-alike copyleft licensed work.
I think it’s pretty well-known that Spotify got all its initial music from Oink. They moved fast, got dominant, and were able to present the record labels with a big audience prepared to pay for streaming music. The labels quickly ensured they’d get the lion’s share of that revenue.
OpenAI and friends tried the same thing - scrape everything, build AGI, reap the rewards. Except it didn’t work, and they’re in a much worse position morally. Even if they can get a judgement that what they’re doing is legal, it will cost them a lot in litigation fees, coupled with the public perception that these culture vampires are ripping off the poor honest author. Not a good place to be in.
Update on the state of AI drug discovery companies: AI Does Not Make It Easy by Derek Lowe
lmao incredible
Then there’s BenevolentAI. I first wrote about them in 2018, as the company stated that it had “created a bioscience machine brain, purpose-built to discover new medicines and cures for disease.” How’s the machine brain doing these days? Well, the company’s lead program failed in the clinic last year, and in April announced major layoffs.
just who buys this shit? this reads like refined crypto nonsense
People who don’t get much tech, overdose on hype, and have heard of AlphaFold. Imagine how much worse things could be now after the Nobel.
i mean, how many of these people are out there that also have that kind of money, other than softbank
This is both old news and kinda… not that surprising when you think about it, but I searched and didn’t see any commentary on this here.