Glad this is becoming a meme(fedia.io)

posted 8 months ago

lledrtx@lemmy.world

memes@lemmy.ml

65 commentshide report

Sort:

Hot Top Controversial New Old

You are viewing a single thread.

View all comments View context

[ - ]

Natanael@slrpnk.net

19 points

8 months ago

Training from scratch and retraining is expensive. Also, they want to avoid training on ML outputs as samples, they want primarily human made works as samples, and after the initial public release of LLMs it has become harder to create large datasets without ML stuff in them

permalink

report

parent

[ - ]

Scrubbles@poptalk.scrubbles.tech

13 points

8 months ago

There was a good paper that came out recently saying that training on ml data will result in a collapse of cohesion. It’s going to be real interesting, I don’t know if they’ll be able to train as easily ever again

permalink

report

parent

[ - ]

Iron Lynx@lemmy.world

4 points

8 months ago

I recall spotting a few things about Image Generators having their training data contaminated using generated images, and the output becoming significantly worse. So yeah, I guess LLMs and IGA’s need natural sources, or it gets more inbred than the Habsburgs.

permalink

report

parent

[ - ]