Nearly 10% of people ask AI chatbots for explicit content. Will it lead LLMs astray? [Article from October 3](www.zdnet.com)

posted 11 months ago

rufus@discuss.tchncs.de

localllama@sh.itjust.works

18 comments

They are referencing this paper: LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset from September 30.

The paper itself provides some insight into how people use LLMs and how the different use cases are distributed.

The researchers looked at conversations with 25 LLMs. The data was collected in the wild from 210K unique IP addresses on their Vicuna demo and Chatbot Arena website.


micheal65536@lemmy.micheal65536.duckdns.org

5 points

11 months ago

The Stable Diffusion 2 base model was trained on what we would today call a “censored” dataset. The Stable Diffusion 1 dataset included NSFW images; the base model doesn’t seem particularly biased towards or away from them, and it can be further trained in either direction because it has a foundational understanding of what those things are.


LocalLLaMA

!localllama@sh.itjust.works


A community for discussing LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.
