26 points

Google points to your content so others can find it.

OpenAI scrapes your content to use to make more content.

-23 points

That’s not a meaningful distinction. I spent all day using a Copilot search engine because the answers I wanted were scattered across a bunch of different documentation sites.

It used the AI models to interpret my commands (not generation at all), and it only published content to me specifically.

10 points

It’s absolutely a meaningful distinction. Search engines push people to your website, where you can capitalize on your audience however you see fit. LLMs take your content, throw it through the mixer, and sell it back to people. It’s the difference between a movie reviewer explaining a movie and a dude in an alley selling a pirated copy of the movie.

-1 points

A) An LLM does not inherently sell you anything. Some companies charge you to run and use their LLMs (OpenAI), and some companies publish their LLMs open source for anyone to use (Meta, Microsoft). With neural chips starting to pop up in PCs and phones, pretty soon anyone will be able to run an open source LLM locally on their machine, completely for free.

B) LLMs still rarely regurgitate the exact same original source. This would be more like someone in the back alley putting on their own performance of the movie and morphing it and adjusting it in real time based on your prompts and comments, which is a lot closer to parody and fair use than blatant piracy.

13 points

Technically it is meaningful: fair use is specifically for things that don’t replace the original in function.

-3 points

Depends on what the function was. If the function was to drive ad revenue to your site, then sure. If the function was to get information into the public, then it’s not replacing the function so much as altering and updating it.

14 points

I’m talking about the training phase of LLMs. That is the portion that is doing the scraping and generation of copyrighted data.

You using an already trained LLM to do some searches is not the same thing.


Technology

!technology@lemmy.world

This is a most excellent place for technology news and articles.

