Robots.txt for LLMs (matt-rickard.com)
[-] NeoNachtwaechter@lemmy.world 7 points 1 year ago

creators of LLMs would like to know [...] that they haven’t been trained on copyrighted data.

I'm not quite sure about that LOL

Hasn't Google recently announced that the whole internet now belongs to them for the purpose of training their next models?

[-] seasonone@opidea.xyz 3 points 1 year ago* (last edited 1 year ago)

Hasn’t Google recently announced that the whole internet now belongs to them for the purpose of training their next models?

When did this happen? I mean I am aware of their privacy policy but this??

[-] M_Reimer@lemmy.world 5 points 1 year ago

Forget about it. With all this nasty LLM stuff, companies take it for granted that they can steal everything, everywhere.

this post was submitted on 22 Jul 2023
15 points (89.5% liked)
