18

Elon has responded to the criticism and is increasing the limits to a whopping:

Verified accounts: 8000 posts/day
Unverified accounts: 800 posts/day
New unverified accounts: 400 posts/day
you are viewing a single comment's thread
view the rest of the comments
[-] rlspam@sh.itjust.works 0 points 1 year ago

How true is the LLM data scraping threat?

[-] nottheengineer@feddit.de 1 points 1 year ago

Meta has shown that getting huge amounts of training data can lead to great results with a model that's much simpler than what openAI uses and it looks like they are taking a more open approach to LLMs because of that. Twitter has shitloads of possible training data, but it's Twitter so that data isn't great.

Elon is known to be afraid of AGIs becoming hostile, so that explains the decision.

I don't think it'll slow down AI development too much. There are new Llama-based models coming out every month that are better than the previous ones.

Reddit is a much better source of data and if they don't want to lose SEO, their data can still be gathered by scraping even after the API changes take effect.

this post was submitted on 01 Jul 2023
18 points (95.0% liked)

Technology

59419 readers
2993 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS