Technology

75300 readers

4078 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

Twitter tells users to touch grass, adds new rule limiting how many tweets you can read per day (lemmy.world)

submitted 2 years ago by NevermindNoMind@lemmy.world to c/technology@lemmy.world

20 comments fedilink hide all child comments

Elon has responded to the criticism and is increasing the limits to a whopping:

Verified accounts: 8000 posts/day
Unverified accounts: 800 posts/day
New unverified accounts: 400 posts/day

you are viewing a single comment's thread
view the rest of the comments

[–] rlspam@sh.itjust.works 0 points 2 years ago (1 children)

How true is the LLM data scraping threat?

[–] nottheengineer@feddit.de 1 points 2 years ago

Meta has shown that getting huge amounts of training data can lead to great results with a model that's much simpler than what openAI uses and it looks like they are taking a more open approach to LLMs because of that. Twitter has shitloads of possible training data, but it's Twitter so that data isn't great.

Elon is known to be afraid of AGIs becoming hostile, so that explains the decision.

I don't think it'll slow down AI development too much. There are new Llama-based models coming out every month that are better than the previous ones.

Reddit is a much better source of data and if they don't want to lose SEO, their data can still be gathered by scraping even after the API changes take effect.