this post was submitted on 08 Jun 2025
277 points (93.7% liked)

Fuck AI

3052 readers
1539 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[โ€“] halcyoncmdr@lemmy.world 32 points 1 day ago (1 children)

[Citation needed]

If anything the LLMs have gotten less useful and started hallucinating even more obviously now.

7 months ago: https://web.archive.org/web/20241210232635/https://openlm.ai/chatbot-arena/ Now: https://web.archive.org/web/20250602092229/https://openlm.ai/chatbot-arena/

You can see that o1-mini, a silver (almost gold) model, is now a middle-of-the-road copper model.

Note that Chatbot Arena calculates its score relatively - they'll show two outputs (without the model names), and people select the output they prefer. The preferences are ordered. Not sure what accounts for gold/silver/copper.