this post was submitted on 08 Jun 2025
277 points (93.7% liked)
Fuck AI
3052 readers
1539 users here now
"We did it, Patrick! We made a technological breakthrough!"
A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
[Citation needed]
If anything the LLMs have gotten less useful and started hallucinating even more obviously now.
7 months ago: https://web.archive.org/web/20241210232635/https://openlm.ai/chatbot-arena/ Now: https://web.archive.org/web/20250602092229/https://openlm.ai/chatbot-arena/
You can see that o1-mini, a silver (almost gold) model, is now a middle-of-the-road copper model.
Note that Chatbot Arena calculates its score relatively - they'll show two outputs (without the model names), and people select the output they prefer. The preferences are ordered. Not sure what accounts for gold/silver/copper.