this post was submitted on 25 Aug 2025

Fuck AI


cross-posted from: https://programming.dev/post/36289727

Comments

Our findings reveal a robustness gap for LLMs in medical reasoning, demonstrating that evaluating these systems requires looking beyond standard accuracy metrics to assess their true reasoning capabilities. When forced to reason beyond familiar answer patterns, all models demonstrate declines in accuracy, challenging claims of artificial intelligence’s readiness for autonomous clinical deployment.

A system dropping from 80% to 42% accuracy when confronted with a pattern disruption would be unreliable in clinical settings, where novel presentations are common. The results indicate that these systems are more brittle than their benchmark scores suggest.

[–] Peanutbjelly@sopuli.xyz -3 points 15 hours ago

"LLMs are not intelligent because they do not know anything. They repeat patterns in observed data."

we are also predictive systems, but that doesn't mean we are identical to LLMs. "LLMs are not intelligent because they do not know anything" can't be true without also implying that humans are not intelligent and do not know anything. there are some unaddressed framing issues in how this is being thought about.

they "know" how to interpret a lot of things in a way that is much more environmentally adaptable than a calculator. language is just a really weird eco-niche, and there is very little active participation, and the base model is not updated as environments change.

this is not saying humans and LLMs are identical; it's saying that the particular aspect you're claiming shows LLMs aren't intelligent... is a normal part of intelligent systems, rather than one of the real differences.

this is a spot somewhere in between "human intelligence is the only valid shape of intelligence" and "LLMs are literally humans"

as for vocabulary, i'm always willing to help those who can't find or figure out the tools to self-learn.

when i talk about 'tribal' aspects, i refer to collapsing complexity into a binary narrative to fit the preferences of your tribe, for survival reasons. i also refer to this as dumb ape brain, because it's a simplification of the world to the degree i would expect from literal apes trying to survive in the jungle, not from people trying to better understand the world around them. which is important when shouting your opinions at each other in big social movements. this is actually something you can map to first principles: how we use the errors our models experience in order to notice things, and how we contextualize the sensory experience after the fact. what i mean is, we have a good understanding of this, but nobody wants to hear it from the people who actually care.

'laziness' should mean a lack of epistemic vigilance, not a failure to comply with the existing socio-economic hierarchy and hustle culture. i say this because ignorance in this area is literally killing us all, including the billionaires who don't care what LLMs are, but will use every tool they can to maximize paperclips. i'd assume that jargon should at least have salience here... since paperclip maximizing is OG anti-AI talk, but it turns out to be very important for framing issues in human intelligence as well.

please try to think of something wholesome before continuing, because tribal (energy-saving) rage is basically a default on social media, but it's not conducive to learning.

RLHF = reinforcement learning from human feedback. basically upvoting/downvoting responses to alter future model behaviour, which often leads to sycophantic biases. important if you care about LLMs causing psychotic breaks.
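to make the sycophancy point concrete, here's a toy sketch. it is NOT the real RLHF pipeline (which trains a reward model on human preference pairs and then fine-tunes the LLM with something like PPO); it's just a two-option bandit where the response styles ("agree" vs "push_back"), the rater's upvote probabilities, and the learning rate are all made up for illustration, to show how naive reward-following drifts toward whatever gets the thumbs up.

```python
# toy sketch of feedback-driven sycophancy, NOT the actual RLHF algorithm.
# a simulated rater slightly prefers being agreed with, so the policy drifts.
import math
import random

random.seed(0)

prefs = {"agree": 0.0, "push_back": 0.0}  # learned preference scores (logits)
LR = 0.1  # learning rate for the naive update


def sample_response():
    """sample a response style from a softmax over the preference scores."""
    z = {k: math.exp(v) for k, v in prefs.items()}
    total = sum(z.values())
    r = random.random() * total
    for k, v in z.items():
        r -= v
        if r <= 0:
            return k
    return k


def human_feedback(response):
    """simulated rater: agreeable answers get upvoted a bit more often."""
    p_upvote = 0.8 if response == "agree" else 0.5
    return 1.0 if random.random() < p_upvote else -1.0


for _ in range(500):
    resp = sample_response()
    reward = human_feedback(resp)
    prefs[resp] += LR * reward  # reinforce whatever got the upvote

print(prefs)  # "agree" ends up with the much higher score -> sycophantic bias
```

the small gap in upvote probability (0.8 vs 0.5) is enough to push the policy almost entirely toward "agree" after a few hundred rounds, which is roughly the shape of the sycophancy complaint.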

"inter-modal dissonance" is where the different models using different representations make sense of things, but might not match up.

an example: the visual signal says you are alone in the room, while the audio signal says there is someone behind you.

you look behind you, and you collapse the dissonance, confirming with your visual modality whether the audio modality was being reliable. since both are attempting to be accurate, if there is no precision-weighting error (think hallucinations), a wider system should be able to resolve whether the audio processing was mistaken, or whether there is something present that isn't being picked up via the visual modality (if ghosts were real, they would fit here i guess).

this is how different systems work together to become more confident about an environment they are both fairly ignorant of (outside of distribution).

like cooperative triangulation via predictive sense-making.
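for the curious, the precision-weighting idea is often caricatured as inverse-variance-weighted cue combination. a minimal sketch, assuming gaussian-ish noise and entirely made-up numbers; the fuse() helper and the variance values are illustrative, not a model of brains or LLMs.

```python
# minimal sketch of precision-weighted cue combination: two modalities disagree,
# and the one with higher precision (lower variance) dominates the fused belief.
# numbers and the gaussian assumption are illustrative only.

def fuse(estimate_a, var_a, estimate_b, var_b):
    """combine two noisy estimates, weighting each by its precision (1/variance)."""
    precision_a, precision_b = 1.0 / var_a, 1.0 / var_b
    fused = (precision_a * estimate_a + precision_b * estimate_b) / (precision_a + precision_b)
    fused_var = 1.0 / (precision_a + precision_b)
    return fused, fused_var


# "is someone behind me?" as a scalar belief: 0 = alone, 1 = someone there.
vision = (0.05, 0.01)   # vision says "alone", and is trusted (low variance)
audio = (0.90, 0.20)    # audio says "someone there", but is noisier

belief, uncertainty = fuse(vision[0], vision[1], audio[0], audio[1])
print(belief, uncertainty)  # belief stays near 0: the precise cue dominates

# if vision becomes unreliable (say, it's dark), its precision drops and the
# same audio signal now swings the combined belief much further.
belief_dark, _ = fuse(vision[0], 0.5, audio[0], audio[1])
print(belief_dark)
```

the point of the caricature: the "look behind you" move is just gathering a higher-precision sample from the doubted modality so the fused estimate can settle the disagreement.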

i promise complex and new language is used to understand things, not just to hide bullshitting (like jordan peterson does).

i'd be stating this to the academics, but they aren't the ones being confidently wrong about a subject they are unwilling to learn about. i fully encourage going and listening to the academics to better understand what LLMs and humans actually are.

"speak to your target audience." is literally saying "stay in a confirmation bubble, and don't mess with other confirmation bubbles." while partial knowledge can be manipulated to obfuscate, this particular subject revolves around things that help predict and resist manipulation and deception.

frankly this stuff should be in the educational core right now because knowing how intelligence works is... weirdly important for developing intelligence.

because it's really important for people to generally be more co-constructive in the way they adjust their understanding of things, while resisting a lot of failure states that are actually the opposite of intelligence.

your effort in attempting this communication is appreciated and valuable. sorry that it is very energy-consuming, something that is frustrating thanks to people like jordan peterson or the same creationist cults mired in the current USA fascism problem, who, much like the relevant politicians, aren't trying to understand anything, but to waste your energy so they can do what they want without addressing the dissonance. so they can maximize paperclips.

all of this is important and relevant. shit's kinda whack by design, so i don't blame people for having difficulty, but effort to cooperatively learn is appreciated.