this post was submitted on 10 Jun 2025
74 points (94.0% liked)


OC below by @HaraldvonBlauzahn@feddit.org

What caught my attention is that assessments of AI are becoming polarized and somewhat a matter of belief.

Some people firmly believe LLMs are helpful. But programming is a logical task and LLMs can't think - only generate statistically plausible patterns.

The author of the article explains that this creates the same psychological hazards as astrology or tarot cards: psychological traps that have been exploited by psychics for centuries - and even very intelligent people can fall prey to these.

Finally, what should cause alarm is that, on top of LLMs not being able to think while people behave as if they do, there is no objective, scientifically sound examination of whether AI models can create working software any faster. Given that there are multi-billion dollar investments, and that there has been more than enough time to carry out controlled experiments, this should raise loud alarm bells.

[–] WhirlpoolBrewer 6 points 2 days ago (1 children)

I don't think the current common implementation of AI systems is "thinking", and I'll base my argument on Oxford's definitions of words. Thinking is defined as "the process of using one's mind to consider or reason about something". I'll ignore the word "mind" and focus on the word "reason". I don't think what AIs are doing counts as reasoning as defined by Oxford. Let's go to that definition: "the power of the mind to think, understand, and form judgments by a process of logic". I take issue with the assertion that they form judgments. For completeness, though I don't think its definition is particularly relevant here, a judgment is: "the ability to make considered decisions or come to sensible conclusions".

I think when you ask an LLM how many 'r's there are in Strawberry and questions along this line you can see they can't form judgments. These basic but obscure questions are where you see that the ability to form judgments isn't there. I would also add that if you "form judgments" you probably don't need to be reminded you formed a judgment immediately after forming one. If I ask an LLM a question and it provides an answer, I can convince it that it was wrong whether or not I'm making things up. I can tell it it made a mistake and it will blindly change its answer whether it made a mistake or not. That also doesn't feel like it's able to reason or make judgments.

This is where all the hype falls flat for me. Sometimes it looks like a concrete wall, but occasionally that concrete wall turns out to be made of wet paper. You can see how impressive the tool is and how paper-thin it is at the same time. It's cool, it's useful, it's fake, and that's OK. Just be aware of what the tool is.

[–] Kuinox@lemmy.world 0 points 2 days ago (1 children)

I think when you ask an LLM how many 'r’s there are in Strawberry and questions along this line you can see they can’t form judgments.

Like an LLM, you are making the wrong affirmation based on lacking knowledge.
Current LLMs input and output tokens; they don't ever see the individual letters, they see tokens. For "strawberry", they see 3 tokens.

They don't have any information about what characters are in these tokens, so they come up with something. If you learned a language only by speaking it, you'd be unable to write it down correctly (except in purely phonetic systems); instead you'd come up with what you think the written word should look like.
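
As an illustration, here is a small sketch of what the model actually receives. This assumes the tiktoken library and its cl100k_base encoding; the exact sub-word pieces depend on which tokenizer a given model uses:

# Requires: pip install tiktoken
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")     # encoding used by several OpenAI models
token_ids = enc.encode("strawberry")           # the integers the model actually sees
pieces = [enc.decode([t]) for t in token_ids]  # the sub-word chunks those ids stand for

print(token_ids)   # a handful of integers, not letters
print(pieces)      # sub-word pieces such as "str", "aw", "berry" (tokenizer-dependent)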

I would also add that if you “form judgments” you probably don’t need to be reminded you formed a judgment immediately after forming one.

You come up with the judgment before you are aware of it: https://www.unsw.edu.au/newsroom/news/2019/03/our-brains-reveal-our-choices-before-were-even-aware-of-them--st

I can tell it it made a mistake and it will blindly change its answer whether it made a mistake or not. That also doesn't feel like it's able to reason or make judgments.

That's also how the brain can work: it comes up with a plausible explanation after already having the result.
See the experiments discussed here: https://www.youtube.com/watch?v=wfYbgdo8e-8

I showed in humans some of the same behavior you observed in LLMs. Does this mean that, by your definition, humans don't think?

[–] WhirlpoolBrewer 8 points 2 days ago (1 children)

If the LLM could reason, shouldn't it be able to say "my token training prevents me from understanding the question as asked. I don't know how many 'r's there are in Strawberry, and I don't have a means of finding that answer"? Or at least something similar, right? If I asked you what some word meant in a language you didn't know, you would be able to say "I don't know that word or language". You may be able to give me all sorts of reasons why you don't know it, and that's all fine. But you would be aware that you don't know, and would be able to say "I don't know".

If I understand you correctly, you're saying the LLM gets it wrong because it doesn't know or understand that words are built from letters because all it knows are tokens. I'm saying that's fine, but it should be able to reason that it doesn't know the answer, and say that. I assert that it doesn't know that it doesn't know what letters are, because it is incapable of coming to that judgement about its own knowledge and limitations.

Being able to say what you know and what you don't know are critical to being able to solve logic problems. Knowing which information is missing and can be derived from known things, and which cannot be derived is key to problem solving based on reason. I still assert that LLMs cannot reason.

[–] Kuinox@lemmy.world -1 points 2 days ago* (last edited 1 day ago) (1 children)

I’m saying that’s fine, but it should be able to reason that it doesn’t know the answer, and say that.

That is of course a big problem. They try to guess too much stuff, but it's also why it kind of works. Symbolic AIs have the opposite problem: they are rarely useful because they can't guess; they are rooted in hard logic and cannot come up with a reasonable guess.
Humans also guess and sometimes get it wrong; it's required in order to produce results from our thinking and not get stuck in a state where we don't have enough data to do anything, like a symbolic AI.

So this becomes a spectrum, with humans somewhere in the middle between LLMs and symbolic AIs.
LLMs are not completely unable to say what they know and don't know; they are just extremely bad at it from our point of view.

The problem with "does it think" is that it doesn't give any quantity or quality.

[–] WhirlpoolBrewer 1 points 1 day ago (1 children)

Is the argument that LLMs are thinking because they make guesses when they don't know things, combined with no quantity or quality being provided to describe thinking?

If so, I would suggest that the word "guessing" is doing a lot of heavy lifting here. The real question would be "is statistics guessing?" I would say guessing and statistics are not the same thing, and Oxford would agree. An LLM just grabs the token that, based on its training data, is statistically most likely to come next. I don't think grabbing the most likely next token counts as guessing. That feels very algorithmic and statistical to me. It is also possible I'm missing the argument still.
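
As a rough illustration of what "grabbing the most likely next token" means (the candidate tokens and scores here are made up for the example; real models work over vocabularies of tens of thousands of tokens):

import math

def softmax(logits):
    # Turn raw model scores into a probability distribution.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical candidates and scores for the next token after "straw".
vocab = ["berry", "man", "board", "poll"]
logits = [4.1, 1.2, 2.0, 0.3]

probs = softmax(logits)
best = max(range(len(vocab)), key=lambda i: probs[i])
print(vocab[best])  # greedy decoding just picks "berry", the highest-probability token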

[–] Kuinox@lemmy.world 2 points 1 day ago (1 children)

Is the argument that LLMs are thinking because they make guesses

No, it's that you can't root the argument that they don't think in the fact that they make stuff up, because humans do too. You could root it in the amount of things they guess wrong, but that's extremely hard to measure.
Again, I'm not claiming that they think, but that we don't know until one or the other is proven.
Right now, thinking that one or the other is true is belief.

[–] WhirlpoolBrewer 1 points 1 day ago (1 children)

I think you can make a strong argument that they don't think, rooted in the idea that words should mean something and that statistics and thinking don't mean the same thing. To me, that feels like a fairly valid argument.

[–] Kuinox@lemmy.world 1 points 1 day ago* (last edited 1 day ago) (1 children)

So you think you need words to be able to think? Are monkeys, birds, and human babies unable to think, then?

[–] WhirlpoolBrewer 1 points 1 day ago (1 children)

My apologies, I was too vague. I'm saying that "thinking", by definition, is not "statistics". Where monkeys, birds, and human babies all "think", LLMs use algorithms and "statistics". I also think that "statistics" not meaning the same thing as "thinking" is a valid argument. I would go further and say it's important that words have meaning. That is what I was attempting to convey. I'm happy to clear up anything I was unclear about.

[–] Kuinox@lemmy.world 2 points 1 day ago (1 children)

You are mistaking how LLMs are trained for how they work.
Just because they are trained with statistics does not mean they compute, or think, using statistics.
For example, to do additions, LLMs internally do trigonometry: https://arxiv.org/abs/2502.00873
They probably do use statistics for tons of stuff internally, but humans do too: guessing, bias, tendency, preferences.
Anthropic researchers found that their LLMs have "features" for concepts.
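
To give a rough intuition for the trigonometry point, here is a toy sketch that adds digits by composing rotations on a circle and reading the answer off the resulting angle. It is only an illustration of the idea; the paper describes the model learning helix-like number features, not this exact code:

import math

def to_angle(d, period=10):
    # Place a digit on a circle: 0..9 become angles spread over 0..2*pi.
    return 2 * math.pi * d / period

def add_mod_via_rotation(a, b, period=10):
    # Compose the two rotations as complex numbers (cos + i*sin),
    # then convert the combined angle back into a digit.
    za = complex(math.cos(to_angle(a, period)), math.sin(to_angle(a, period)))
    zb = complex(math.cos(to_angle(b, period)), math.sin(to_angle(b, period)))
    angle = math.atan2((za * zb).imag, (za * zb).real) % (2 * math.pi)
    return round(angle * period / (2 * math.pi)) % period

print(add_mod_via_rotation(7, 8))  # 5, i.e. (7 + 8) mod 10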

[–] WhirlpoolBrewer 1 points 1 day ago (1 children)

I don't think you can disconnect how an LLM was trained from how it operates. If you train an LLM to use trigonometry to solve addition problems, I think you will find the LLM will do trigonometry to solve addition problems. If you train an LLM only in Russian, it will speak Russian. I would suggest that regardless of what you train it on, it will choose the statistically most likely next token based on its training data.

I would also suggest we don't know the exact training data being used for most LLMs, so as outsiders we can't say one way or another how the LLM is being trained to do anything. We can try to extrapolate how the LLM was trained from posts like the one that you linked, though. In general if that is how the LLM is coming to its next token, then the training data must be really heavily weighted in that manner.

[–] Kuinox@lemmy.world 1 points 1 day ago (1 children)

I don’t think you can disconnect how an LLM was trained from how it operates

You can; heck, the example I gave shows exactly this:

If you train an LLM to use trigonometry to solve addition problems, I think you will find the LLM will do trigonometry to solve addition problems.

It was not trained to do trigonometry to solve addition problems; it was trained to respond to additions. Trigonometry is how the statistical part, the backpropagation, found a way to make the neurons solve additions.

In general if that is how the LLM is coming to its next token, then the training data must be really heavily weighted in that manner.

You are mixing things up: the way LLMs are trained does not impose anything about how the neurons get organised to score better at inference.

[–] WhirlpoolBrewer 1 points 23 hours ago (1 children)

I would point out that I think you might be overly confident about the manner in which it was trained on addition. I'm open to being wrong here, but when you say "It was not trained to do trigonometry to solve addition problems", that suggests to me either you know how it was trained, or are making assumptions about how it was trained. I would suggest that unless you work at one of these companies, you probably are not privy to their training data. This is not an accusation; I think that is probably a trade secret at this point. And as for the idea that nobody would train an LLM to do addition in this manner, I invite you to glance at the Wikipedia article on addition. Really, glance at literally any math topic on Wikipedia. I didn't notice any trigonometry in that entry, but I did find a discussion of finding the limits of logarithmic equations in the "Related operations" section: https://en.m.wikipedia.org/wiki/Addition. They also cite convolution as another way to add, in which they jump straight to calculus: https://en.m.wikipedia.org/wiki/Convolution.

This is all to say, I would suggest that we don't know how they're training LLMs. We don't know what that training data is or how it is being used exactly. What we do know is that LLMs work on tokens and weights. The weights and statistical relevance to each of the other tokens depend on the training data, which we don't have access to.

I know this is not the point, but up until this point I've been fairly pedantic and tried to use the correct terminology, so I would point out that technically LLMs have "tensors" not "neurons". I get that tensors are designed to behave like neurons, and this is just me being pedantic. I know what you mean when you say neurons, just wanted to clarify and be consistent. No shade intended.

[–] Kuinox@lemmy.world 1 points 22 hours ago* (last edited 22 hours ago) (1 children)

that suggests to me either you know how it was trained, or are making assumptions about how it was trained.

The general way it is trained is known; the specifics and techniques are not. But the public does know how one of the flagship models was trained: the training process of DeepSeek R1 was documented in their research paper: https://arxiv.org/pdf/2501.12948

I read a chunk of it when it was released.

And as for the idea that nobody would train an LLM to do addition in this manner, I invite you to glance at the Wikipedia article on addition.

LLMs have multiple ways to do additions; I'll showcase two as an example. I asked ChatGPT 4.1 to solve a big addition. Here is its output:

 I need to add these two large numbers carefully.

First, I'll write them one under the other, aligned at the right:

                  21,500,996,531,314,339
+ 96,357,764,557,683,165,397,721
----------------------------------


Working from right to left:

Units place: 9 + 1 = 10. Write 0, carry 1.
Tens place: 3 + 2 + 1(carry) = 6. Write 6, no carry.
Hundreds place: 3 + 7 = 10. Write 0, carry 1.
Thousands place: 4 + 7 + 1(carry) = 12. Write 2, carry 1.
Ten thousands place: 1 + 9 + 1(carry) = 11. Write 1, carry 1.
Hundred thousands place: 3 + 3 + 1(carry) = 7. Write 7, no carry.
Millions place: 1 + 5 = 6. Write 6, no carry.
Ten millions place: 5 + 6 = 11. Write 1, carry 1.
Hundred millions place: 3 + 1 + 1(carry) = 5. Write 5, no carry.
Billions place: 5 + 3 = 8. Write 8, no carry.
Ten billions place: 9 + 8 = 17. Write 7, carry 1.
Hundred billions place: 9 + 6 + 1(carry) = 16. Write 6, carry 1.
Trillions place: 0 + 7 + 1(carry) = 8. Write 8, no carry.
Ten trillions place: 0 + 5 = 5. Write 5, no carry.
Hundred trillions place: 5 + 5 = 10. Write 0, carry 1.
Quadrillions place: 1 + 4 + 1(carry) = 6. Write 6, no carry.
Ten quadrillions place: 2 + 6 = 8. Write 8, no carry.

For the remaining places, I just copy the digits from the second number:
7, 7, 5, 3, 6, 9

Therefore, the answer is:
96,357,786,056,867,851,761,260

You can notice that the whole reasoning is correct, but it wrote the wrong response. I can expand more on this if you want (I do some research on it in my free time).
This way of decomposing the addition was of course learned from training data.
Now, the trigonometry used to calculate additions that I talked about earlier is not used for writing a "reasoning", but when it tries to write the correct response. It was created by the backpropagation trying to find a local minimum that can solve additions in order to more accurately predict the next token.
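
For anyone who wants to check the final line, Python's arbitrary-precision integers make the verification a one-liner (the two operands and the claimed answer are copied from the output above):

# The two numbers from the addition above.
a = 21_500_996_531_314_339
b = 96_357_764_557_683_165_397_721

exact = a + b
claimed = 96_357_786_056_867_851_761_260  # the model's final answer above

print(f"{exact:,}")         # the true sum, with thousands separators
print(exact == claimed)     # False: the model's written-out answer does not match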

so I would point out that technically LLMs have “tensors” not “neurons”.
I get that tensors are designed to behave like neurons, and this is just me being pedantic. I know what you mean when you say neurons, just wanted to clarify and be consistent. No shade intended.

Artificial neurons were made to behave like neurons: https://en.wikipedia.org/wiki/Artificial_neuron
And the terminology used is neurons; cf. the paper I sent earlier about how they do additions: https://arxiv.org/pdf/2502.00873

[–] WhirlpoolBrewer 1 points 7 hours ago (1 children)

I don't doubt that it can perform addition in multiple ways. I would go as far as saying it can probably attempt to perform addition in more ways than the average person as it has probably been trained on a bunch of math. Can it perform it correctly? Sometimes. That's ok, people make mistakes all the time too. I don't take away from LLMs just because they make mistakes. The ability to do math in multiple ways is not evidence of thinking though. That is evidence that it's been trained on at least a fair bit of math. I would say if you train it on a lot of math, it will attempt to do a lot of math. That's not thinking, that's just increasing the weighting on tokens related to math. If you were to train an LLM on nothing but math and texts about math, then asked it an art question, it would respond somewhat nonsensically with math. That's not thinking, that's just choosing the statistically most likely next token.

I had no idea about artificial neurons, TIL. I suppose that makes "neural networks" make more sense. In my readings on ML they always seemed to go straight to the tensor and overlook the neuron. They would go over the functions to help populate the weights but never used that term. Now I know.

[–] Kuinox@lemmy.world 1 points 4 hours ago

I've been re-reading my response and, my bad, I meant "artificial neurons were inspired by neurons", not that they behave like them; they have little in common.

If you were to train an LLM on nothing but math and texts about math, then asked it an art question, it would respond somewhat nonsensically with math.

If you asked a human who speaks German and nothing else a question in English, they would also respond in German (saying that they can't understand you).
LLMs sometimes (not often enough) do respond that they don't know.