this post was submitted on 01 Sep 2025
92 points (97.9% liked)

Technology

4052 readers
346 users here now

Which posts fit here?

Anything that is at least tangentially connected to the technology, social media platforms, informational technologies and tech policy.


Post guidelines

[Opinion] prefixOpinion (op-ed) articles must use [Opinion] prefix before the title.


Rules

1. English onlyTitle and associated content has to be in English.
2. Use original linkPost URL should be the original link to the article (even if paywalled) and archived copies left in the body. It allows avoiding duplicate posts when cross-posting.
3. Respectful communicationAll communication has to be respectful of differing opinions, viewpoints, and experiences.
4. InclusivityEveryone is welcome here regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, education, socio-economic status, nationality, personal appearance, race, caste, color, religion, or sexual identity and orientation.
5. Ad hominem attacksAny kind of personal attacks are expressly forbidden. If you can't argue your position without attacking a person's character, you already lost the argument.
6. Off-topic tangentsStay on topic. Keep it relevant.
7. Instance rules may applyIf something is not covered by community rules, but are against lemmy.zip instance rules, they will be enforced.


Companion communities

!globalnews@lemmy.zip
!interestingshare@lemmy.zip


Icon attribution | Banner attribution


If someone is interested in moderating this community, message @brikox@lemmy.zip.

founded 2 years ago
MODERATORS
 

Researchers convinced ChatGPT to do things it normally wouldn’t with basic psychology.

you are viewing a single comment's thread
view the rest of the comments
[–] kayohtie@pawb.social 2 points 1 day ago (1 children)

Birds string together words they hear when they can repeat them, and end up with the short phrases they seem to make. It's extremely rare for them to actually understand meaning, most often it's simply association which is why you often get nonsensical responses that still sort of make sense, or sounds out of them that sound like words but just...aren't. The simalcrum of language without containing any. Often words can be linked by that, and our own brain wants to find words and sometimes we decipher ones, similar to seeing shapes in noise -- we just tend to realize that it's actually just recognition, not real.

What's happening here is the equivalent of recording a bird and playing back it's recording to itself to get a new response, as a chain. It's predictive text feeding itself, in a simplistic but not inaccurate manner given how language models actually work at a technical level, tokenizing the input to train and create matrices of language vectors that contain word fragments, and often loop back on themselves or into yet more matrices of options. This is the "beam size" option some models have when run, selecting how many search routines should be created simultaneously for things that map to probability values that make sense.

Our own reasoning is far more complicated. Sometimes we think it's just words, but our brain will seamlessly weave inner monologue into concepts and imagery or ideas without text and back again, sometimes into sounds or other things. We stitch together everything so seamlessly because it all actually has meaning for us.

LLMs having "reasoning" at all is operating by the Sapir-whorf hypothesis, which would imply there is no reasoning without language. And even animals can fucking reason without language. We absolutely did too. Sapir-whorf was an infantile thought experiment turned theory of language that's been patently proven wrong even when it makes for great sci-fi (see Arrival).

This isn't the difference between hearing a song live and played aloud, or midi/samples vs instruments. This is that part of our consciousness operates in some absolute wild ways that we can still only classify at a high level because the complexities are so far beyond what we can describe with models that, by comparison, are simplistic as hell.

Put another way, without transcriptions of "that's right, the square hole", if you showed two photos to a model and asked "where does this piece go" it's just going to "see" the shape in both, recognize the image->word mapping and come up with a response fitting that, without ever being able to "realize" it can go into the square hole without being prompted, because it can't invent.

Only parrot.

[–] nymnympseudonym@lemmy.world 1 points 1 day ago

we think it’s just words, but our brain will seamlessly weave inner monologue into concepts

Are you familiar with latent space representation?

Because yes, that's how LRM's work, cycling tokens in latent space multiple times before sending to upper layers and decoding into human words

https://arxiv.org/abs/2412.06769