this post was submitted on 22 Jun 2025
28 points (96.7% liked)

TechTakes

1987 readers
56 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 2 years ago
MODERATORS
 

I think this summarizes in one conversation what is so fucking irritating about this thing: I am supposed to believe that it wrote that code.

No siree, no RAG, no trickery with training a model to transform the code while maintaining identical expression graph, it just goes from word-salading all over the place on a natural language task, to outputting 100 lines of coherent code.

Although that does suggest a new dunk on computer touchers, of the AI enthusiast kind, you can point at that and say that coding clearly does not require any logical reasoning.

(Also, as usual with AI it is not always that good. sometimes it fucks up the code, too).

you are viewing a single comment's thread
view the rest of the comments

That's what I was going to say. The natural language version actually claims that it leaves the dog behind unattended in every step, even though the following step continues as though it still has the dog and not whichever vegetable it brought back in the previous step.

Either it's not actually good at natural language processing or some element of the solution isn't surviving the shift from the river_cross() tool to natural language output. Whatever actual state it's tracking internally doesn't track to the output past the headline.