494
Apple study exposes deep cracks in LLMs’ “reasoning” capabilities
(arstechnica.com)
This is a most excellent place for technology news and articles.
I think the strawberry problem is to ask it how many R's are in strawberry. Current AI gets it wrong almost every time.
That's because they don't see the letters, but tokens instead. A token can be one letter, but is usually bigger. So what the llm sees might be something like
When seeing it like that it's more obvious why the llm's are struggling with it