Frontier AI Models Are Getting Stumped by a Simple Children's Game
Earlier this week, researchers at Apple released a damning paper criticizing the AI industry for vastly overstating the ability of its top AI models to reason or "think." The team found that the models, including OpenAI's o3, Anthropic's Claude 3.7, and Google's Gemini, were stumped by even the simplest of puzzles. For instance, the "large reasoning models," or LRMs, consistently failed at Tower of Hanoi, a children's puzzle game that involves three pegs and a number of differently sized disks that have to be arranged in a specific order. The researchers found that the AI models' accuracy in the game was […]
Link:
https://futurism.com/frontier-ai-models-stumped-childrens-game