AI
Matt Ross and
AI
Matt Ross and
Through extensive experimentation across diverse puzzles, we show that frontier LRMs face a complete accuracy collapse beyond certain complexities. Moreover, they exhibit a counter-intuitive scaling limit: their reasoning effort increases with problem complexity up to a point, then declines despite having an adequate token budget. By comparing LRMs
... See moreThe Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
I lost my mom to Parkinson’s this year, so this hit a nerve. My mom never would have done this sort of thing, but as the saying goes, this is the worst it will ever be. I hope I live disease-free long enough to see some significant treatment advancements for neurodegenerative diseases.