Testing theory of mind in large language models and humans - Nature Human Behaviour

Tracing the Thoughts of a Large Language Model

You know how some people seem to have a magic touch with LLMs? They get incredible, nuanced results while everyone else gets generic junk. The common wisdom is that this is a technical skill. A list of secret hacks, keywords, and formulas you have to learn. But a new paper suggests this...

Carlos E. Perez

x.com

🧵 1/8 The Illusion of Thinking: Are reasoning models like o1/o3, DeepSeek-R1, and Claude 3.7 Sonnet really "thinking"? 🤔 Or are they just throwing more compute towards pattern matching? The new Large Reasoning Models (LRMs) show promising gains on math and coding benchmarks, but we found their fundamental limitations...

Mehrdad Farajtabar

x.com

You Are Not a Parrot

Elizabeth Weil nymag.com

LLMs perform about as well as humans do on empirical tests designed to diagnose deficits in theory of mind.
