AI

Zhaofeng Wu Reasoning skills of large language models are often overestimated

A polite disagreement bot ring is flooding Bluesky — reply guy as a (dis)service

BBrad Barrish

Announcing Grok

x.ai
Thumbnail of Announcing Grok

pphoebe

Superhuman performance of a large language model on the reasoning tasks of a physician

arxiv.org
Thumbnail of Superhuman performance of a large language model on the reasoning tasks of a physician

SHOW-1 and Showrunner Agents in Multi-Agent Simulations

fablestudio.github.io
Thumbnail of SHOW-1 and Showrunner Agents in Multi-Agent Simulations

Brian Klaas The Death of the Student Essay—and the Future of Cognition

Abhishek Agarwalx.com

Imaginary Friends Grew Up: We Panicked