Gradually, then Suddenly: Upon the Threshold

RelatedInsightsHighlights

Thumbnail of www-x-com-karpathy-status-1891720635363254772-4531aa693c6a491c

I was given early access to Grok 3 earlier today, making me I think one of the first few who could run a quick vibe check. Thinking ✅ First, Grok 3 clearly has an around state of the art thinking model ("Think" button) and did great out of the box on my Settler's of Catan... See more

Andrej Karpathy

x.com

GPT 4.5 + interactive comparison :) Today marks the release of GPT4.5 by OpenAI. I've been looking forward to this for ~2 years, ever since GPT4 was released, because this release offers a qualitative measurement of the slope of improvement you get out of scaling pretraining compute (i.e. simply training a bigger... See more

Andrej Karpathy x.com

How many years until open source models take over? Is it never? I've been manually testing 20-30 different models all claiming impressive scores on benchmarks against OpenAI and Anthropic. What I've found: * Super tiny models are becoming insanely good * Medium... See more

Hrishi x.com