GitHub - EleutherAI/sae-auto-interp

Anthropicx.com
Dario Amodeix.com

Tracing the Thoughts of a Large Language Model

anthropic.comanthropic.com
Thumbnail of Tracing the Thoughts of a Large Language Model