GitHub - EleutherAI/sae-auto-interp

GitHub - EleutherAI/sae-auto-interp

GitHub - EleutherAI/sae-auto-interp

EleutherAI Institute github.com

RelatedInsightsHighlights

The Urgency of Interpretability: Why it's crucial that we understand how AI models work https://t.co/Mz8R23uxgy

Dario Amodei x.com

Tracing the Thoughts of a Large Language Model

anthropic.com anthropic.com

Thumbnail of Tracing the Thoughts of a Large Language Model

Mapping the Mind of a Large Language Model

anthropic.com anthropic.com