GitHub - EleutherAI/sae-auto-interp
GitHub - EleutherAI/sae-auto-interp
EleutherAI Institute
github.com
Related
Insights
Highlights
7
7
New Anthropic research: Tracing the thoughts of a large language model. We built a "microscope" to inspect what happens inside AI models and use it to understand Claude’s (often complex and surprising) internal mechanisms. https://t.co/PboGlLFnHG
Anthropic
x.com
4
4
The Urgency of Interpretability: Why it's crucial that we understand how AI models work https://t.co/Mz8R23uxgy
Dario Amodei
x.com
7
7
Tracing the Thoughts of a Large Language Model
anthropic.com
anthropic.com
Unlock unlimited Related cards