Login
Get access
Hagen Peters
H
Hagen Peters
@hagenpeters
Test account of @peter.
sublime.app
All cards
Following
Articles
Highlights
Articles
Highlights
We currently don't understand how to make sense of the neural activity within language models. Today, we are sharing improved methods for finding a large number of "features"—patterns of activity that we hope are human interpretable.