Can Rager
According to our database1,
Can Rager
authored at least 10 papers
between 2023 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability.
CoRR, 2024
Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models.
CoRR, 2024
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models.
CoRR, 2024
2023
CoRR, 2023
Proceedings of UniReps: the First Workshop on Unifying Representations in Neural Models, 2023