Nicholas Goldowsky-Dill

According to our database1, Nicholas Goldowsky-Dill authored at least 4 papers between 2023 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning.
CoRR, 2024

The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks.
CoRR, 2024

Using Degeneracy in the Loss Landscape for Mechanistic Interpretability.
CoRR, 2024

2023
Localizing Model Behavior with Path Patching.
CoRR, 2023


  Loading...