Eric J. Michaud

Orcid: 0000-0001-7912-1953

According to our database, Eric J. Michaud authored at least 14 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
The Geometry of Concepts: Sparse Autoencoder Feature Structure.
CoRR, 2024

Efficient Dictionary Learning with Switch Sparse Autoencoders.
CoRR, 2024

Survival of the Fittest Representation: A Case Study with Modular Addition.
CoRR, 2024

Not All Language Model Features Are Linear.
CoRR, 2024

Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models.
CoRR, 2024

Opening the AI black box: program synthesis via mechanistic interpretability.
CoRR, 2024

2023
Precision Machine Learning.
Entropy, January, 2023

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback.
Trans. Mach. Learn. Res., 2023

The Quantization Model of Neural Scaling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Omnigrok: Grokking Beyond Algorithmic Data.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Towards Understanding Grokking: An Effective Theory of Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2020
Examining the Causal Structures of Deep Neural Networks Using Information Theory.
Entropy, 2020

Understanding Learned Reward Functions.
CoRR, 2020

Examining the causal structures of deep neural networks using information theory.
CoRR, 2020

