Aleksandar Makelov
According to our database1,
Aleksandar Makelov
authored at least 6 papers
between 2018 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control.
CoRR, 2024
Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching.
CoRR, 2023
Proceedings of the International Conference on Machine Learning, 2023
2022
PhD thesis, 2022
2018
Proceedings of the 6th International Conference on Learning Representations, 2018