Alex Mallen

According to our database1, Alex Mallen authored at least 9 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Automatically Interpreting Millions of Features in Large Language Models.
CoRR, 2024

Balancing Label Quantity and Quality for Scalable Elicitation.
CoRR, 2024

Neural Networks Learn Statistics of Increasing Complexity.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Eliciting Latent Knowledge from Quirky Language Models.
CoRR, 2023

Representation Engineering: A Top-Down Approach to AI Transparency.
CoRR, 2023

When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
When Not to Trust Language Models: Investigating Effectiveness and Limitations of Parametric and Non-Parametric Memories.
CoRR, 2022

Koopman-theoretic Approach for Identification of Exogenous Anomalies in Nonstationary Time-series Data.
CoRR, 2022

2021
Deep Probabilistic Koopman: Long-term time-series forecasting under periodic uncertainties.
CoRR, 2021


  Loading...