Alessandro Stolfo

According to our database1, Alessandro Stolfo authored at least 12 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Improving Instruction-Following in Language Models through Activation Steering.
CoRR, 2024

Confidence Regulation Neurons in Language Models.
CoRR, 2024

Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis.
CoRR, 2023

A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Longtonotes: OntoNotes with Longer Coreference Chains.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Distilling Reasoning Capabilities into Smaller Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Distilling Multi-Step Reasoning Capabilities of Large Language Models into Smaller Models via Semantic Decompositions.
CoRR, 2022

A Simple Unsupervised Approach for Coreference Resolution using Rule-based Weak Supervision.
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, 2022


  Loading...