Acyr Locatelli

According to our database, Acyr Locatelli authored at least 10 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Timeline

[Chart: papers per year, 2022 to 2024, by publication type: book, in proceedings, article, PhD thesis, dataset, other.]

Bibliography

2024
Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier.
CoRR, 2024

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models.
CoRR, 2024

Understanding Likelihood Over-optimisation in Direct Alignment Algorithms.
CoRR, 2024

Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts.
CoRR, 2024

To Code, or Not To Code? Exploring Impact of Code in Pre-training.
CoRR, 2024

Aya 23: Open Weight Releases to Further Multilingual Progress.
CoRR, 2024

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

SnapKV: LLM Knows What You are Looking for Before Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2022
Exploring Low Rank Training of Deep Neural Networks.
CoRR, 2022

