2025
Command A: An Enterprise-Ready Large Language Model.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, April, 2025

Rope to Nope and Back Again: A New Hybrid Attention Strategy.
CoRR, January, 2025

2024
Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier.
CoRR, 2024

Aya 23: Open Weight Releases to Further Multilingual Progress.
CoRR, 2024

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024