2025

Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training.

[DOI]

William Merrill

Shane Arora

Dirk Groeneveld

Hannaneh Hajishirzi

CoRR, May, 2025

2 OLMo 2 Furious.

[DOI]

CoRR, January, 2025

OLMoE: Open Mixture-of-Experts Language Models.

[DOI]

et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

OLMoE: Open Mixture-of-Experts Language Models.

[DOI]

CoRR, 2024

CaLMQA: Exploring culturally specific long-form question answering across 23 languages.

[DOI]

CoRR, 2024

OLMo: Accelerating the Science of Language Models.

[DOI]

CoRR, 2024

OLMo: Accelerating the Science of Language Models.

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024