2025
Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking.
CoRR, June, 2025

Precise Information Control in Long-Form Text Generation.
CoRR, June, 2025

MMTEB: Massive Multilingual Text Embedding Benchmark.
CoRR, February, 2025

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation.
CoRR, January, 2025

HELMET: How to Evaluate Long-context Models Effectively and Thoroughly.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly.
CoRR, 2024

How to Train Long-Context Language Models (Effectively).
CoRR, 2024

Long-Context Language Modeling with Parallel Context Encoding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Improving Interpersonal Communication by Simulating Audiences with Language Models.
CoRR, 2023

Enabling Large Language Models to Generate Text with Citations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023