2025

Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking.

Wuwei Zhang

Fangcong Yin

Howard Yen

Danqi Chen

Xi Ye

CoRR, June, 2025

Precise Information Control in Long-Form Text Generation.

CoRR, June, 2025

MMTEB: Massive Multilingual Text Embedding Benchmark.

Kenneth C. Enevoldsen

Hippolyte Gisserot-Boukhlef

Lester James V. Miranda

CoRR, February, 2025

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation.

CoRR, January, 2025

HELMET: How to Evaluate Long-context Models Effectively and Thoroughly.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

How to Train Long-Context Language Models (Effectively).

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly.

CoRR, 2024

Long-Context Language Modeling with Parallel Context Encoding.

Howard Yen

Tianyu Gao

Danqi Chen

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Improving Interpersonal Communication by Simulating Audiences with Language Models.

CoRR, 2023

Enabling Large Language Models to Generate Text with Citations.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023