2025
EvidenceBench: A Benchmark for Extracting Evidence from Biomedical Papers.
CoRR, April, 2025

Single-Pass Document Scanning for Question Answering.
CoRR, April, 2025

2024
Measuring Risk of Bias in Biomedical Reports: The RoBBR Benchmark.
CoRR, 2024

BIRCO: A Benchmark of Information Retrieval Tasks with Complex Objectives.
CoRR, 2024

IR2: Information Regularization for Information Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024