Nandan Thakur

Orcid: 0000-0001-6107-2460

According to our database1, Nandan Thakur authored at least 17 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor.
CoRR, 2024

Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

2023
MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages.
Trans. Assoc. Comput. Linguistics, 2023

NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation.
CoRR, 2023

Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval.
CoRR, 2023

HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution.
CoRR, 2023

Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard.
CoRR, 2023

SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Evaluating Embedding APIs for Information Retrieval.
Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

2022
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages.
CoRR, 2022

Domain Adaptation for Memory-Efficient Dense Retrieval.
CoRR, 2022

Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval.
Proceedings of the Thirty-First Text REtrieval Conference, 2022

GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

2021
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models.
CoRR, 2021

BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021


  Loading...