Xueguang Ma

Orcid: 0000-0003-3430-4910

According to our database1, Xueguang Ma authored at least 43 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Toward Best Practices for Training Multilingual Dense Retrieval Models.
ACM Trans. Inf. Syst., March, 2024

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs.
CoRR, 2024

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark.
CoRR, 2024

Fine-Tuning LLaMA for Multi-Stage Text Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Augmenting Black-box LLMs with Medical Textbooks for Biomedical Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Unifying Multimodal Retrieval via Document Screenshot Embedding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks.
Trans. Mach. Learn. Res., 2023

Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models.
CoRR, 2023

Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering.
CoRR, 2023

Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard.
CoRR, 2023

Zero-Shot Listwise Document Reranking with a Large Language Model.
CoRR, 2023

SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Tevatron: An Efficient and Flexible Toolkit for Neural Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Enhancing Sparse Retrieval via Unsupervised Learning.
Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023

TheoremQA: A Theorem-driven Question Answering Dataset.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Few-shot In-context Learning on Knowledge Base Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Precise Zero-Shot Dense Retrieval without Relevance Labels.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Personalized multi-faceted trust modeling to determine trust links in social media and its potential for misinformation management.
Int. J. Data Sci. Anal., 2022

Towards Best Practices for Training Multilingual Dense Retrieval Models.
CoRR, 2022

Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval.
CoRR, 2022

Document Expansion Baselines and Learned Sparse Lexical Representations for MS MARCO V1 and V2.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Another Look at DPR: Reproduction of Training and Replication of Retrieval.
Proceedings of the Advances in Information Retrieval, 2022

Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study.
Proceedings of the Advances in Information Retrieval, 2022

Pseudo-Relevance Feedback with Dense Retrievers in Pyserini.
Proceedings of the 26th Australasian Document Computing Symposium, 2022

2021
Sparsifying Sparse Representations for Passage Retrieval by Top-k Masking.
CoRR, 2021

Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval.
CoRR, 2021

A Few Brief Notes on DeepImpact, COIL, and a Conceptual Framework for Information Retrieval Techniques.
CoRR, 2021

A Replication Study of Dense Passage Retriever.
CoRR, 2021

Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research with Sparse and Dense Representations.
CoRR, 2021

Vera: Prediction Techniques for Reducing Harmful Misinformation in Consumer Health Search.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Pyserini: A Python Toolkit for Reproducible Information Retrieval Research with Sparse and Dense Representations.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

e-Health for Older Adults: Navigating Misinformation.
Proceedings of the 7th International Conference on Information and Communication Technologies for Ageing Well and e-Health, 2021

Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

On the Separation of Logical and Physical Ranking Models for Text Retrieval Applications.
Proceedings of the Second International Conference on Design of Experimental Search & Information REtrieval Systems, 2021

Scientific Claim Verification with VerT5erini.
Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis, 2021

2020
H2oloo at TREC 2020: When all you got is a hammer... Deep Learning, Health Misinformation, and Precision Medicine.
Proceedings of the Twenty-Ninth Text REtrieval Conference, 2020

2006
Web-Based Education Accountability System and Organizational Changes: An Actor-Network Approach.
Int. J. Web Based Learn. Teach. Technol., 2006

2005
Building a Web-Based Accountability System in a Teacher Education Program.
Interact. Learn. Environ., 2005


  Loading...