Donald Metzler

Orcid: 0000-0003-4276-6269

Affiliations:
  • Google


According to our database1, Donald Metzler authored at least 132 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Watermark for Black-Box Language Models.
CoRR, 2024

Impact of Preference Noise on the Alignment Performance of Generative Language Models.
CoRR, 2024

Best-of-Venom: Attacking RLHF by Injecting Poisoned Preference Data.
CoRR, 2024

Gen-IR @ SIGIR 2024: The Second Workshop on Generative Information Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

SEMQA: Semi-Extractive Multi-Source Question Answering.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Searching Personal Collections.
Proceedings of the Information Retrieval: Advanced Topics and Techniques., 2024

2023
Report on the 1st Workshop on Generative Information Retrieval (Gen-IR 2023) at SIGIR 2023.
SIGIR Forum, December, 2023

Efficient Transformers: A Survey.
ACM Comput. Surv., 2023

PaRaDe: Passage Ranking using Demonstrations with Large Language Models.
CoRR, 2023

Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting.
CoRR, 2023

Gen-IR @ SIGIR 2023: The First Workshop on Generative Information Retrieval.
CoRR, 2023

[email protected] 2023: The First Workshop on Generative Information Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Surprise: Result List Truncation via Extreme Value Theory.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

UL2: Unifying Language Learning Paradigms.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Transcending Scaling Laws with 0.1% Extra Compute.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

How Does Generative Retrieval Scale to Millions of Passages?
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DSI++: Updating Transformer Memory with New Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

PaRaDe: Passage Ranking using Demonstrations with LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

LAIT: Efficient Multi-Segment Encoding in Transformers with Layer-Adjustable Interaction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Emergent Abilities of Large Language Models.
Trans. Mach. Learn. Res., 2022

Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models.
CoRR, 2022

Retrieval Augmentation for T5 Re-ranker using External Sources.
CoRR, 2022

Unifying Language Learning Paradigms.
CoRR, 2022

Atomized Search Length: Beyond User Models.
CoRR, 2022

Retrieval-Enhanced Machine Learning.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Transformer Memory as a Differentiable Search Index.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Confident Adaptive Language Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A New Generation of Perspective API: Efficient Multilingual Character-level Transformers.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Stochastic Retrieval-Conditioned Reranking.
Proceedings of the ICTIR '22: The 2022 ACM SIGIR International Conference on the Theory of Information Retrieval, Madrid, Spain, July 11, 2022

HyperPrompt: Prompt-based Task-Conditioning of Transformers.
Proceedings of the International Conference on Machine Learning, 2022

Charformer: Fast Character Transformers via Gradient-based Subword Tokenization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Scale Efficiently: Insights from Pretraining and Finetuning Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Scarf: Self-Supervised Contrastive Learning using Random Feature Corruption.
Proceedings of the Tenth International Conference on Learning Representations, 2022

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Stretching Sentence-pair NLI Models to Reason over Long Documents and Clusters.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Rethinking search: making domain experts out of dilettantes.
SIGIR Forum, 2021

Search and Discovery in Personal Email Collections.
Found. Trends Inf. Retr., 2021

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers.
CoRR, 2021

The Benchmark Lottery.
CoRR, 2021

Are Pre-trained Convolutions Better than Pre-trained Transformers?
CoRR, 2021

Rethinking Search: Making Experts out of Dilettantes.
CoRR, 2021

Label Smoothed Embedding Hypothesis for Out-of-Distribution Detection.
CoRR, 2021

Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study.
Proceedings of the WSDM '21, 2021

Synthesizer: Rethinking Self-Attention for Transformer Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

OmniNet: Omnidirectional Representations from Transformers.
Proceedings of the 38th International Conference on Machine Learning, 2021

HyperGrid Transformers: Towards A Single Model for Multiple Tasks.
Proceedings of the 9th International Conference on Learning Representations, 2021

Long Range Arena : A Benchmark for Efficient Transformers.
Proceedings of the 9th International Conference on Learning Representations, 2021

Are Pretrained Convolutions Better than Pretrained Transformers?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

How Reliable are Model Diagnostics?
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections.
CoRR, 2020

Matching Cross Network for Learning to Rank in Personal Search.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Stabilizing Neural Search Ranking Models.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Separate and Attend in Personal Email Search.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Parameter Tuning in Personal Search Systems.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Choppy: Cut Transformer for Ranked List Truncation.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Multitask Mixture of Sequential Experts for User Activity Streams.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Attribute-based Propensity for Unbiased Learning in Recommender Systems: Algorithm and Case Studies.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Improving Recommendation Quality in Google Drive.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Sparse Sinkhorn Attention.
Proceedings of the 37th International Conference on Machine Learning, 2020

Reverse Engineering Configurations of Neural Text Generation Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Personalized Online Spell Correction for Personal Search.
Proceedings of the World Wide Web Conference, 2019

Domain Adaptation for Enterprise Email Search.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Multi-view Embedding-based Synonyms for Email Search.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Revisiting Online Personal Search Metrics with the User in Mind.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Combining Decision Trees and Neural Networks for Learning-to-Rank in Personal Search.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

2018
Position Bias Estimation for Unbiased Learning to Rank in Personal Search.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Learning with Sparse and Biased Feedback for Personal Search.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Multi-Task Learning for Email Search Ranking with Auxiliary Query Clustering.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
Learning from User Interactions in Personal Search via Attribute Parameterization.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

2016
Learning to Rank with Selection Bias in Personal Search.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

2014
Learning to efficiently rank on big data.
Proceedings of the 23rd International World Wide Web Conference, 2014

2012
Effective query formulation with multiple information sources.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

Experimental methods for information retrieval.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Structured Event Retrieval over Microblog Archives.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Data integration from open internet sources to combat sex trafficking of minors.
Proceedings of the 13th Annual International Conference on Digital Government Research, 2012

FindAll: a local search engine for mobile phones.
Proceedings of the Conference on emerging Networking Experiments and Technologies, 2012

2011
A Feature-Centric View of Information Retrieval
The Information Retrieval Series 27, Springer, ISBN: 978-3-642-22898-8, 2011

USC/ISI at TREC 2011: Microblog Track.
Proceedings of The Twentieth Text REtrieval Conference, 2011

A cascade ranking model for efficient ranked retrieval.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Parameterized concept weighting in verbose queries.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Cross-corpus relevance projection.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Pseudo test collections for learning web search ranking functions.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Unsupervised Mining of Lexical Variants from Noisy Text.
Proceedings of the First workshop on Unsupervised Learning in NLP@EMNLP 2011, 2011

When close enough is good enough: approximate positional indexes for efficient ranked retrieval.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

An Empirical Evaluation of Data-Driven Paraphrase Generation Techniques.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
The anatomy of an ad: structured indexing and retrieval for sponsored search.
Proceedings of the 19th International Conference on World Wide Web, 2010

Measuring the reusability of test collections.
Proceedings of the Third International Conference on Web Search and Web Data Mining, 2010

Learning concept importance using a weighted dependence model.
Proceedings of the Third International Conference on Web Search and Web Data Mining, 2010

UMD and USC/ISI: TREC 2010 Web Track Experiments with Ivory.
Proceedings of The Nineteenth Text REtrieval Conference, 2010

Learning to efficiently rank.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Relevance and ranking in online dating systems.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Ranking under temporal constraints.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Improved latent concept expansion using hierarchical markov random fields.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Exploiting site-level information to improve web search.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Online expansion of rare queries for sponsored search.
Proceedings of the 18th International Conference on World Wide Web, 2009

Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search.
Proceedings of The Eighteenth Text REtrieval Conference, 2009

Building enriched document representations using aggregated anchor text.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Improving search relevance for implicitly temporal queries.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Search Engine Adaptation by Feedback Control Adjustment for Time-sensitive Query.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Semi-parametric and Non-parametric Term Weighting for Information Retrieval.
Proceedings of the Advances in Information Retrieval Theory, 2009

Search Engines - Information Retrieval in Practice.
Pearson Education, ISBN: 978-0-13-136489-9, 2009

2008
Beyond bags of words: effectively modeling dependence and features in information retrieval.
SIGIR Forum, 2008

A Statistical View of Binned Retrieval Models.
Proceedings of the Advances in Information Retrieval , 2008

Generalized inverse document frequency.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

To swing or not to swing: learning when (not) to advertise.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007
Linear feature-based models for information retrieval.
Inf. Retr., 2007

CIIR Experiments for TREC Legal 2007.
Proceedings of The Sixteenth Text REtrieval Conference, 2007

Latent concept expansion using markov random fields.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Using gradient descent to optimize language modeling smoothing parameters.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Pseudo-Aligned Multilingual Corpora.
Proceedings of the IJCAI 2007, 2007

Similarity Measures for Short Segments of Text.
Proceedings of the Advances in Information Retrieval, 2007

Automatic feature selection in the markov random field model for information retrieval.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

2006
Indri TREC Notebook 2006: Lessons Learned From Three Terabyte Tracks.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Improving the estimation of relevance models using large external corpora.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Estimation, sensitivity, and generalization in parameterized retrieval models.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

Beyond Bags of Words: Modeling Implicit User Preferences in Information Retrieval.
Proceedings of the Proceedings, 2006

2005
Analysis of Statistical Question Classification for Fact-Based Questions.
Inf. Retr., 2005

Indri at TREC 2005: Terabyte Track.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

UMass Robust 2005: Using Mixtures of Relevance Models for Query Expansion.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

A Markov random field model for term dependencies.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

The recap system for identifying information flow.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Similarity measures for tracking information flow.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

2004
Combining the language model and inference network approaches to retrieval.
Inf. Process. Manag., 2004

Indri at TREC 2004: Terabyte Track.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

Formal multiple-bernoulli models for language modeling.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

An Inference Network Approach to Image Retrieval.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004


  Loading...