Stephen E. Robertson

Orcid: 0000-0003-4115-6215

  • Microsoft Research

According to our database, Stephen E. Robertson authored at least 165 papers between 1974 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.






A Brief History of Search Results Ranking.
IEEE Ann. Hist. Comput., 2019

Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Average Precision at n.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Forward to the Past: Notes towards a Pre-history of Web Search.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Probabilistic models in IR and their relationships.
Inf. Retr., 2014

Effective and Robust Query-Based Stemming.
ACM Trans. Inf. Syst., 2013

Probabilistic group recommendation via information matching.
Proceedings of the 22nd International World Wide Web Conference, 2013

Modelling Score Distributions Without Actual Scores.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

On Using Fewer Topics in Information Retrieval Evaluations.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

A domain-independent approach to finding related entities.
Inf. Process. Manag., 2012

A Theory of Information Matching
CoRR, 2012

On real-time ad-hoc retrieval evaluation.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

On per-topic variance in IR evaluation.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

On Smoothing Average Precision.
Proceedings of the Advances in Information Retrieval, 2012

The Web, the Home and the Search Engine.
Proceedings of the Connected Home - The Future of Domestic Life, 2012

Introduction to special issue on the second international conference on the theory of information retrieval.
Inf. Retr., 2011

Modeling score distributions in information retrieval.
Inf. Retr., 2011

A Unified Relevance Retrieval Model by Eliteness Hypothesis
CoRR, 2011

On the Contributions of Topics to System Evaluation.
Proceedings of the Advances in Information Retrieval, 2011

On the choice of effectiveness measures for learning to rank.
Inf. Retr., 2010

Extending average precision to graded relevance judgments.
Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Simple Evaluation Metrics for Diversified Search Results.
Proceedings of the 3rd International Workshop on Evaluating Information Access, 2010

Expected browsing utility for web search evaluation.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Probabilistic Retrieval Models and Binary Independence Retrieval (BIR) Model.
Proceedings of the Encyclopedia of Database Systems, 2009

Average Precision at n.
Proceedings of the Encyclopedia of Database Systems, 2009

A few good topics: Experiments in topic set reduction for retrieval evaluation.
ACM Trans. Inf. Syst., 2009

The Probabilistic Relevance Framework: BM25 and Beyond.
Found. Trends Inf. Retr., 2009

Microsoft Research at TREC 2009: Web and Relevance Feedback Track.
Proceedings of The Eighteenth Text REtrieval Conference, 2009

Incorporating User Behavior Information in IR Evaluation.
Proceedings of the Workshop on Understanding the User, 2009

Deep versus shallow judgments in learning to rank.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Where to stop reading a ranked list?: threshold optimization using truncated score distributions.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Score Distributions in Information Retrieval.
Proceedings of the Advances in Information Retrieval Theory, 2009

Rethinking the ESP game.
Proceedings of the 27th International Conference on Human Factors in Computing Systems, 2009

On the history of evaluation in IR.
J. Inf. Sci., 2008

Karen Spärck Jones.
J. Assoc. Inf. Sci. Technol., 2008

Probabilistic relevance ranking for collaborative filtering.
Inf. Retr., 2008

SoftRank: optimizing non-smooth rank metrics.
Proceedings of the International Conference on Web Search and Web Data Mining, 2008

Relevance Feedback Track Overview: TREC 2008.
Proceedings of The Seventeenth Text REtrieval Conference, 2008

A new rank correlation coefficient for information retrieval.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

A new interpretation of average precision.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Selecting good expansion terms for pseudo-relevance feedback.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Modelling A User Population for Designing Information Retrieval Metrics.
Proceedings of the 2nd International Workshop on Evaluating Information Access, 2008

The study of information retrieval: a long view.
Proceedings of the 2nd International Conference on Information Interaction in Context, 2008

Using Terms from Citations for IR: Some First Results.
Proceedings of the Advances in Information Retrieval, 2008

Comparing citation contexts for information retrieval.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Selecting Query Term Alternations for Web Search by Exploiting Query Contexts.
Proceedings of the ACL 2008, 2008

Ambiguous requests: implications for retrieval tests, systems and theories.
SIGIR Forum, 2007

In memoriam: Karen Spärck Jones.
J. Documentation, 2007

On rank-based effectiveness measures and optimization.
Inf. Retr., 2007

In Memoriam: Karen Spärck Jones.
Inf. Process. Manag., 2007

Parallel methods for the update of partitioned inverted files.
Aslib Proc., 2007

Hits hits TREC: exploring IR evaluation results with network analysis.
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Expanding a Test Collection for Citation-based IR Experiments.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications) - RIAO 2007, 8th International Conference, Carnegie Mellon University, Pittsburgh, PA, USA, May 30, 2007

INEX 2007 Evaluation Measures.
Proceedings of the Focused Access to XML Documents, 2007

On Score Distributions and Relevance.
Proceedings of the Advances in Information Retrieval, 2007

The Last Half-Century: A Perspective on Experimentation in Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2007

On document relevance and lexical cohesion between query terms.
Inf. Process. Manag., 2006

Window-based Enterprise Expert Search.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Creating a Test Collection for Citation-based IR Experiments.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

On the science of search: statistical approaches, evaluation, optimisation.
Proceedings of the International Workshop on Research Issues in Digital Libraries, 2006

CISR at INEX 2006.
Proceedings of the Comparative Evaluation of XML Information Retrieval Systems, 2006

INEX 2006 Evaluation Measures.
Proceedings of the Comparative Evaluation of XML Information Retrieval Systems, 2006

XML-Structured Documents: Retrievable Units and Inheritance.
Proceedings of the Flexible Query Answering Systems, 7th International Conference, 2006

Optimisation methods for ranking functions with multiple parameters.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

On GMAP: and other transformations.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

On Event Spaces and Probabilistic Models in Information Retrieval.
Inf. Retr., 2005

Parallel methods for the generation of partitioned inverted files.
Aslib Proc., 2005

Microsoft Cambridge at TREC 14: Enterprise Track.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Relevance weighting for query independent evidence.
Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Field-Weighted XML Retrieval Based on BM25.
Proceedings of the Advances in XML Information Retrieval and Evaluation, 2005

Understanding inverse document frequency: on theoretical arguments for IDF.
J. Documentation, 2004

A new unified probabilistic model.
J. Assoc. Inf. Sci. Technol., 2004

Parallel computing for passage retrieval.
Aslib Proc., 2004

Microsoft Cambridge at TREC 13: Web and Hard Tracks.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

Exploiting hyperlink recommendation evidence in navigational web search.
Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

Parsimonious language models for information retrieval.
Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

Simple BM25 extension to multiple weighted fields.
Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, 2004

Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002.
SIGIR Forum, 2003

Query Expansion with Long-Span Collocates.
Inf. Retr., 2003

Applying Machine Learning to Text Segmentation for Information Retrieval.
Inf. Retr., 2003

On Collection Size and Retrieval Effectiveness.
Inf. Retr., 2003

Microsoft Cambridge at TREC-12: HARD track.
Proceedings of The Twelfth Text REtrieval Conference, 2003

Building a filtering test collection for TREC 2002.
Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Parallel Computing for Term Selection in Routing/Filtering.
Proceedings of the Advances in Information Retrieval, 2003

Challenges posed by web-based retrieval of scientific papers: Okapi participation in TIPS.
J. Inf. Sci., 2002

A tool for comparative evaluation in an interactive environment.
J. Inf. Sci., 2002

Comparing the Performance of Adaptive Filtering and Ranked Output Systems.
Inf. Retr., 2002

Threshold Setting and Performance Optimization in Adaptive Filtering.
Inf. Retr., 2002

Introduction to the Special Issue: Overview of the TREC Routing and Filtering Tasks.
Inf. Retr., 2002

Microsoft Cambridge at TREC 2002: Filtering Track.
Proceedings of The Eleventh Text REtrieval Conference, 2002

The TREC 2002 Filtering Track Report.
Proceedings of The Eleventh Text REtrieval Conference, 2002

Relative and absolute term selection criteria: a comparative study for English and Japanese IR.
Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Using self-supervised word segmentation in Chinese information retrieval.
Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Deciphering cluster representations.
Inf. Process. Manag., 2001

Microsoft Cambridge at TREC-10: Filtering and Web Tracks.
Proceedings of The Tenth Text REtrieval Conference, 2001

The TREC 2001 Filtering Track Report.
Proceedings of The Tenth Text REtrieval Conference, 2001

TREC-10 Web Track Experiments at MSRA.
Proceedings of The Tenth Text REtrieval Conference, 2001

Flexible Pseudo-Relevance Feedback Using Optimization Tables.
Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2001

Effective Site Finding Using Link Anchor Information.
Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2001

Flexible Pseudo-Relevance Feedback for NTCIR-2.
Proceedings of the Second Workshop Meeting on Evaluation of Chinese & Japanese Text Retrieval and Text Summarization, 2001

Relevance Feedback for Best Match Term Weighting Algorithms in Information Retrieval.
Proceedings of the Second DELOS Network of Excellence Workshop on Personalisation and Recommender Systems in Digital Libraries, 2001

Salton Award Lecture: On theoretical argument in information retrieval.
SIGIR Forum, 2000

Threshold setting in adaptive filtering.
J. Documentation, 2000

Experimentation as a way of life: Okapi at TREC.
Inf. Process. Manag., 2000

A probabilistic model of information retrieval: development and comparative experiments - Part 2.
Inf. Process. Manag., 2000

A probabilistic model of information retrieval: development and comparative experiments - Part 1.
Inf. Process. Manag., 2000

Probability-Based Chinese Text Processing and Retrieval.
Comput. Intell., 2000

Microsoft Cambridge at TREC-9: Filtering Track.
Proceedings of The Ninth Text REtrieval Conference, 2000

The TREC-9 Filtering Track Final Report.
Proceedings of The Ninth Text REtrieval Conference, 2000

Parallel Search Using Partitioned Inverted Files.
Proceedings of the Seventh International Symposium on String Processing and Information Retrieval, 2000

Evaluation in Information Retrieval.
Proceedings of the Lectures on Information Retrieval, Third European Summer-School, 2000

The TREC-9 Filtering Track.
SIGIR Forum, 1999

Inf. Retr., 1999

Okapi/Keenbow at TREC-8.
Proceedings of The Eighth Text REtrieval Conference, 1999

PLIERS at TREC8 - Appendix.
Proceedings of The Eighth Text REtrieval Conference, 1999

Proceedings of The Eighth Text REtrieval Conference, 1999

The TREC-8 Filtering Track Final Report - Figures.
Proceedings of The Eighth Text REtrieval Conference, 1999

The TREC-8 Filtering Track Final Report.
Proceedings of The Eighth Text REtrieval Conference, 1999

PLIERS: A Parallel Information Retrieval System Using MPI.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 1999

Process and Outcome: On the Evaluation of IR Systems in the Age of Interaction, GUIs and Multimedia.
Proceedings of Mira 99: Evaluating Interactive Information Retrieval, 1999

Clustering Information Retrieval Search Outputs.
Proceedings of the Challenge of Image Retrieval, 1999

Cyril W. Cleverdon (In Memoriam).
J. Am. Soc. Inf. Sci., 1998

Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive.
Proceedings of The Seventh Text REtrieval Conference, 1998

Proceedings of The Seventh Text REtrieval Conference, 1998

Evaluation of Interfaces for IRS: Modelling End-User Searching Behaviour.
Proceedings of the 20th Annual BCS-IRSG Colloquium on IR, 1998

Laboratory experiments with Okapi: participation in the TREC programme.
J. Documentation, 1997

Research and evaluation in information retrieval.
J. Documentation, 1997

Overview of the Okapi projects.
J. Documentation, 1997

Parallel computing in information retrieval - an updated review.
J. Documentation, 1997

Application of probabilistic methods to Chinese.
J. Documentation, 1997

Okapi at TREC-6 Automatic ad hoc, VLC, routing, filtering and QSDR.
Proceedings of The Sixth Text REtrieval Conference, 1997

Okapi Chinese Text Retrieval Experiments at TREC-6.
Proceedings of The Sixth Text REtrieval Conference, 1997

On Relevance Weights with Little Relevance Information.
Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1997

Gerard (Gerry) Salton.
J. Documentation, 1996

Evaluating Interactive Systems in TREC.
J. Am. Soc. Inf. Sci., 1996

Okapi at TREC-5.
Proceedings of The Fifth Text REtrieval Conference, 1996

Interactive Thesaurus Navigation: Intelligence Rules OK?
J. Am. Soc. Inf. Sci., 1995

Large Test Collection Experiments on an Operational, Interactive System: Okapi at TREC.
Inf. Process. Manag., 1995

Okapi at TREC-4.
Proceedings of The Fourth Text REtrieval Conference, 1995

Documentation note: Query-Document Symmetry and Dual Models.
J. Documentation, 1994

Okapi at TREC-3.
Proceedings of The Third Text REtrieval Conference, 1994

Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval.
Proceedings of the 17th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Dublin, 1994

Okapi at TREC-2.
Proceedings of The Second Text REtrieval Conference, 1993

Workshop Report - Use of Training Materials in Constructing Routing Queries.
Proceedings of The Second Text REtrieval Conference, 1993

On the Evaluation of IR Systems.
Inf. Process. Manag., 1992

Okapi at TREC.
Proceedings of The First Text REtrieval Conference, 1992

Machine Learning and Relevance Feedback.
Proceedings of The First Text REtrieval Conference, 1992

Evaluation of online catalogues: Eliciting information from the user.
Inf. Process. Manag., 1991

B.C. Brookes and information science education: a personal note.
J. Inf. Sci., 1990

On Term Selection for Query Expansion.
J. Documentation, 1990

On sample sizes for non-matched-pair IR experiments.
Inf. Process. Manag., 1990

Weighting, ranking and relevance feedback in a front-end system.
J. Inf. Sci., 1986

On Relevance weight estimation and Query Expansion.
J. Documentation, 1986

Probabilistic Automatic Indexing by Learning from Human indexers.
J. Documentation, 1984

The Unified Probabilistic Model for IR.
Proceedings of the Research and Development in Information Retrieval, 1982

Term Frequency and Term Value.
Proceedings of the Theoretical Issues in Information Retrieval, Proceedings of the Fourth International Conference on Information Storage and Retrieval, Oakland, California, USA, May 31, 1981

Probabilistic Models of Indexing and Searching.
Proceedings of the Information Retrieval Research, 1980

On Fuzzy sets: Reply to Cerny.
J. Am. Soc. Inf. Sci., 1979

Ranking in Principle.
J. Documentation, 1978

On the nature of fuzz: A diatribe.
J. Am. Soc. Inf. Sci., 1978

The probabilistic character of relevance.
Inf. Process. Manag., 1977

Relevance weighting of search terms.
J. Am. Soc. Inf. Sci., 1976

Information science and the phenomenon of information.
J. Am. Soc. Inf. Sci., 1976

Explicit and implicit variables in information retrieval (IR) systems.
J. Am. Soc. Inf. Sci., 1975

In Defence of Relevance.
J. Am. Soc. Inf. Sci., 1974
