ChengXiang Zhai

Orcid: 0000-0002-6434-3702

Affiliations:
  • University of Illinois Urbana-Champaign, IL, USA


According to our database1, ChengXiang Zhai authored at least 435 papers between 1988 and 2024.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2017, "For contributions to information retrieval and text data mining".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Understanding the social construction of juvenile delinquency: insights from semantic analysis of big-data historical newspaper collections.
J. Comput. Soc. Sci., October, 2024

User Simulation for Evaluating Information Access Systems.
Found. Trends Inf. Retr., 2024

Large language models for whole-learner support: opportunities and challenges.
Frontiers Artif. Intell., 2024

Large Language Models for Relevance Judgment in Product Search.
CoRR, 2024

Prejudice and Caprice: A Statistical Framework for Measuring Social Discrimination in Large Language Models.
CoRR, 2024

Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement.
CoRR, 2024

If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents.
CoRR, 2024

Tutorial on User Simulation for Evaluating Information Access Systems on the Web.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

CharmBana: Progressive Responses with Real-Time Internet Search for Knowledge-Powered Conversations.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Large Language Models and Future of Information Retrieval: Opportunities and Challenges.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

TextData: Save What You Know and Find What You Don't.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Scaling Collaborative Learning: Using the Community Digital Library to Enrich Course Content.
Proceedings of the 55th ACM Technical Symposium on Computer Science Education, 2024

UOUO: Uncontextualized Uncommon Objects for Measuring Knowledge Horizons of Vision Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Exploring AI-powered Multimodal Analogies for Science Education.
Proceedings of the Joint Proceedings of the Human-Centric eXplainable AI in Education and the Leveraging Large Language Models for Next Generation Educational Technologies Workshops (HEXED-L3MNGET 2024) co-located with 17th International Conference on Educational Data Mining (EDM 2024), 2024

AnaDE1.0: A Novel Data Set for Benchmarking Analogy Detection and Extraction.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
KEBLM: Knowledge-Enhanced Biomedical Language Models.
J. Biomed. Informatics, 2023

C-PMI: Conditional Pointwise Mutual Information for Turn-level Dialogue Evaluation.
CoRR, 2023

Noise-Robust Dense Retrieval via Contrastive Alignment Post Training.
CoRR, 2023

Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders.
CoRR, 2023

Dense Sparse Retrieval: Using Sparse Language Models for Inference Efficient Dense Retrieval.
CoRR, 2023

Competence-Based Analysis of Language Models.
CoRR, 2023

CAM: A Large Language Model-based Creative Analogy Mining Framework.
Proceedings of the ACM Web Conference 2023, 2023

OVERVIEW OF THE TREC 2023 PRODUCT PRODUCT SEARCH TRACK.
Proceedings of the Thirty-Second Text REtrieval Conference Proceedings (TREC 2023), 2023

To Asymmetry and Beyond: Structured Pruning of Sequence to Sequence Models for Improved Inference Efficiency.
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

Quick Dense Retrievers Consume KALE: Post Training KullbackLeibler Alignment of Embeddings for Asymmetrical dual encoders.
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes.
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

Rethinking Conversational Agents in the Era of LLMs: Proactivity, Non-collaborativity, and Beyond.
Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023

Sparse Modular Activation for Efficient Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Retrieving Webpages Using Online Discussions.
Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval, 2023

An Exploration of Large Language Models for Verification of News Headlines.
Proceedings of the IEEE International Conference on Data Mining, 2023

Exploring Large Language Models for Low-Resource IT Information Extraction.
Proceedings of the IEEE International Conference on Data Mining, 2023

Decoding the Silent Majority: Inducing Belief Augmented Social Graph with Large Language Model for Response Forecasting.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Incorporating Task-Specific Concept Knowledge into Script Learning.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

The CDL: An Online Platform for Creating Community-based Digital Libraries.
Proceedings of the Computer Supported Cooperative Work and Social Computing, 2023

Tutorial on User Simulation for Evaluating Information Access Systems.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Augmenting nutritional metabolomics with a genome-scale metabolic model for assessment of diet intake.
Proceedings of the 14th ACM International Conference on Bioinformatics, 2023

Measuring the Effect of Influential Messages on Varying Personas.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

When to Use What: An In-Depth Comparative Empirical Analysis of OpenIE Systems for Downstream Applications.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Learning by Applying: A General Framework for Mathematical Reasoning via Enhancing Explicit Knowledge Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Structural and Textual Information Fusion for Symptom and Disease Representation Learning.
IEEE Trans. Knowl. Data Eng., 2022

AutoML to Date and Beyond: Challenges and Opportunities.
ACM Comput. Surv., 2022

Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT.
CoRR, 2022

Sparse*BERT: Sparse Models are Robust.
CoRR, 2022

Differential Query Semantic Analysis: Discovery of Explicit Interpretable Knowledge from E-Com Search Logs.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Toward a big data analysis system for historical newspaper collections research.
Proceedings of the PASC '22: Platform for Advanced Scientific Computing Conference, Basel, Switzerland, June 27, 2022

Drink Bleach or Do What Now? COVID-HeRA: A Study of Risk-Informed Health Decision Making in the Presence of COVID-19 Misinformation.
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, 2022

I3A: An Intelligent Interactive Information Agent Model for Information Retrieval.
Proceedings of the ICTIR '22: The 2022 ACM SIGIR International Conference on the Theory of Information Retrieval, Madrid, Spain, July 11, 2022

PRE: A Precision-Recall-Effort Optimization Framework for Query Simulation.
Proceedings of the ICTIR '22: The 2022 ACM SIGIR International Conference on the Theory of Information Retrieval, Madrid, Spain, July 11, 2022

An Optimization Approach to Automatic Construction of Browsable Concept Index for Organizing Online Educational Content.
Proceedings of the IEEE International Conference on Knowledge Graph, 2022

Fine Grained Categorization of Drug Usage Tweets.
Proceedings of the Social Computing and Social Media: Design, User Experience and Impact, 2022

Language Model Pre-Training with Sparse Latent Typing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

RATE: A Reliability-Aware Tester-Based Evaluation Framework of User Simulators.
Proceedings of the Advances in Information Retrieval, 2022

CONCRETE: Improving Cross-lingual Fact-checking with Cross-lingual Retrieval.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Entity Set Co-Expansion in StackOverflow.
Proceedings of the IEEE International Conference on Big Data, 2022

Improving Candidate Retrieval with Entity Profile Generation for Wikidata Entity Linking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Domain Representative Keywords Selection: A Probabilistic Approach.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Interactive Information Retrieval: Models, Algorithms, and Evaluation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

DarkJargon.net: A Platform for Understanding Underground Conversation with Latent Meaning.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

DeepQAMVS: Query-Aware Hierarchical Pointer Networks for Multi-Video Summarization.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

An Exploration of Tester-based Evaluation of User Simulators for Comparing Interactive Retrieval Systems.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

AdaReNet: Adaptive Reweighted Semi-supervised Active Learning to Accelerate Label Acquisition.
Proceedings of the PETRA '21: The 14th PErvasive Technologies Related to Assistive Environments Conference, Virtual Event, Greece, 29 June, 2021

Scaling Up Data Science Course Projects: A Case Study.
Proceedings of the L@S'21: Eighth ACM Conference on Learning @ Scale, 2021

Neural-Answering Logical Queries on Knowledge Graphs.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Text2Mol: Cross-Modal Molecule Retrieval with Natural Language Queries.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Towards Dark Jargon Interpretation in Underground Forums.
Proceedings of the Advances in Information Retrieval, 2021

A Study of Distributed Representations for Figures of Research Articles.
Proceedings of the Advances in Information Retrieval, 2021

Team Skeletor at Touché 2021: Argument Retrieval and Visualization for Controversial Questions.
Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

Fine-Grained Chemical Entity Typing with Multimodal Knowledge Representation.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

TriGORank: A Gene Ontology Enriched Learning-to-Rank Framework for Trigenic Fitness Prediction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Axiomatic thinking for information retrieval: introduction to special issue.
Inf. Retr. J., 2020

A Level-wise Taxonomic Perspective on Automated Machine Learning to Date and Beyond: Challenges and Opportunities.
CoRR, 2020

Drink bleach or do what now? Covid-HeRA: A dataset for risk-informed health decision making in the presence of COVID19 misinformation.
CoRR, 2020

Towards a Soft Faceted Browsing Scheme for Information Access.
CoRR, 2020

A Study of Methods for the Generation of Domain-Aware Word Embeddings.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

FigExplorer: A System for Retrieval and Exploration of Figures from Collections of Research Articles.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Collective Development of Large Scale Data Science Products via Modularized Assignments: An Experience Report.
Proceedings of the 51st ACM Technical Symposium on Computer Science Education, 2020

CrowdQM: Learning Aspect-Level User Reliability and Comment Trustworthiness in Discussion Forums.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2020

Leveraging Book Indexes for Automatic Extraction of Concepts in MOOCs.
Proceedings of the L@S'20: Seventh ACM Conference on Learning @ Scale, 2020

Explanation Mining.
Proceedings of the L@S'20: Seventh ACM Conference on Learning @ Scale, 2020

Finding Contextually Consistent Information Units in Legal Text.
Proceedings of the Natural Legal Language Processing Workshop 2020 co-located with the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD 2020), 2020

Leveraging Personalized Sentiment Lexicons for Sentiment Analysis.
Proceedings of the ICTIR '20: The 2020 ACM SIGIR International Conference on the Theory of Information Retrieval, 2020

Multi-task Learning for Multilingual Neural Machine Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Empirical Analysis of Impact of Query-Specific Customization of nDCG: A Case-Study with Learning-to-Rank Methods.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Semantic Text Analysis for Detection of Compromised Accounts on Social Networks.
Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2020

Transductive Ensemble Learning for Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Predicting Opioid Overdose Crude Rates with Text-Based Twitter Features (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Cooperative Reasoning on Knowledge Graph and Corpus: A Multi-agentReinforcement Learning Approach.
CoRR, 2019

Improving N-gram Language Models with Pre-trained Deep Transformer.
CoRR, 2019

Learning to Order Sub-questions for Complex Question Answering.
CoRR, 2019

Quantifying and Visualizing the Demand and Supply Gap from E-commerce Search Data using Topic Models.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

Help Me Search: Leveraging User-System Collaboration for Query Construction to Improve Accuracy for Difficult Queries.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Learning to Diversify for E-commerce Search with Multi-Armed Bandit.
Proceedings of the SIGIR 2019 Workshop on eCommerce, 2019

Neural Machine Translation with Soft Prototype.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

WOSView Demo: A Tool to Explore the Web of Slides.
Proceedings of the Sixth ACM Conference on Learning @ Scale, 2019

Web of Slides: Automatic Linking of Lecture Slides to Facilitate Navigation.
Proceedings of the Sixth ACM Conference on Learning @ Scale, 2019

Automatic Assessment of Complex Assignments using Topic Models.
Proceedings of the Sixth ACM Conference on Learning @ Scale, 2019

LiveDataLab: A Cloud-Based Platform to Facilitate Hands-on Data Science Education at Scale.
Proceedings of the Sixth ACM Conference on Learning @ Scale, 2019

Adapting Sequence to Sequence Models for Text Normalization in Social Media.
Proceedings of the Thirteenth International Conference on Web and Social Media, 2019

A Generative Model for Discovering Action-Based Roles and Community Role Compositions on Community Question Answering Platforms.
Proceedings of the Thirteenth International Conference on Web and Social Media, 2019

Multi-Agent Dual Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

Figure Retrieval from Collections of Research Articles.
Proceedings of the Advances in Information Retrieval, 2019

TILM: Neural Language Models with Evolving Topical Influence.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Analysis of Adaptive Training for Learning to Rank in Information Retrieval.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Exploring Multi-Objective Exercise Recommendations in Online Education Systems.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Non-local Attention Learning on Large Heterogeneous Information Networks.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Non-Autoregressive Machine Translation with Auxiliary Regularization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Web Search Result De-duplication and Clustering.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Web Search Relevance Feedback.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

SOFSAT: Towards a Setlike Operator based Framework for Semantic Analysis of Text.
SIGKDD Explor., 2018

Identifying Compromised Accounts on Social Media Using Statistical Text Analysis.
CoRR, 2018

NRF: A Naive Re-identification Framework.
Proceedings of the 2018 Workshop on Privacy in the Electronic Society, 2018

A Large-Scale Empirical Study on Android Runtime-Permission Rationale Messages.
Proceedings of the 2018 IEEE Symposium on Visual Languages and Human-Centric Computing, 2018

LinkSO: a dataset for learning to retrieve similar question answer pairs on software development forums.
Proceedings of the 4th ACM SIGSOFT International Workshop on NLP for Software Engineering, 2018

A Tutorial on Probabilistic Topic Models for Text Data Retrieval and Analysis.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

A Taxonomy of Queries for E-commerce Search.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Towards Optimization of e-Commerce Search and Discovery.
Proceedings of the SIGIR 2018 Workshop On eCommerce co-located with the 41st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2018), 2018

Modeling Diverse Relevance Patterns in Ad-hoc Retrieval.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Are we on the Right Track?: An Examination of Information Retrieval Methodologies.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Mining Android App Descriptions for Permission Requirements Recommendation.
Proceedings of the 26th IEEE International Requirements Engineering Conference, 2018

VisAGE: Integrating external knowledge into electronic medical record visualization.
Proceedings of the Biocomputing 2018: Proceedings of the Pacific Symposium, 2018

Learning to Rank and Discover for E-Commerce Search.
Proceedings of the Machine Learning and Data Mining in Pattern Recognition, 2018

CLaDS: a cloud-based virtual lab for the delivery of scalable hands-on assignments for practical data science education.
Proceedings of the 23rd Annual ACM Conference on Innovation and Technology in Computer Science Education, 2018

Mining MOOC Lecture Transcripts to Construct Concept Dependency Graphs.
Proceedings of the 11th International Conference on Educational Data Mining, 2018

JIM: Joint Influence Modeling for Collective Search Behavior.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Multi-Attribute Topic Feature Construction for Social Media-based Prediction.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Exploiting Knowledge Graph to Improve Text-based Prediction.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval.
SIGIR Forum, 2017

Document Language Models, Query Models, and Risk Minimization for Information Retrieval.
SIGIR Forum, 2017

Report on the SIGIR 2017 Workshop on Axiomatic Thinking for Information Retrieval and Related Tasks (ATIR).
SIGIR Forum, 2017

Dynamic credit allocation in scientific literature.
Scientometrics, 2017

Modeling the Influence of Popular Trending Events on User Search Behavior.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Numerical Facet Range Partition: Evaluation Metric and Methods.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Constructing and Embedding Abstract Event Causality Networks from Text Snippets.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Probabilistic Topic Models for Text Data Retrieval and Analysis.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

On Application of Learning to Rank for E-Commerce Search.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Axiomatic Thinking for Information Retrieval: And Related Tasks.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Modeling MOOC Student Behavior With Two-Layer Hidden Markov Models.
Proceedings of the Fourth ACM Conference on Learning @ Scale, 2017

A Probabilistic Approach for Discovering Difficult Course Topics Using Clickstream Data.
Proceedings of the Fourth ACM Conference on Learning @ Scale, 2017

ContextCare: Incorporating Contextual Information Networks to Representation Learning on Medical Forum Data.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Information Retrieval Evaluation as Search Simulation: A General Formal Framework for IR Evaluation.
Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval, 2017

High-Dimensional Variance-Reduced Stochastic Gradient Expectation-Maximization Algorithm.
Proceedings of the 34th International Conference on Machine Learning, 2017

Identifying Humor in Reviews using Background Text Sources.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Study of Feature Construction for Text-based Forecasting of Time Series Variables.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

TextScope: Enhance human perception via text mining.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Text-based geolocation prediction of social media users with neural networks.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Temporal reflected logistic regression for probabilistic heart failure survival score prediction.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

HEMnet: Integration of Electronic Medical Records with Molecular Interaction Networks and Domain Knowledge for Survival Analysis.
Proceedings of the 8th ACM International Conference on Bioinformatics, 2017

Framing Electronic Medical Records as Polylingual Documents in Query Expansion.
Proceedings of the AMIA 2017, 2017

Towards Privacy-Preserving Evaluation for Information Retrieval Models Over Industry Data Sets.
Proceedings of the Information Retrieval Technology, 2017

Dual-Clustering Maximum Entropy with Application to Classification and Word Embedding.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Non-native text analysis: A survey.
Nat. Lang. Eng., 2016

Personalized generation of word clouds from tweets.
J. Assoc. Inf. Sci. Technol., 2016

Towards a game-theoretic framework for text data retrieval.
IEEE Data Eng. Bull., 2016

Numerical Range Facets Partition: Evaluation Metric and Methods.
CoRR, 2016

DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.
Bioinform., 2016

A Sequential Decision Formulation of the Interface Card Model for Interactive IR.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Learning Query and Document Relevance from a Web-scale Click Graph.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

An Exploration of Automated Grading of Complex Assignments.
Proceedings of the Third ACM Conference on Learning @ Scale, 2016

Scaling up Online Question Answering via Similar Question Retrieval.
Proceedings of the Third ACM Conference on Learning @ Scale, 2016

Blind Men and The Elephant: Thurstonian Pairwise Preference for Ranking in Crowdsourcing.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Generative Feature Language Models for Mining Implicit Features from Customer Reviews.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Mobile App Retrieval for Social Media Users via Inference of Implicit Intent in Social Media Text.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Exploiting temporal divergence of topic distributions for event detection.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

A conditional probabilistic model for joint analysis of symptoms, diseases, and herbs in traditional Chinese medicine patient records.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016

PaReCat: Patient Record Subcategorization for Precision Traditional Chinese Medicine.
Proceedings of the 7th ACM International Conference on Bioinformatics, 2016

MeTA: A Unified Toolkit for Text Retrieval and Analysis.
Proceedings of ACL-2016 System Demonstrations, Berlin, Germany, August 7-12, 2016, 2016

2015
Understanding User Intents in Online Health Forums.
IEEE J. Biomed. Health Informatics, 2015

Beyond Independent Relevance: Methods and Evaluation Metrics for Subtopic Retrieval.
SIGIR Forum, 2015

Overcoming bias to learn about controversial topics.
J. Assoc. Inf. Sci. Technol., 2015

Negative query generation: bridging the gap between query likelihood retrieval models and relevance.
Inf. Retr. J., 2015

OpinoFetch: a practical and efficient approach to collecting opinions on arbitrary entities.
Inf. Retr. J., 2015

Exploiting ontology graph for predicting sparsely annotated gene function.
Bioinform., 2015

MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence.
Bioinform., 2015

Information Retrieval as Card Playing: A Formal Model for Optimizing Interactive Retrieval Interface.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Towards a Game-Theoretic Framework for Information Retrieval.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Leveraging User Reviews to Improve Accuracy for Mobile App Retrieval.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Retrieval of Relevant Opinion Sentences for New Products.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

SpecLDA: Modeling Product Reviews and Specifications to Generate Augmented Specifications.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Joint adaptive loss and l2/l0-norm minimization for unsupervised feature selection.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Beomap: Ad Hoc Topic Maps for Enhanced Exploration of Social Media Data.
Proceedings of the Engineering the Web in the Big Data Era - 15th International Conference, 2015

Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-Relevance Feedback.
Proceedings of the 2015 International Conference on The Theory of Information Retrieval, 2015

Mining Coordinated Intent Representation for Entity Search and Recommendation.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

SyntacticDiff: Operator-based transformation for comparative text mining.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

Hotspots of news articles: Joint mining of news text & social media to discover controversial points in news.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

Recommending forum posts to designated experts.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014
Exploiting rich user information for one-class collaborative filtering.
Knowl. Inf. Syst., 2014

Content-based citation analysis: The next generation of citation analysis.
J. Assoc. Inf. Sci. Technol., 2014

Bug characteristics in open source software.
Empir. Softw. Eng., 2014

User modeling in search logs via a nonparametric bayesian approach.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

A two-dimensional click model for query auto-completion.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

VIRLab: A Platform for Privacy-Preserving Evaluation for Information Retrieval Models.
Proceedings of the Proceeding of the 1st International Workshop on Privacy-Preserving IR: When Information Retrieval Meets Privacy and Security co-located with 37th Annual International ACM SIGIR conference, 2014

VIRLab: a web-based virtual lab for learning and studying information retrieval models.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

A Constrained Hidden Markov Model Approach for Non-Explicit Citation Context Extraction.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

The Fudan-UIUC Participation in the BioASQ Challenge Task 2a: The Antinomyra system.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

Mining Semi-Structured Online Knowledge Bases to Answer Natural Language Questions on Community QA Websites.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Unsupervised Feature Selection for Multi-View Clustering on Text-Image Web News Data.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Revisiting the Divergence Minimization Feedback Model.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Random walks on adjacency graphs for mining lexical relations from big text data.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

SideEffectPTM: an unsupervised topic model to mine adverse drug reactions from health forums.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014

Resolving healthcare forum posts via similar thread retrieval.
Proceedings of the 5th ACM Conference on Bioinformatics, 2014

Text Classification.
Proceedings of the Data Classification: Algorithms and Applications, 2014

2013
MiTexCube: MicroTextCluster Cube for online analysis of text cells and its applications.
Stat. Anal. Data Min., 2013

Supporting Keyword Search in Product Database: A Probabilistic Approach.
Proc. VLDB Endow., 2013

Leveraging comparable corpora for cross-lingual information retrieval in resource-lean language pairs.
Inf. Retr., 2013

A learning approach to optimizing exploration-exploitation tradeoff in relevance feedback.
Inf. Retr., 2013

Content-aware click modeling.
Proceedings of the 22nd International World Wide Web Conference, 2013

Ranking explanatory sentences for opinion summarization.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Structural Parse Tree Features for Text Representation.
Proceedings of the 2013 IEEE Seventh International Conference on Semantic Computing, 2013

Understanding evolution of research themes: a probabilistic generative model for citations.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

EventCube: multi-dimensional search and mining of structured and text data.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Robust Unsupervised Feature Selection.
Proceedings of the IJCAI 2013, 2013

Axiomatic Analysis and Optimization of Information Retrieval Models.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Exploiting Forum Thread Structures to Improve Thread Clustering.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Information Retrieval with Time Series Query.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Statistical Translation Language Model for Twitter Search.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Content coverage maximization on word networks for hierarchical topic summarization.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Mining entity attribute synonyms via compact clustering.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Unsupervised identification of synonymous query intent templates for attribute intents.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Mining causal topics in text data: iterative topic modeling with time series feedback.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Compact explanatory opinion summarization.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

FindiLike: a preference driven entity search engine for evaluating entity retrieval and opinion summarization.
Proceedings of the 2013 workshop on Living labs for information retrieval evaluation, 2013

A probabilistic mixture model for mining and analyzing product search log.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Leveraging medical thesauri and physician feedback for improving medical literature retrieval for case queries.
J. Am. Medical Informatics Assoc., 2012

Opinion-based entity ranking.
Inf. Retr., 2012

Integer linear programming for Constrained Multi-Aspect Committee Review Assignment.
Inf. Process. Manag., 2012

CloudSpeller: query spelling correction by using a unified hidden markov model with web-scale resources.
Proceedings of the 21st World Wide Web Conference, 2012

Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions.
Proceedings of the 21st World Wide Web Conference 2012, 2012

FindiLike: preference driven entity search.
Proceedings of the 21st World Wide Web Conference, 2012

Tapping into knowledge base for concept feedback: leveraging conceptnet to improve search results for difficult queries.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

A generalized hidden Markov model with discriminative training for query spelling correction.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

SympGraph: a framework for mining clinical notes through symptom relation graphs.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Building enriched web page representations using link paths.
Proceedings of the 23rd ACM Conference on Hypertext and Social Media, 2012

A Discriminative Model for Query Spelling Correction with Latent Structural SVM.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Reliability Prediction of Webpages in the Medical Domain.
Proceedings of the Advances in Information Retrieval, 2012

A Log-Logistic Model-Based Interpretation of TF Normalization of BM25.
Proceedings of the Advances in Information Retrieval, 2012

Axiomatic Analysis of Translation Language Model for Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2012

Score Transformation in Linear Combination for Multi-criteria Relevance Ranking.
Proceedings of the Advances in Information Retrieval, 2012

BiasTrust: teaching biased users about controversial topics.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Mining long-lasting exploratory user interests from search history.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Query likelihood with negative query generation.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Unsupervised discovery of opposing opinion networks from forum discussions.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

InCaToMi: integrative causal topic miner between textual and non-textual time series data.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Click patterns: an empirical representation of complex query intents.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Unbiased learning of controversial topics.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

Enriching text representation with frequent pattern mining for probabilistic topic modeling.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

Predicting future popularity trend of events in microblogging platforms.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

A Survey of Text Classification Algorithms.
Proceedings of the Mining Text Data, 2012

A Survey of Text Clustering Algorithms.
Proceedings of the Mining Text Data, 2012

An Introduction to Text Mining.
Proceedings of the Mining Text Data, 2012

2011
OpinRank Review Dataset.
Dataset, July, 2011

Diagnostic Evaluation of Information Retrieval Models.
ACM Trans. Inf. Syst., 2011

Efficient Keyword-Based Search for Top-K Cells in Text Cube.
IEEE Trans. Knowl. Data Eng., 2011

BeeSpace Navigator: exploratory analysis of gene function using semantic indexing of biological literature.
Nucleic Acids Res., 2011

Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA.
Inf. Retr., 2011

Geographical topic discovery and comparison.
Proceedings of the 20th International Conference on World Wide Web, 2011

Automatic construction of a context-aware sentiment lexicon: an optimization approach.
Proceedings of the 20th International Conference on World Wide Web, 2011

Mining named entities with temporally correlated bursts from multilingual web news streams.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

Beyond search: statistical topic models for text analysis.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Learning online discussion structures by conditional random fields.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

A boosting approach to improving pseudo-relevance feedback.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

When documents are very long, BM25 fails!
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Unsupervised query segmentation using clickthrough for information retrieval.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Latent aspect rating analysis without aspect keyword supervision.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Content-driven trust propagation framework.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Axiomatic Analysis and Optimization of Information Retrieval Models.
Proceedings of the Advances in Information Retrieval Theory, 2011

LPTA: A Probabilistic Model for Latent Periodic Topic Analysis.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Exploiting Thread Structures to Improve Smoothing of Language Models for Forum Post Retrieval.
Proceedings of the Advances in Information Retrieval, 2011

Adaptive term frequency normalization for BM25.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Lower-bounding term frequency normalization.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Interactive sense feedback for difficult queries.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Improving retrieval accuracy of difficult queries through generalizing negative document language models.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Automatic query reformulation with syntactic operators to alleviate search difficulty.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

MiTexCube: MicroTextCluster Cube for Online Analysis of Text Cells.
Proceedings of the 2011 Conference on Intelligent Data Understanding, 2011

Structural Topic Model for Latent Topical Structure Analysis.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Web N-gram workshop 2010.
SIGIR Forum, 2010

BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects.
Nucleic Acids Res., 2010

Introduction to special issue on learning to rank for information retrieval.
Inf. Retr., 2010

Discovery of gene network variability across samples representing multiple classes.
Int. J. Bioinform. Res. Appl., 2010

Identifying overrepresented concepts in gene lists from literature: a statistical approach based on Poisson mixture model.
BMC Bioinform., 2010

Towards natural question guided search.
Proceedings of the 19th International Conference on World Wide Web, 2010

Positional relevance model for pseudo-relevance feedback.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Estimation of statistical translation models based on mutual information for ad hoc information retrieval.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Latent aspect rating analysis on review text data: a rating regression approach.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

TopCells: Keyword-based search of top-k aggregated documents in text cube.
Proceedings of the 26th International Conference on Data Engineering, 2010

Summarizing Contrastive Viewpoints in Opinionated Text.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Aggregation of Multiple Judgments for Evaluating Ordered Lists.
Proceedings of the Advances in Information Retrieval, 2010

Shallow Information Extraction from Medical Forum Data.
Proceedings of the COLING 2010, 2010

Exploiting Structured Ontology to Organize Scattered Online Opinions.
Proceedings of the COLING 2010, 2010

Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions.
Proceedings of the COLING 2010, 2010

Medical Case-based Retrieval by Leveraging Medical Ontology and Physician Feedback: UIUC-IBM at ImageCLEF 2010.
Proceedings of the CLEF 2010 LABs and Workshops, 2010

PTM: probabilistic topic mapping model for mining parallel document collections.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Improving one-class collaborative filtering by incorporating rich user information.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Exploration-exploitation tradeoff in interactive relevance feedback.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Keyword Search in Text Cube: Finding Top-k Aggregated Cell Documents.
Proceedings of the 2010 Conference on Intelligent Data Understanding, 2010

Cross-Lingual Latent Topic Extraction.
Proceedings of the ACL 2010, 2010

2009
Web Search Result De-duplication and Clustering.
Proceedings of the Encyclopedia of Database Systems, 2009

Web Search Relevance Feedback.
Proceedings of the Encyclopedia of Database Systems, 2009

Learning to rank for information retrieval (LR4IR 2009).
SIGIR Forum, 2009

Topic modeling for OLAP on multidimensional text databases: topic cube and its applications.
Stat. Anal. Data Min., 2009

iNextCube: Information Network-Enhanced Text Cube.
Proc. VLDB Endow., 2009

An empirical study of gene synonym query expansion in biomedical information retrieval.
Inf. Retr., 2009

Inference of gene pathways using mixture Bayesian networks.
BMC Syst. Biol., 2009

Rated aspect summarization of short comments.
Proceedings of the 18th International Conference on World Wide Web, 2009

Adaptive Clustering of Search Results.
Proceedings of the User Modeling, 2009

Finding Related Entities by Retrieving Relations: UIUC at TREC 2009 Entity Track.
Proceedings of The Eighteenth Text REtrieval Conference, 2009

A Study of Term Proximity and Document Weighting Normalization in Pseudo Relevance Feedback--UIUC at TREC 2009 Million Query Track.
Proceedings of The Eighteenth Text REtrieval Conference, 2009

Massive Implicit Feedback: Organizing Search Logs into Topic Maps for Collaborative Surfing.
Proceedings of the Workshop on Understanding the User, 2009

Positional language models for information retrieval.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases.
Proceedings of the SIAM International Conference on Data Mining, 2009


Parallel PathFinder Algorithms for Mining Structures from Graphs.
Proceedings of the ICDM 2009, 2009

Beyond hyperlinks: organizing information footprints in search logs to support effective browsing.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

A comparative study of methods for estimating query language models with pseudo feedback.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Adaptive relevance feedback in information retrieval.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Generating comparative summaries of contradictory opinions in text.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Constrained multi-aspect expertise matching for committee review assignment.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Evaluation of methods for relative comparison of retrieval systems based on clickthroughs.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Statistical Language Models for Information Retrieval
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02130-5, 2008

DirichletRank: Solving the zero-one gap problem of PageRank.
ACM Trans. Inf. Syst., 2008

Learning to rank for information retrieval (LR4IR 2008).
SIGIR Forum, 2008

Smoothing document language models with probabilistic term count propagation.
Inf. Retr., 2008

Statistical Language Models for Information Retrieval: A Critical Review.
Found. Trends Inf. Retr., 2008

Multi-label literature classification based on the Gene Ontology graph.
BMC Bioinform., 2008

Topic modeling with network regularization.
Proceedings of the 17th International Conference on World Wide Web, 2008

Opinion integration through semi-supervised topic modeling.
Proceedings of the 17th International Conference on World Wide Web, 2008

A Study of Adaptive Relevance Feedback - UIUC TREC 2008 Relevance Feedback Experiments.
Proceedings of The Seventeenth Text REtrieval Conference, 2008

Opinion Summarization Using Entity Features and Probabilistic Sentence Coherence Optimization: UIUC at TAC 2008 Opinion Summarization Pilot.
Proceedings of the First Text Analysis Conference, 2008

A study of methods for negative relevance feedback.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

A general optimization framework for smoothing language models on graph structures.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Mining multi-faceted overviews of arbitrary topics in a text collection.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Ranking Database Queries with User Feedback: A Neural Network Approach.
Proceedings of the Database Systems for Advanced Applications, 2008

Mining term association patterns from search logs for effective query reformulation.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Multi-aspect expertise matching for review assignment.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Modeling hidden topics on document manifold.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Generating Impact-Based Summaries for Scientific Literature.
Proceedings of the ACL 2008, 2008

2007
Semantic annotation of frequent patterns.
ACM Trans. Knowl. Discov. Data, 2007

Privacy protection in personalized search.
SIGIR Forum, 2007

Learning to rank for information retrieval (LR4IR 2007).
SIGIR Forum, 2007

Meeting of the MINDS: an information retrieval research agenda.
SIGIR Forum, 2007

An empirical study of tokenization strategies for biomedical information retrieval.
Inf. Retr., 2007

Generating gene summaries from biomedical literature: A study of semi-structured summarization.
Inf. Process. Manag., 2007

Topic sentiment mixture: modeling facets and opinions in weblogs.
Proceedings of the 16th International Conference on World Wide Web, 2007

Context-Aware Wrapping: Synchronized Data Extraction.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Language Models for Genomics Information Retrieval: UIUC at TREC 2007 Genomics Track.
Proceedings of The Sixteenth Text REtrieval Conference, 2007

Learn from web search logs to organize search results.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

An exploration of proximity measures in information retrieval.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Term feedback for information retrieval with language models.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

A study of Poisson query generation model for information retrieval.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Statistical Language Models for Information Retrieval.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

A Systematic Exploration of the Feature Space for Relation Extraction.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Mining correlated bursty topic patterns from coordinated text streams.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Automatic labeling of multinomial topic models.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Collaborative Wrapping: A Turbo Framework for Web Data Extraction.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Probabilistic Models for Expert Finding.
Proceedings of the Advances in Information Retrieval, 2007

Improve retrieval accuracy for difficult queries using negative feedback.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

A two-stage approach to domain adaptation for statistical classifiers.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

Inference of Gene Pathways Using Gaussian Mixture Models.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2007

Instance Weighting for Domain Adaptation in NLP.
Proceedings of the ACL 2007, 2007

2006
Extraction of coherent relevant passages using hidden Markov models.
ACM Trans. Inf. Syst., 2006

Research Paper: Enhancing Text Categorization with Semantic-enriched Representation and Training Data Augmentation.
J. Am. Medical Informatics Assoc., 2006

A study of mixture models for collaborative filtering.
Inf. Retr., 2006

A risk minimization framework for information retrieval.
Inf. Process. Manag., 2006

A probabilistic approach to spatiotemporal theme pattern mining on weblogs.
Proceedings of the 15th international conference on World Wide Web, 2006

Robust Pseudo Feedback Estimation and HMM Passage Extraction: UIUC at TREC 2006 Genomics Track.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Language Models for Expert Finding--UIUC TREC 2006 Enterprise Track Experiments.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Latent semantic analysis for multiple-type interrelated data objects.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Regularized estimation of mixture models for robust pseudo-relevance feedback.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Semantic term matching in axiomatic approaches to information retrieval.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Automatically Generating Gene Summaries from Biomedical Literature.
Proceedings of the Biocomputing 2006, 2006

Language Model Information Retrieval with Document Expansion.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Exploiting Domain Structure for Named Entity Recognition.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Mining long-term search history to improve search accuracy.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

A mixture model for contextual text mining.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Generating semantic annotations for frequent patterns with context analysis.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Unsupervised Named Entity Transliteration Using Temporal and Phonetic Correlation.
Proceedings of the EMNLP 2006, 2006

Best-k queries on database systems.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

A probabilistic relevance propagation model for hypertext retrieval.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

Have things changed now?: an empirical study of bug characteristics in modern open source software.
Proceedings of the 1st Workshop on Architectural and System Support for Improving Software Dependability, 2006

Named Entity Transliteration with Comparable Corpora.
Proceedings of the ACL 2006, 2006

2005
UIUC/MUSC at TREC 2005 Genomics Track.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Interactive Construction of Query Language Models - UIUC TREC 2005 HARD Track Experiments.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

An Axiomatic Approach to IR--UIUC TREC 2005 Robust Track Experiments.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Active feedback in ad hoc information retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

UCAIR: a personalized search toolbar.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Context-sensitive information retrieval using implicit feedback.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

An exploration of axiomatic approaches to information retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Mining comparable bilingual text corpora for cross-language information integration.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Discovering evolutionary theme patterns from text: an exploration of temporal text mining.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Accurate language model estimation with document expansion.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Implicit user modeling for personalized search.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Accurately extracting coherent relevant passages using hidden Markov models.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

2004
A study of smoothing methods for language models applied to information retrieval.
ACM Trans. Inf. Syst., 2004

Automatic annotation of protein motif function with Gene Ontology terms.
BMC Bioinform., 2004

UIUC in HARD 2004--Passage Retrieval Using HMMs.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

A two-stage mixture model for pseudo feedback.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

A session-based search engine.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

ACES: a contextual engine for search.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

A formal study of information retrieval heuristics.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

A cross-collection mixture model for comparative text mining.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Audio segment retrieval using a short duration example query.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Subspace Clustering for Microarray Data Analysis: Multiple Criteria and Significance Assessment.
Proceedings of the 3rd International IEEE Computer Society Computational Systems Bioinformatics Conference, 2004

2003
Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002.
SIGIR Forum, 2003

Building Data Integration Systems: A Mass Collaboration Approach.
Proceedings of the International Workshop on Web and Databases, 2003

Preference-based Graphic Models for Collaborative Filtering.
Proceedings of the UAI '03, 2003

Improving the Robustness of Language Models - UIUC TREC 2003 Robust and Genomics Experiments.
Proceedings of The Twelfth Text REtrieval Conference, 2003

Active Feedback - UIUC TREC-2003 HARD Experiments.
Proceedings of The Twelfth Text REtrieval Conference, 2003

Relevance Propagation for Topic Distillation UIUC TREC 2003 Web Track Experiments.
Proceedings of The Twelfth Text REtrieval Conference, 2003

Exploiting query history for document ranking in interactive information retrieval.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Error analysis of difficult TREC topics.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Information retrieval for OCR documents: a content-based probabilistic correction model.
Proceedings of the Document Recognition and Retrieval X, 2003

Text classification from positive and unlabeled documents.
Proceedings of the 2003 ACM CIKM International Conference on Information and Knowledge Management, 2003

Collaborative filtering with decoupled models for preferences and ratings.
Proceedings of the 2003 ACM CIKM International Conference on Information and Knowledge Management, 2003

2002
Database Research at the University of Illinois at Urbana-Champaign.
SIGMOD Rec., 2002

Risk minimization and language modeling in text retrieval dissertation abstract.
SIGIR Forum, 2002

Two-stage language models for information retrieval.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Title language model for information retrieval.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

2001
Model-based Feedback in the Language Modeling Approach to Information Retrieval.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

2000
Exploration of a heuristic approach to threshold learning in adaptive filtering.
Proceedings of the SIGIR 2000: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2000

1999
Optimization in CLARIT TREC-8 Adaptive Filtering.
Proceedings of The Eighth Text REtrieval Conference, 1999

CLARIT TREC-8 Manual Ad-Hoc Experiments.
Proceedings of The Eighth Text REtrieval Conference, 1999

1998
Threshold Calibration in CLARIT Adaptive Filtering.
Proceedings of The Seventh Text REtrieval Conference, 1998

1997
Exploiting Context to Identify Lexical Atoms - A Statistical View of Linguistic Context
CoRR, 1997

Fast Statistical Parsing of Noun Phrases for Document Indexing.
Proceedings of the 5th Applied Natural Language Processing Conference, 1997

1996
Evaluation of Syntactic Phrase Indexing -- CLARIT NLP Track Report.
Proceedings of The Fifth Text REtrieval Conference, 1996

OCR Correction and Query Expansion for Retrieval on OCR Data -- CLARIT TREC-5 Confusion Track Report.
Proceedings of The Fifth Text REtrieval Conference, 1996

Experiments on Chinese Text Indexing -- CLARIT TREC-5 Chinese Track Report.
Proceedings of The Fifth Text REtrieval Conference, 1996

CLARIT Compound Queries and Constraint-Controlled Feedback in TREC-5 Ad-Hoc Experiments.
Proceedings of The Fifth Text REtrieval Conference, 1996

Noun-Phrase Analysis in Unrestricted Text for Information Retrieval.
Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics, 1996

1995
CLARIT TREC-4 Interactive Experiments.
Proceedings of The Fourth Text REtrieval Conference, 1995

1990
Preliminary ideas of a conceptual programming language.
ACM SIGPLAN Notices, 1990

1988
The kernel of Modula-2 integrated environment.
ACM SIGSOFT Softw. Eng. Notes, 1988


  Loading...