Chenyan Xiong

Orcid: 0000-0002-0392-4183

According to our database1, Chenyan Xiong authored at least 129 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
RAGViz: Diagnose and Visualize Retrieval-Augmented Generation.
CoRR, 2024

Interpret and Control Dense Retrieval with Sparse Latent Features.
CoRR, 2024

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning.
CoRR, 2024

Harnessing Webpage UIs for Text-Rich Visual Understanding.
CoRR, 2024

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards.
CoRR, 2024

Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation.
CoRR, 2024

In-Context Probing Approximates Influence Function for Data Valuation.
CoRR, 2024

ResearchArena: Benchmarking LLMs' Ability to Collect and Organize Information as Research Agents.
CoRR, 2024

MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models.
CoRR, 2024

Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression.
CoRR, 2024

Cleaner Pretraining Corpus Curation with Neural Web Scraping.
CoRR, 2024

ActiveRAG: Revealing the Treasures of Knowledge via Active Learning.
CoRR, 2024

ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance.
CoRR, 2024


Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Fusion-in-T5: Unifying Variant Signals for Simple and Effective Document Ranking with Attention Fusion.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

2023
Neural Approaches to Conversational Information Retrieval
The Information Retrieval Series 44, Springer, ISBN: 978-3-031-23079-0, 2023

Improving Multitask Retrieval by Promoting Task Specialization.
Trans. Assoc. Comput. Linguistics, 2023

An In-depth Look at Gemini's Language Abilities.
CoRR, 2023

Distributionally Robust Unsupervised Dense Retrieval Training on Web Graphs.
CoRR, 2023

Unlock Multi-Modal Capability of Dense Retrieval via Visual Module Plugin.
CoRR, 2023

Fusion-in-T5: Unifying Document Ranking Signals for Improved Information Retrieval.
CoRR, 2023

OpenMatch-v2: An All-in-one Multi-Modality PLM-based Information Retrieval Toolkit.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Unsupervised Dense Retrieval Training with Web Anchors.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CompleQA: Benchmarking the Impacts of Knowledge Graph Completion Methods on Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Text Matching Improves Sequential Recommendation by Reducing Popularity Biases.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
ClueWeb22: 10 Billion Web Documents with Rich Information.
CoRR, 2022

COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning.
CoRR, 2022

Universal Multi-Modality Retrieval with One Unified Embedding Space.
CoRR, 2022

P<sup>3</sup> Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning.
CoRR, 2022

METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals.
CoRR, 2022

Neural Approaches to Conversational Information Retrieval.
CoRR, 2022

ClueWeb22: 10 Billion Web Documents with Rich Information.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators.
Proceedings of the Tenth International Conference on Learning Representations, 2022

COCO-DR: Combating the Distribution Shift in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Negatives.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Distantly-Supervised Evidence Retrieval Enables Question Answering without Evidence Annotation.
CoRR, 2021

Less is More: Pre-training a Strong Siamese Encoder Using a Weak Decoder.
CoRR, 2021

OpenMatch: An Open-Source Package for Information Retrieval.
CoRR, 2021

TREC CAsT 2021: The Conversational Assistance Track Overview.
Proceedings of the Thirtieth Text REtrieval Conference, 2021

Few-Shot Conversational Dense Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

OpenMatch: An Open Source Library for Neu-IR Research.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Capturing Global Informativeness in Open Domain Keyphrase Extraction.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

More Robust Dense Retrieval with Contrastive Dual Learning.
Proceedings of the ICTIR '21: The 2021 ACM SIGIR International Conference on the Theory of Information Retrieval, 2021

Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval.
Proceedings of the 9th International Conference on Learning Representations, 2021

Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

TIAGE: A Benchmark for Topic-Shift Aware Dialog Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Contrastive Multi-document Question Generation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Data Augmentation for Abstractive Query-Focused Multi-Document Summarization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Report on the first workshop on bias in automatic knowledge graph construction at AKBC 2020.
SIGIR Forum, 2020

Meta Adaptive Neural Ranking with Contrastive Synthetic Supervision.
CoRR, 2020

CMT in TREC-COVID Round 2: Mitigating the Generalization Gaps from Web to Special Domain Search.
CoRR, 2020

Proceedings of the KG-BIAS Workshop 2020 at AKBC 2020.
CoRR, 2020

Knowledge-Aware Language Model Pretraining.
CoRR, 2020

Joint Keyphrase Chunking and Salience Ranking with BERT.
CoRR, 2020

TREC CAsT 2019: The Conversational Assistance Track Overview.
CoRR, 2020

Complex Factoid Question Answering with a Free-Text Knowledge Graph.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Selective Weak Supervision for Neural Information Retrieval.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

TaxoExpan: Self-supervised Taxonomy Expansion with Position-Enhanced Graph Neural Network.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Leading Conversational Search by Suggesting Useful Questions.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

CAsT 2020: The Conversational Assistance Track Overview.
Proceedings of the Twenty-Ninth Text REtrieval Conference, 2020

Few-Shot Generative Conversational Query Rewriting.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Contextual Re-Ranking with Behavior Aware Transformers.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Knowledge Enhanced Personalized Search.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Recent Advances in Conversational Information Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

CAsT-19: A Dataset for Conversational Information Seeking.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Towards Interpretable Natural Language Understanding with Explanations as Latent Variables.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Transformer-XH: Multi-Evidence Reasoning with eXtra Hop Attention.
Proceedings of the 8th International Conference on Learning Representations, 2020

Text Classification Using Label Names Only: A Language Model Self-Training Approach.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Generalizing Open Domain Fact Extraction and Verification to COVID-FACT thorough In-Domain Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Long Document Ranking with Query-Directed Sparse Transformer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Grounded Conversation Generation as Guided Traverses in Commonsense Knowledge Graphs.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Fine-grained Fact Verification with Kernel Graph Attention Network.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Latent Relation Language Models.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Special issue on knowledge graphs and semantics in text analysis and retrieval.
Inf. Retr. J., 2019

Unsupervised Common Question Generation from Multiple Documents using Reinforced Contrastive Coordinator.
CoRR, 2019

Conversation Generation with Concept Flow.
CoRR, 2019

Kernel Graph Attention Network for Fact Verification.
CoRR, 2019

Understanding the Behaviors of BERT in Ranking.
CoRR, 2019

Generic Intent Representation in Web Search.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

An Axiomatic Approach to Regularizing Neural Ranking Models.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Neural Document Expansion with User Feedback.
Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, 2019

Open Domain Web Keyphrase Extraction Beyond Language Modeling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Explore Entity Embedding Effectiveness in Entity Retrieval.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Target-Guided Open-Domain Conversation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Text Representation, Retrieval, and Understanding with Knowledge Graphs.
SIGIR Forum, 2018

Query Suggestion with Feedback Memory Network.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Convolutional Neural Networks for Soft-Matching N-Grams in Ad-hoc Search.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Improving Ad Hoc Retrieval With Bag Of Entities.
Proceedings of the Twenty-Seventh Text REtrieval Conference, 2018

Towards Better Text Understanding and Retrieval through Kernel Entity Salience Modeling.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Consistency and Variation in Kernel Neural Ranking Model.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

The Second Workshop on Knowledge Graphs and Semantics for Text Retrieval, Analysis, and Understanding (KG4IR).
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Automatic Event Salience Identification.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Overview of The First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR).
SIGIR Forum, 2017

Explicit Semantic Ranking for Academic Search via Knowledge Graph Embedding.
Proceedings of the 26th International Conference on World Wide Web, 2017

End-to-End Neural Ad-hoc Ranking with Kernel Pooling.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Word-Entity Duet Representations for Document Ranking.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Natural Language Supported Relation Matching for Question Answering with Knowledge Graphs.
Proceedings of the First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR 2017) co-located with the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017), 2017

DBpedia-Entity v2: A Test Collection for Entity Search.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

The First Workshop on Knowledge Graphs and Semantics for Text Retrieval and Analysis (KG4IR).
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

An Evaluation of the Kernel Based Neural Ranking Model in NTCIR-13 WWW.
Proceedings of the 13th NTCIR Conference, 2017

Overview of the NTCIR-13 We Want Web Task.
Proceedings of the 13th NTCIR Conference, 2017

JointSem: Combining Query Entity Linking and Entity based Document Ranking.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
An Empirical Study of Learning to Rank for Entity Search.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Bag-of-Entities Representation for Ranking.
Proceedings of the 2016 ACM on International Conference on the Theory of Information Retrieval, 2016

Query-Biased Partitioning for Selective Search.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
Query Expansion with Freebase.
Proceedings of the 2015 International Conference on The Theory of Information Retrieval, 2015

EsdRank: Connecting Query and Documents through External Semi-Structured Data.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
A language modeling approach to entity recognition and disambiguation for search queries.
Proceedings of the ERD'14, 2014

2013
Automatic Domain Partitioning for Multi-Domain Learning.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
Relational click prediction for sponsored search.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012


  Loading...