Xiaoyan Zhu

Affiliations:
  • Tsinghua University, Institute for Artificial Intelligence, State Key Lab of Intelligent Technology and Systems, Beijing, China
  • Nagoya Institute of Technology, Japan (PhD 1990)


According to our database, Xiaoyan Zhu authored at least 166 papers between 2000 and 2024.

Bibliography

2024
Maximum Gaussianality training for deep speaker vector normalization.
Pattern Recognit., January, 2024

2023
EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training.
Mach. Intell. Res., April, 2023

Building Multi-domain Dialog State Trackers from Single-domain Dialogs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

KPT: Keyword-Guided Pre-training for Grounded Dialog Generation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Learning Instructions with Unlabeled Data for Zero-Shot Cross-Task Generalization.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Rethinking and Refining the Distinct Metric.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Continual Prompt Tuning for Dialog State Tracking.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Deep Normalization for Speaker Vectors.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training.
CoRR, 2021

CPM: A large-scale generative Chinese Pre-trained language model.
AI Open, 2021

CPM-2: Large-scale cost-effective pre-trained language models.
AI Open, 2021

EARL: Informative Knowledge-Grounded Conversation Generation with Entity-Agnostic Representation Learning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Semantic-based Method for Unsupervised Commonsense Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Challenges in Building Intelligent Open-domain Dialog Systems.
ACM Trans. Inf. Syst., 2020

Robust Reading Comprehension With Linguistic Constraints via Posterior Regularization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset.
Trans. Assoc. Comput. Linguistics, 2020

A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation.
Trans. Assoc. Comput. Linguistics, 2020

Knowledge-Aided Open-Domain Question Answering.
CoRR, 2020

Recent Advances and Challenges in Task-oriented Dialog System.
CoRR, 2020

CoTK: An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation.
CoRR, 2020

A Large-Scale Chinese Short-Text Conversation Dataset.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Memory-Augmented Dialogue Management for Task-Oriented Dialogue Systems.
ACM Trans. Inf. Syst., 2019

Story Ending Selection by Finding Hints From Pairwise Candidate Endings.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

SentiLR: Linguistic Knowledge Enhanced Language Representation for Sentiment Analysis.
CoRR, 2019

Domain-Constrained Advertising Keyword Generation.
Proceedings of the World Wide Web Conference, 2019

Neural Multimodal Belief Tracker with Adaptive Attention for Dialogue Systems.
Proceedings of the World Wide Web Conference, 2019

Aspect-level Sentiment Analysis using AS-Capsules.
Proceedings of the World Wide Web Conference, 2019

Long and Diverse Text Generation with Planning-based Hierarchical Variational Model.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

ARAML: A Stable Adversarial Training Framework for Text Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Relation Mention Extraction from Noisy Data with Hierarchical Reinforcement Learning.
CoRR, 2018

Sentiment Analysis by Capsules.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Commonsense Knowledge Aware Conversation Generation with Graph Attention.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

A Weakly Supervised Method for Topic Segmentation and Labeling in Goal-oriented Dialogues via Reinforcement Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Assigning Personality/Profile to a Chatting Machine for Coherent Conversation Generation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

An Interpretable Reasoning Network for Multi-Relation Question Answering.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

An Operation Network for Abstractive Sentence Compression.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Generating Informative Responses with Controlled Sentence Function.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Reinforcement Learning for Relation Classification From Noisy Data.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Encoding Syntactic Knowledge in Neural Networks for Sentiment Classification.
ACM Trans. Inf. Syst., 2017

Assigning personality/identity to a chatting machine for coherent conversation generation.
CoRR, 2017

SAR: A Semantic Analysis Approach for Recommendation.
CoRR, 2017

Give me Something Unknown: Incorporate Exploration Preference in Cognition into Recommender System.
Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017

Linguistically Regularized LSTM for Sentiment Classification.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

SSP: Semantic Space Projection for Knowledge Graph Embedding with Text Descriptions.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Linguistically Regularized LSTMs for Sentiment Classification.
CoRR, 2016

Knowledge Semantic Representation: A Generative Model for Interpretable Knowledge Graph Embedding.
CoRR, 2016

SSP: Semantic Space Projection for Knowledge Graph Embedding with Text Descriptions.
CoRR, 2016

Knowledge Graph Embedding by Flexible Translation.
Proceedings of the Principles of Knowledge Representation and Reasoning: Proceedings of the Fifteenth International Conference, 2016

From One Point to a Manifold: Knowledge Graph Embedding for Precise Link Prediction.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

A Chinese text paraphrase detection method based on dependency tree.
Proceedings of the 13th IEEE International Conference on Networking, Sensing, and Control, 2016

Attention-based LSTM for Aspect-level Sentiment Classification.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Context-aware Natural Language Generation for Spoken Dialogue Systems.
Proceedings of the COLING 2016, 2016

Product Review Summarization by Exploiting Phrase Properties.
Proceedings of the COLING 2016, 2016

GAKE: Graph Aware Knowledge Embedding.
Proceedings of the COLING 2016, 2016

TransG: A Generative Model for Knowledge Graph Embedding.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Semi-Supervised Multinomial Naive Bayes for Text Classification by Leveraging Word-Level Statistical Constraint.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Preface.
J. Comput. Sci. Technol., 2015

Margin-Based Feed-Forward Neural Network Classifiers.
CoRR, 2015

Max-Entropy Feed-Forward Clustering Neural Network.
CoRR, 2015

From One Point to A Manifold: Orbit Models for Knowledge Graph Embedding.
CoRR, 2015

TransA: An Adaptive Approach for Knowledge Graph Embedding.
CoRR, 2015

TransG: A Generative Mixture Model for Knowledge Graph Embedding.
CoRR, 2015

A Question Answering System Built on Domain Knowledge Base.
Proceedings of the Web-Age Information Management - 16th International Conference, 2015

Clustering Sentiment Phrases in Product Reviews by Constrained Co-clustering.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015

Tackling Data Sparseness in Recommendation using Social Media based Topic Hierarchy Modeling.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Sentiment Extraction by Leveraging Aspect-Opinion Association Structure.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

Optimizing the Bayesian Inference of Phylogeny on Graphic Processors.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

Learning Tag Embeddings and Tag-specific Composition Functions in Recursive Neural Network.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Estimating feature ratings through an effective review selection approach.
Knowl. Inf. Syst., 2014

Exploring the Interactions of Storylines from Informative News Events.
J. Comput. Sci. Technol., 2014

Sampling dilemma: towards effective data sampling for click prediction in sponsored search.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

A Chinese Question Answering System for Specific Domain.
Proceedings of the Web-Age Information Management - 15th International Conference, 2014

Contextual Combinatorial Bandit and its Application on Diversified Online Recommendation.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Clustering Aspect-related Phrases by Leveraging Sentiment Distribution Consistency.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Customized Organization of Social Media Contents using Focused Topic Hierarchy.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Ranking Sentiment Explanations for Review Summarization Using Dual Decomposition.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

New Word Detection for Sentiment Analysis.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Information distance between what I said and what it heard.
Commun. ACM, 2013

Topic hierarchy construction for the organization of multi-source user generated contents.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Promoting Diversity in Recommendation by Entropy Regularizer.
Proceedings of the IJCAI 2013, 2013

Functional dirichlet process.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Exploring weakly supervised latent sentiment explanations for aspect-level review analysis.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
A Unified Active Learning Framework for Biomedical Relation Extraction.
J. Comput. Sci. Technol., 2012

Sentiment Analysis with Multi-source Product Reviews.
Proceedings of the Intelligent Computing Technology - 8th International Conference, 2012

Finding nuggets in IP portfolios: core patent mining through textual temporal analysis.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Using First-Order Logic to Compress Sentences.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Guided Structure-Aware Review Summarization.
J. Comput. Sci. Technol., 2011

A New Multiword Expression Metric and Its Applications.
J. Comput. Sci. Technol., 2011

GeneTUKit: a software for document-level gene normalization.
Bioinform., 2011

Quality-biased Ranking of Short Texts in Microblogging Services.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Learning to Identify Review Spam.
Proceedings of the IJCAI 2011, 2011

Semantic Relationship Discovery with Wikipedia Structure.
Proceedings of the IJCAI 2011, 2011

Generating Breakpoint-based Timeline Overview for News Topic Retrospection.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Protein-protein interaction extraction from bio-literature with compact features and data sampling strategy.
Proceedings of the 4th International Conference on Biomedical Engineering and Informatics, 2011

2010
A New Approach for Multi-Document Update Summarization.
J. Comput. Sci. Technol., 2010

Discovering breast cancer drug candidates from biomedical literature.
Int. J. Data Min. Bioinform., 2010

The THU Summarization Systems at TAC 2010.
Proceedings of the Third Text Analysis Conference, 2010

Learning to Link Entities with Knowledge Base.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Function-Based Question Classification for General QA.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

A Review Selection Approach for Accurate Feature Rating Estimation.
Proceedings of the COLING 2010, 2010

Structure-Aware Review Mining and Summarization.
Proceedings of the COLING 2010, 2010

A Comparative Study on Ranking and Selection Strategies for Multi-Document Summarization.
Proceedings of the COLING 2010, 2010

Measuring the Non-compositionality of Multiword Expressions.
Proceedings of the COLING 2010, 2010

Recognizing Biomedical Named Entities Using Skip-Chain Conditional Random Fields.
Proceedings of the 2010 Workshop on Biomedical Natural Language Processing, 2010

Sentiment Analysis with Global Topics and Local Dependency.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Mining Reviews for Product Comparison and Recommendation.
Polibits, 2009

Building Disease-Specific Drug-Protein Connectivity Maps from Molecular Interaction Networks and PubMed Abstracts.
PLoS Comput. Biol., 2009

Extract interaction detection methods from the biological literature.
BMC Bioinform., 2009

Specialized Review Selection for Feature Rating Estimation.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Web Intelligence, 2009

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance.
Proceedings of the Second Text Analysis Conference, 2009

THU QUANTA at TAC 2009 KBP and RTE Track.
Proceedings of the Second Text Analysis Conference, 2009

Multi-document Summarization by Information Distance.
Proceedings of the ICDM 2009, 2009

Towards Automatic Generation of Gene Summary.
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009

Answering Opinion Questions with Random Walks on Graphs.
Proceedings of the ACL 2009, 2009

A query-specific opinion summarization system.
Proceedings of the 8th IEEE International Conference on Cognitive Informatics, 2009

2008
New Information Distance Measure and Its Application in Question Answering System.
J. Comput. Sci. Technol., 2008

Exploiting and integrating rich features for biological literature classification.
BMC Bioinform., 2008

THU QUANTA at TAC 2008 QA and RTE Track.
Proceedings of the First Text Analysis Conference, 2008

Tsinghua University at the Summarization Track of TAC 2008.
Proceedings of the First Text Analysis Conference, 2008

Mining disease-specific molecular association profiles from biomedical literature: a case study.
Proceedings of the 2008 ACM Symposium on Applied Computing (SAC), 2008

OntoAlbum: An Ontology Based Digital Photo Management System.
Proceedings of the Image Analysis and Recognition, 5th International Conference, 2008

A Generative Probabilistic Model for Multi-label Classification.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Classifying What-Type Questions by Head Noun Tagging.
Proceedings of the COLING 2008, 2008

Information shared by many objects.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Using Conditional Random Fields to Extract Contexts and Answers of Questions from Online Forums.
Proceedings of the ACL 2008, 2008

2007
Combining Convolution Kernels Defined on Heterogeneous Sub-structures.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Information distance from a question to an answer.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

A Novel Kernel-Based Approach for Predicting Binding Peptides for HLA Class II Molecules.
Proceedings of the Bioinformatics Research and Applications, Third International Symposium, 2007

A New Type of Feature - Loose N-Gram Feature in Text Categorization.
Proceedings of the Pattern Recognition and Image Analysis, Third Iberian Conference, 2007

Semi-supervised Pattern Learning for Extracting Relations from Bioscience Texts.
Proceedings of 5th Asia-Pacific Bioinformatics Conference, 2007

2006
A hybrid method for relation extraction from biomedical literature.
Int. J. Medical Informatics, 2006

An Ontology-Based Mining System for Competitive Intelligence in Neuroscience.
Proceedings of the Web Intelligence Meets Brain Informatics, 2006

Automatic Extraction of Genomic Glossary Triggered by Query.
Proceedings of the Data Mining for Biomedical Applications, PAKDD 2006 Workshop, 2006

A Hybrid Handwritten Chinese Address Recognition Approach.
Proceedings of the Neural Information Processing, 13th International Conference, 2006

Movie review mining and summarization.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

ONBRIRES: Ontology-Based Biological Relation Extraction System.
Proceedings of 4th Asia-Pacific Bioinformatics Conference. 13-16 February 2006, 2006

2005
Programming Style Based Program Partition.
Int. J. Softw. Eng. Knowl. Eng., 2005

Discovering patterns to extract protein-protein interactions from the literature: Part II.
Bioinform., 2005

Learning Domain-Specific Knowledge from Context--THUIR at TREC 2005 Genomics Track.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

An OCR Post-processing Approach Based on Multi-knowledge.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005

Various Features with Integrated Strategies for Protein Name Classification.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

Extended Bi-gram Features in Text Categorization.
Proceedings of the Pattern Recognition and Image Analysis, Second Iberian Conference, 2005

A New Algorithm for Pattern Optimization in Protein-Protein Interaction Extraction System.
Proceedings of the Pattern Recognition and Image Analysis, Second Iberian Conference, 2005

An Utterance Verification Algorithm in Keyword Spotting System.
Proceedings of the Pattern Recognition and Image Analysis, Second Iberian Conference, 2005

2004
Discovering patterns to extract protein-protein interactions from full texts.
Bioinform., 2004

THUIR at TREC 2004: Genomics Track.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

A Chinese OCR spelling check approach based on statistical language models.
Proceedings of the IEEE International Conference on Systems, 2004

Discovering Patterns to Extract Protein-Protein Interactions from Full Biomedical Texts.
Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications, 2004

PathwayFinder: Paving the Way Towards Automatic Pathway Extraction.
Proceedings of the Second Asia-Pacific Bioinformatics Conference (APBC 2004), 2004

2001
An effective result-feedback neural algorithm for handwritten character recognition.
Neural Parallel Sci. Comput., 2001

A Chinese spoken dialog system for blind men.
Proceedings of the IEEE International Conference on Systems, 2001

2000
A novel text-independent speaker verification method based on the global speaker model.
IEEE Trans. Syst. Man Cybern. Part A, 2000

A criterion based on Fourier transform for segmentation of connected digits.
Int. J. Document Anal. Recognit., 2000

