Xiaojun Wan

Orcid: 0000-0001-6887-1994

Affiliations:
  • Peking University, Institute of Computer Science and Technology, MOE Key Laboratory of Computational Linguistics, Beijing, China


According to our database1, Xiaojun Wan authored at least 313 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement.
CoRR, 2024

SMART-RAG: Selection using Determinantal Matrices for Augmented Retrieval.
CoRR, 2024

Themis: Towards Flexible and Interpretable NLG Evaluation.
CoRR, 2024

MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency.
CoRR, 2024

ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions.
CoRR, 2024

WaterPool: A Watermark Mitigating Trade-offs among Imperceptibility, Efficacy and Robustness.
CoRR, 2024

Automated Similarity Metric Generation for Recommendation.
CoRR, 2024

WikiTableEdit: A Benchmark for Table Editing by Natural Language Instruction.
CoRR, 2024

Evaluating and Mitigating Number Hallucinations in Large Vision-Language Models: A Consistency Perspective.
CoRR, 2024

DPP-Based Adversarial Prompt Searching for Lanugage Models.
CoRR, 2024

Entity-Aware Multimodal Alignment Framework for News Image Captioning.
CoRR, 2024

Benchmarking Knowledge Boundary for Large Language Model: A Different Perspective on Model Evaluation.
CoRR, 2024

LLM-based NLG Evaluation: Current Status and Challenges.
CoRR, 2024

Describe Images in a Boring Way: Towards Cross-Modal Sarcasm Generation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

QAVidCap: Enhancing Video Captioning through Question Answering Techniques.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Selecting Large Language Model to Fine-tune via Rectified Scaling Law.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Cross Modal Training for ASR Error Correction with Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

PaCoST: Paired Confidence Significance Testing for Benchmark Contamination Detection in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Style-Compress: An LLM-Based Prompt Compression Framework Considering Task-Specific Styles.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Image Matters: A New Dataset and Empirical Study for Multimodal Hyperbole Detection.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Error-Robust Retrieval for Chinese Spelling Check.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Is Summary Useful or Not? An Extrinsic Human Evaluation of Text Summaries on Downstream Tasks.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Contextual Modeling for Document-level ASR Error Correction.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Are LLM-based Evaluators Confusing NLG Quality Criteria?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

History Matters: Temporal Knowledge Editing in Large Language Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
RST Discourse Parsing as Text-to-Text Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Counterfactual Representation Augmentation for Cross-Domain Sentiment Analysis.
IEEE Trans. Affect. Comput., 2023

OpinSummEval: Revisiting Automated Evaluation for Opinion Summarization.
CoRR, 2023

A Comprehensive Evaluation of Constrained Text Generation for Large Language Models.
CoRR, 2023

Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency.
CoRR, 2023

Summarization is (Almost) Dead.
CoRR, 2023

A Comprehensive Evaluation and Analysis Study for Chinese Spelling Check.
CoRR, 2023

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP.
CoRR, 2023

Human-like Summarization Evaluation with ChatGPT.
CoRR, 2023

Overview of the NLPCC 2023 Shared Task: Chinese Spelling Check.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

Creative Destruction: Can Language Models Interpret Oxymorons?
Proceedings of the Natural Language Processing and Chinese Computing, 2023

CrossDial: An Entertaining Dialogue Dataset of Chinese Crosstalk.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

WikiIns: A High-Quality Dataset for Controlled Text Editing by Natural Language Instruction.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Visual Question Generation Under Multi-granularity Cross-Modal Interaction.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

ALCUNA: Large Language Models Meet New Knowledge.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A New Benchmark and Reverse Validation Method for Passage-level Hallucination Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

New Datasets and Controllable Iterative Data Augmentation Method for Code-switching ASR Error Correction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Models See Hallucinations: Evaluating the Factuality in Video Captioning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploring Context-Aware Evaluation Metrics for Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Exploring Discourse Structure in Document-level Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploiting Summarization Data to Help Text Simplification.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Exploring the Impact of Vision Features in News Image Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MIL-Decoding: Detoxifying Language Models at Token-Level via Multiple Instance Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A New Dataset and Empirical Study for Sentence Simplification in Chinese.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Teaching the Pre-trained Model to Generate Simple Texts for Text Simplification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Evaluating Factuality in Cross-lingual Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Reference Matters: Benchmarking Factual Error Correction for Dialogue Summarization with Fine-grained Evaluation Framework.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
How to Describe Images in a More Funny Way? Towards a Modular Approach to Cross-Modal Sarcasm Generation.
CoRR, 2022

Chinese Spelling Check with Nearest Neighbors.
CoRR, 2022

Social Biases in Automatic Evaluation Metrics for NLG.
CoRR, 2022

An Empirical Study of Automatic Post-Editing.
CoRR, 2022

CC-Riddle: A Question Answering Dataset of Chinese Character Riddles.
CoRR, 2022

SimpleBERT: A Pre-trained Model That Learns to Generate Simple Words.
CoRR, 2022

A Simple Information-Based Approach to Unsupervised Domain-Adaptive Aspect-Based Sentiment Analysis.
CoRR, 2022

GraDual: Graph-based Dual-modal Representation for Image-Text Matching.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Relation-Constrained Decoding for Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Nearest Neighbor Knowledge Distillation for Neural Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

DialSummEval: Revisiting Summarization Evaluation for Dialogues.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

MOVER: Mask, Over-generate and Rank for Hyperbole Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

COSSUM: Towards Conversation-Oriented Structured Summarization for Automatic Medical Insurance Assessment.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Guiding Abstractive Dialogue Summarization with Content Planning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Diversifying Neural Text Generation with Part-of-Speech Guided Softmax and Sampling.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Visual Information Guided Zero-Shot Paraphrase Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

How Do Seq2Seq Models Perform on End-to-End Data-to-Text Generation?
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Dependency-based Mixture Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

PosterBot: A System for Generating Posters of Scientific Papers with Neural Models.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Automatic Slides Generation for Scholarly Papers: A Fine-Grained Dataset and Baselines (Student Abstract).
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

BiRdQA: A Bilingual Dataset for Question Answering on Tricky Riddles.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark.
CoRR, 2021

Neural Content Extraction for Poster Generation of Scientific Papers.
CoRR, 2021

A Syntax-Guided Grammatical Error Correction Model with Dependency Tree Correction.
CoRR, 2021

Neural Text Generation with Part-of-Speech Guided Softmax.
CoRR, 2021

Comparing Knowledge-Intensive and Data-Intensive Models for English Resource Semantic Parsing.
Comput. Linguistics, 2021

Learning a Product Relevance Model from Click-Through Data in E-Commerce.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Adverse Drug Events Detection, Extraction and Normalization from Online Comments of Chinese Patent Medicines.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

Continual Learning for Neural Machine Translation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Document-Level Text Simplification: Dataset, Criteria and Baseline.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CodeQA: A Question Answering Dataset for Source Code Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Revisiting Pivot-Based Paraphrase Generation: Language Is Not the Only Optional Pivot.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ParaSCI: A Large Scientific Paraphrase Dataset for Longer Paraphrase Generation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Structure-Aware Pre-Training for Table-to-Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

TransSum: Translating Aspect and Sentiment Embeddings for Self-Supervised Opinion Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Video Paragraph Captioning as a Text Summarization Task.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

WIND: Weighting Instances Differentially for Model-Agnostic Domain Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Making Better Use of Bilingual Information for Cross-Lingual AMR Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Bridging the Domain Gap: Improve Informal Language Translation via Counterfactual Domain Adaptation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Neural Sentence Simplification with Semantic Dependency Information.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
AMR-To-Text Generation with Graph Transformer.
Trans. Assoc. Comput. Linguistics, 2020

Constructing a Family Tree of Ten Indo-European Languages with Delexicalized Cross-linguistic Transfer Patterns.
CoRR, 2020

TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment Analysis.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Domain-Adaptive Neural Automated Essay Scoring.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Better AMR-To-Text Generation with Graph Structure Reconstruction.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Routing Enforced Generative Model for Recipe Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Homophonic Pun Generation with Lexically Constrained Rewriting.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Adversarial Text Generation via Sequence Contrast Discrimination.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Abstractive Multi-Document Summarization via Joint Learning with Single-Document Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

DivGAN: Towards Diverse Paraphrase Generation via Diversified Generative Adversarial Network.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

IGSQL: Database Schema Interaction Graph Based Neural Model for Context-Dependent Text-to-SQL Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Improving Grammatical Error Correction with Data Augmentation by Editing Latent Representation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

On the Helpfulness of Document Context to Sentence Simplification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Semantic Parsing for English as a Second Language.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Heterogeneous Graph Transformer for Graph-to-Sequence Learning.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Multimodal Transformer for Multimodal Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Automatic Generation of Citation Texts in Scholarly Papers: A Pilot Study.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Multi-Granularity Interaction Network for Extractive and Abstractive Multi-Document Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning to Ask More: Semi-Autoregressive Sequential Question Generation under Dual-Graph Interaction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Jointly Learning to Align and Summarize for Neural Cross-Lingual Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

SemSUM: Semantic Dependency Guided Neural Abstractive Summarization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

MultiSumm: Towards a Unified Model for Multi-Lingual Abstractive Summarization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Cross-language document summarization via extraction and ranking of multiple summaries.
Knowl. Inf. Syst., 2019

A Neural Approach to Irony Generation.
CoRR, 2019

A Comparative Analysis of Knowledge-Intensive and Data-Intensive Semantic Parsers.
CoRR, 2019

A Semi-Supervised Approach for Low-Resourced Text Generation.
CoRR, 2019

Massive Styles Transfer with Limited Labeled Data.
CoRR, 2019

Parsing Chinese Sentences with Grammatical Relations.
Comput. Linguistics, 2019

Automatic generation of sentimental texts via mixture adversarial networks.
Artif. Intell., 2019

Harvesting Drug Effectiveness from Social Media.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

BAB-QA: A New Neural Model for Emotion Detection in Multi-party Dialogue.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2019

PKU Paraphrase Bank: A Sentence-Level Paraphrase Corpus for Chinese.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Learning Diachronic Word Embeddings with Iterative Stable Information Alignment.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

Controllable Unsupervised Text Attribute Transfer via Editing Entangled Latent Representation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

How to Avoid Sentences Spelling Boring? Towards a Neural Approach to Unsupervised Metaphor Generation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

INS: An Interactive Chinese News Synthesis System.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Learning Bilingual Sentiment-Specific Word Embeddings without Cross-lingual Supervision.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

T-CVAE: Transformer-Based Conditioned Variational Autoencoder for Story Completion.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Multi-Domain Sentiment Classification Based on Domain-Aware Embedding and Attention.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Generating Diverse and Descriptive Image Captions Using Visual Paraphrases.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Towards a Unified End-to-End Approach for Fully Unsupervised Cross-Lingual Sentiment Analysis.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Interactive Multi-Grained Joint Model for Targeted Sentiment Analysis.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Neural Review Summarization Leveraging User and Product Information.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

DeepDial: Passage Completion on Dialogs.
Proceedings of the Web and Big Data - Third International Joint Conference, 2019

Neural Comment Generation for Source Code with Auxiliary Code Classification Task.
Proceedings of the 26th Asia-Pacific Software Engineering Conference, 2019

Automated Chess Commentator Powered by Neural Chess Engine.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Asking the Crowd: Question Analysis, Evaluation and Generation for Open Discussion on Online Forums.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Hierarchical Attention Networks for Sentence Ordering.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
QuoteRec: Toward Quote Recommendation for Writing.
ACM Trans. Inf. Syst., 2018

Towards a Neural Network Approach to Abstractive Multi-Document Summarization.
CoRR, 2018

Automatic Text Simplification Horacio Saggion (Universitat Pompeu Fabra) Morgan & Claypool (Synthesis Lectures on Human Language Technologies, edited by Graeme Hirst, volume 37), 2017, xvi+121 pp; paperback, ISBN 978-1-62705-868-1; hardcover, ISBN 978-1-68173-214-5, $69.95; ebook, ISBN 978-1-62705-869-8, $39.96; doi: 10.2200/S00700ED1V01Y201602HLT032.
Comput. Linguistics, 2018

Sentiment Analysis of Peer Review Texts for Scholarly Papers.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Overview of the NLPCC 2018 Shared Task: Grammatical Error Correction.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Overview of the NLPCC 2018 Shared Task: Single Document Summarization.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Improving Word Embeddings for Antonym Detection Using Thesauri and SentiWordNet.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Images2Poem: Generating Chinese Poetry from Image Streams.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Adapting Neural Single-Document Summarization Model for Abstractive Multi-Document Summarization: A Pilot Study.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

SentiGAN: Generating Sentimental Texts via Mixture Adversarial Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Learning to Explain Ambiguous Headlines of Online News.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Semantic Role Labeling for Learner Chinese: the Importance of Syntactic Parsing and L2-L1 Parallel Data.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Neural Maximum Subgraph Parsing for Cross-Domain Semantic Dependency Analysis.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Point Precisely: Towards Ensuring the Precision of Data in Generated Texts Using Delayed Copy Mechanism.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

A Neural Approach to Pun Generation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Language Generation via DAG Transduction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Pre- and In-Parsing Models for Neural Empty Category Detection.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Accurate SHRG-Based Semantic Parsing.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Sense-Aware Neural Models for Pun Location in Texts.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Recent advances in document summarization.
Knowl. Inf. Syst., 2017

Towards Automatic Generation of Entertaining Dialogues in Chinese Crosstalks.
CoRR, 2017

Overview of the NLPCC 2017 Shared Task: Single Document Summarization.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Towards Automatic Generation of Product Reviews from Aspect-Sentiment Scores.
Proceedings of the 10th International Conference on Natural Language Generation, 2017

Content Selection for Real-time Sports News Construction from Commentary Texts.
Proceedings of the 10th International Conference on Natural Language Generation, 2017

Leveraging Diverse Lexical Chains to Construct Essays for Chinese College Entrance Examination.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Learning to Identify Ambiguous and Misleading News Headlines.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

From Neural Sentence Summarization to Headline Generation: A Coarse-to-Fine Approach.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Towards Automatic Construction of News Overview Articles by News Synthesis.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Towards a Universal Sentiment Classifier in Multiple languages.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Quasi-Second-Order Parsing for 1-Endpoint-Crossing, Pagenumber-2 Graphs.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

The Covert Helps Parse the Overt.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Parsing for Grammatical Relations via Graph Merging.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Abstractive Document Summarization with a Graph-Based Attentional Neural Model.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Semantic Dependency Parsing via Book Embedding.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Parsing to 1-Endpoint-Crossing, Pagenumber-2 Graphs.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Greedy Flipping for Constrained Word Deletion.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Phrase-Based Presentation Slides Generation for Academic Papers.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

ATSUM: Extracting Attractive Summaries for News Propagation on Microblogs.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
CMiner: Opinion Extraction and Summarization for Chinese Microblogs.
IEEE Trans. Knowl. Data Eng., 2016

Transition-Based Parsing for Deep Dependency Structures.
Comput. Linguistics, 2016

Towards Accurate and Efficient Chinese Part-of-Speech Tagging.
Comput. Linguistics, 2016

Overview of the NLPCC-ICCPOL 2016 Shared Task: Sports News Generation from Live Webcast Scripts.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Overview of Baidu Cup 2016: Challenge on Entity Search.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Attention-based LSTM Network for Cross-Lingual Sentiment Classification.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

PKUSUMSUM : A Java Platform for Multilingual Document Summarization.
Proceedings of the COLING 2016, 2016

Graph-Based Multi-Modality Learning for Clinical Decision Support.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

A Neural Network Approach to Quote Recommendation in Writings.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Cross-Lingual Sentiment Classification with Bilingual Document Representation Learning.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Towards Constructing Sports News from Live Text Commentary.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

User Embedding for Scholarly Microblog Recommendation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Automatic Labeling of Topic Models Using Text Summaries.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

MicroScholar: Mining Scholarly Information from Chinese Microblogs.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Tweet Timeline Generation with Determinantal Point Processes.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Deep Dependency Substructure-Based Learning for Multidocument Summarization.
ACM Trans. Inf. Syst., 2015

PPSGen: Learning-Based Presentation Slides Generation for Academic Papers.
IEEE Trans. Knowl. Data Eng., 2015

CLOpinionMiner: Opinion Target Extraction in a Cross-Language Scenario.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Multi-Document Summarization via Discriminative Summary Reranking.
CoRR, 2015

Learning to Mine Chinese Coordinate Terms Using the Web.
CoRR, 2015

Mining and Analyzing the Future Works in Scientific Articles.
CoRR, 2015

Joint Matrix Factorization and Manifold-Ranking for Topic-Focused Multi-Document Summarization.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Peking: Building Semantic Dependency Graphs with a Hybrid Parser.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Overview of the NLPCC 2015 Shared Task: Weibo-Oriented Chinese News Summarization.
Proceedings of the Natural Language Processing and Chinese Computing - 4th CCF Conference, 2015

Compressive Document Summarization via Sparse Optimization.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Phrase-based Compressive Cross-Language Summarization.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

BrailleSUM: A News Summarization System for the Blind and Visually Impaired People.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

A Data-Driven, Factorization Parser for CCG Dependency Structures.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Representation Learning for Aspect Category Detection in Online Reviews.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Learning to Recommend Quotes for Writing.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
SRRank: leveraging semantic roles for extractive multi-document summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Comparative news summarization using concept-based optimization.
Knowl. Inf. Syst., 2014

WL-index: Leveraging citation mention number to quantify an individual's scientific impact.
J. Assoc. Inf. Sci. Technol., 2014

Are all literature citations equally important? Automatic citation strength estimation and its applications.
J. Assoc. Inf. Sci. Technol., 2014

x-index: a fantastic new indicator for quantifying a scientist's scientific impact.
CoRR, 2014

Learning Bilingual Embedding Model for Cross-Language Sentiment Classification.
Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT), Warsaw, Poland, August 11-14, 2014, 2014

CTSUM: extracting more certain summaries for news articles.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Cross-language context-aware citation recommendation in scientific articles.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Peking: Profiling Syntactic Tree Parsing Techniques for Semantic Graph Parsing.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Joint Decoding of Tree Transduction Models for Sentence Compression.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Automatic Generation of Related Work Sections in Scientific Papers: An Optimization Approach.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Grammatical Relations in Chinese: GB-Ground Extraction and Data-Driven Parsing.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Emotion Classification in Microblog Texts Using Class Sequential Rules.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Data-driven, PCFG-based and Pseudo-PCFG-based Models for Chinese Dependency Parsing.
Trans. Assoc. Comput. Linguistics, 2013

Subtopic-Based Multimodality Ranking for Topic-Focused Multidocument Summarization.
Comput. Intell., 2013

AKMiner: Domain-Specific Knowledge Graph Mining from Academic Literatures.
Proceedings of the Web Information Systems Engineering - WISE 2013, 2013

Capturing Long-distance Dependencies in Sequence Models: A Case Study of Chinese Part-of-speech Tagging.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

PPSGen: Learning to Generate Presentation Slides for Academic Papers.
Proceedings of the IJCAI 2013, 2013

Collective Opinion Target Extraction in Chinese Microblogs.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Co-Regression for Cross-Language Review Rating Prediction.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Learning to Order Natural Language Texts.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

AMRec: An Intelligent System for Academic Method Recommendation.
Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

2012
Exploiting syntactic and semantic relationships between terms for opinion retrieval.
J. Assoc. Inf. Sci. Technol., 2012

S2ORM: exploiting syntactic and semantic information for opinion retrieval.
Proceedings of the 21st World Wide Web Conference, 2012

BiCWS: Mining Cognitive Differences from Bilingual Web Search Results.
Proceedings of the Web Information Systems Engineering - WISE 2012, 2012

Learning to Find Comparable Entities on the Web.
Proceedings of the Web Information Systems Engineering - WISE 2012, 2012

A Comparative Study of Cross-Lingual Sentiment Classification.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012

Hierarchical Graph Summarization: Leveraging Hybrid Information through Visible and Invisible Linkage.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2012

Cross-Language Opinion Target Extraction in Review Texts.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Update Summarization Based on Co-Ranking with Constraints.
Proceedings of the COLING 2012, 2012

Visualizing timelines: evolutionary summarization via iterative reinforcement between text and image streams.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Reducing Approximation and Estimation Errors for Chinese Lexical Processing with Heterogeneous Annotations.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Bilingual Co-Training for Sentiment Classification of Chinese Product Reviews.
Comput. Linguistics, 2011

PKUTM participation at TAC 2011 Summarization Track.
Proceedings of the Fourth Text Analysis Conference, 2011

Evolutionary timeline summarization: a balanced optimization framework via iterative substitution.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Summarizing the differences in multilingual news.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Collaborative Data Cleaning for Sentiment Classification with Noisy Training Corpus.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2011

Named Entity Recognition in Chinese News Comments on the Web.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Integrating feature ranking and clustering method to discover person relations in web news.
Proceedings of the Eighth International Conference on Fuzzy Systems and Knowledge Discovery, 2011

Timeline Generation through Evolutionary Trans-Temporal Summarization.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Using Bilingual Information for Cross-Language Document Summarization.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Comparative News Summarization Using Linear Programming.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Exploiting neighborhood knowledge for single document summarization and keyphrase extraction.
ACM Trans. Inf. Syst., 2010

A practical system for harvesting and monitoring hot topics on the web.
Proceedings of the 19th International Conference on World Wide Web, 2010

PKUTM at TREC 2010 Blog Track.
Proceedings of The Nineteenth Text REtrieval Conference, 2010

PKUTM Participation at TAC 2010 RTE and Summarization Track.
Proceedings of the Third Text Analysis Conference, 2010

EUSUM: extracting easy-to-understand english summaries for non-native readers.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

PKUTM Experiments in NTCIR-8 MOAT Task.
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

Multi-document Summarization Using Minimum Distortion.
Proceedings of the ICDM 2010, 2010

Towards a Unified Approach to Simultaneous Single-Document and Multi-Document Summarizations.
Proceedings of the COLING 2010, 2010

Opinion Target Extraction in Chinese News Comments.
Proceedings of the COLING 2010, 2010

Named Entity Resolution in Chinese News Comments on the Web.
Proceedings of the Advances in Web Technologies and Applications, 2010

Cross-Language Document Summarization Based on Machine Translation Quality Prediction.
Proceedings of the ACL 2010, 2010

CRF-based Experiments for Cross-Domain Chinese Word Segmentation at CIPS-SIGHAN-2010.
Proceedings of the CIPS-SIGHAN Joint Conference on Chinese Language Processing, 2010

2009
Towards a Novel Association Measure via Web Search Results Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

Graph-Based Multi-Modality Learning for Topic-Focused Multi-Document Summarization.
Proceedings of the IJCAI 2009, 2009

Combining Content and Context Similarities for Image Retrieval.
Proceedings of the Advances in Information Retrieval, 2009

Topic analysis for topic-focused multi-document summarization.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Co-Training for Cross-Lingual Sentiment Classification.
Proceedings of the ACL 2009, 2009

2008
Beyond topical similarity: a structural similarity measure for retrieving highly similar documents.
Knowl. Inf. Syst., 2008

Using only cross-document relationships for both generic and topic-focused multi-document summarizations.
Inf. Retr., 2008

Towards a unified approach to document similarity search using manifold-ranking of blocks.
Inf. Process. Manag., 2008

Cm-pmi: improved web-based association measure with contextual label matching.
Proceedings of the 17th International Conference on World Wide Web, 2008

Multi-document summarization using cluster-based link analysis.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Document-Based HITS Model for Multi-document Summarization.
Proceedings of the PRICAI 2008: Trends in Artificial Intelligence, 2008

Learning to Identify Comparative Sentences in Chinese Text.
Proceedings of the PRICAI 2008: Trends in Artificial Intelligence, 2008

An Exploration of Document Impact on Graph-Based Multi-Document Summarization.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Using Bilingual Knowledge and Ensemble Techniques for Unsupervised Chinese Sentiment Analysis.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

CollabRank: Towards a Collaborative Approach to Single-Document Keyphrase Extraction.
Proceedings of the COLING 2008, 2008

Single Document Keyphrase Extraction Using Neighborhood Knowledge.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
A novel document similarity measure based on earth mover's distance.
Inf. Sci., 2007

Learning information diffusion process on the web.
Proceedings of the 16th International Conference on World Wide Web, 2007

CollabSum: exploiting multiple document clustering for collaborative single document summarizations.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

TimedTextRank: adding the temporal dimension to multi-document summarization.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

OMES: a new evaluation strategy using optimal matching for document clustering.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Manifold-Ranking Based Topic-Focused Multi-Document Summarization.
Proceedings of the IJCAI 2007, 2007

Content Based Image Retrieval Using Manifold-Ranking of Blocks.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Towards a Unified Approach Based on Affinity Graph to Various Multi-document Summarizations.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2007

Towards an Iterative Reinforcement Approach for Simultaneous Document Summarization and Keyword Extraction.
Proceedings of the ACL 2007, 2007

Single Document Summarization with Document Expansion.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Using proportional transportation similarity with learned element semantics for XML document clustering.
Proceedings of the 15th international conference on World Wide Web, 2006

Block-Based Similarity Search on the Web Using Manifold-Ranking.
Proceedings of the Web Information Systems, 2006

Using Cross-Document Random Walks for Topic-Focused Multi-Document.
Proceedings of the 2006 IEEE / WIC / ACM International Conference on Web Intelligence (WI 2006), 2006

Improved Affinity Graph Based Multi-Document Summarization.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

The Great Importance of Cross-Document Relationships for Multi-document Summarization.
Proceedings of the Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, 2006

Incorporating Cross-Document Relationships Between Sentences for Single Document Summarizations.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2006

Using Proportional Transportation Distances for Measuring Document Similarity.
Proceedings of the Advances in Information Retrieval, 2006

WordRank-Based Lexical Signatures for Finding Lost or Related Web Pages.
Proceedings of the Frontiers of WWW Research and Development, 2006

Document Similarity Search Based on Manifold-Ranking of TextTiles.
Proceedings of the Information Retrieval Technology, 2006

2005
A New Retrieval Model Based on TextTiling for Document Similarity Search.
J. Comput. Sci. Technol., 2005

Extract Salient Words with WordRank for Effective Similarity Search in Text Data.
Proceedings of the Web Information Systems Engineering, 2005

A New Re-ranking Method for Generic Chinese Text Summarization and Its Evaluation.
Proceedings of the Digital Libraries: Implementing Strategies and Sharing Experiences, 2005

The earth mover's distance as a semantic measure for document similarity.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Person resolution in person search results: WebHawk.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Document Similarity Search Based on Generic Summaries.
Proceedings of the Information Retrieval Technology, 2005

2004
A Measure Based on Optimal Matching in Graph Theory for Document Similarity.
Proceedings of the Information Retrieval Technology, Asia Information Retrieval Symposium, 2004


  Loading...