Xu Sun

Orcid: 0000-0001-8241-9320

Affiliations:
  • Peking University, School of EECS, MOE Key Lab of Computational Linguistics, China
  • University of Tokyo, Japan (PhD 2010)


According to our database1, Xu Sun authored at least 235 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Decentralized Decoupled Training for Federated Long-Tailed Learning.
Trans. Mach. Learn. Res., 2024

Temporal Reasoning Transfer from Text to Video.
CoRR, 2024

Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts.
CoRR, 2024

DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models.
CoRR, 2024

InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation.
CoRR, 2024

Towards Multimodal Video Paragraph Captioning Models Robust to Missing Modality.
CoRR, 2024

Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents.
CoRR, 2024

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Edit As You Wish: Video Caption Editing with Multi-grained User Control.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Defying Forgetting in Continual Relation Extraction via Batch Spectral Norm Regularization.
Proceedings of the International Joint Conference on Neural Networks, 2024

Towards Codable Watermarking for Injecting Multi-Bits Information to LLMs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Survey on In-context Learning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Enhancing Byzantine-Resistant Aggregations with Client Embedding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TempCompass: Do Video LLMs Really Understand Videos?
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning.
Trans. Mach. Learn. Res., 2023

A medical multimodal large language model for future pandemics.
npj Digit. Medicine, 2023

ASAT: Adaptively scaled adversarial training in time series.
Neurocomputing, 2023

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models.
CoRR, 2023

RECALL: A Benchmark for LLMs Robustness against External Counterfactual Knowledge.
CoRR, 2023

Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction.
CoRR, 2023

Towards Codable Text Watermarking for Large Language Models.
CoRR, 2023

M<sup>3</sup>IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning.
CoRR, 2023

Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning.
CoRR, 2023

A Survey for In-context Learning.
CoRR, 2023

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Fed-FA: Theoretically Modeling Client Data Divergence for Federated Language Backdoor Defense.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Can Language Models Understand Physical Concepts?
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Delving into the Openness of CLIP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention.
ACM Trans. Knowl. Discov. Data, 2022

Alleviating the Knowledge-Language Inconsistency: A Study for Deep Commonsense Knowledge.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Aligning Source Visual and Target Language Domains for Unpaired Video Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Gradient Knowledge Distillation for Pre-trained Language Models.
CoRR, 2022

Prophet Attention: Predicting Attention with Future Attention for Improved Image Captioning.
CoRR, 2022

Rethinking the Openness of CLIP.
CoRR, 2022

Distributional Correlation-Aware Knowledge Distillation for Stock Trading Volume Prediction.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Stock Trading Volume Prediction with Dual-Process Meta-Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

GA-SAM: Gradient-Strength based Adaptive Sharpness-Aware Minimization for Improved Generalization.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Dim-Krum: Backdoor-Resistant Federated Learning for NLP with Dimension-wise Krum-Based Aggregation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Holistic Sentence Embeddings for Better Out-of-Distribution Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Position Offset Label Prediction for Grammatical Error Correction.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Hierarchical Inductive Transfer for Continual Dialogue Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Well-Classified Examples Are Underestimated in Classification with Deep Neural Networks.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Adversarial parameter defense by multi-step risk minimization.
Neural Networks, 2021

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark.
CoRR, 2021

Model Uncertainty-Aware Knowledge Amalgamation for Pre-Trained Language Models.
CoRR, 2021

Well-classified Examples are Underestimated in Classification with Deep Neural Networks.
CoRR, 2021

Long-term, Short-term and Sudden Event: Trading Volume Movement Prediction with Graph-based Multi-view Modeling.
CoRR, 2021

ASAT: Adaptively Scaled Adversarial Training in Time Series.
CoRR, 2021

Rethinking Skip Connection with Layer Normalization in Transformers and ResNets.
CoRR, 2021

Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Topology-Imbalance Learning for Semi-Supervised Node Classification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Network Surgery: Injecting Data Patterns into Pre-trained Models with Minimal Instance-wise Side Effects.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

A Global Past-Future Early Exit Method for Accelerating Inference of Pre-trained Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Long-term, Short-term and Sudden Event: Trading Volume Movement Prediction with Graph-based Multi-view Modeling.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

KNAS: Green Neural Architecture Search.
Proceedings of the 38th International Conference on Machine Learning, 2021

Leveraging Word-Formation Knowledge for Chinese Word Sense Disambiguation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Translation as Cross-Domain Knowledge: Attention Augmentation for Unsupervised Cross-Domain Segmenting and Labeling Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Rethinking Denoised Auto-Encoding in Language Pre-Training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Dynamic Knowledge Distillation for Pre-trained Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Rethinking Stealthiness of Backdoor Attack against NLP Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Learning Relation Alignment for Calibrated Cross-modal Retrieval.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Contrastive Attention for Automatic Chest X-ray Report Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings?
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

EQG-RACE: Examination-Type Question Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Collaborative Group Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Exploring the Vulnerability of Deep Neural Networks: A Study of Parameter Corruption.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Training Simplification and Model Simplification for Deep Learning : A Minimal Effort Back Propagation Method.
IEEE Trans. Knowl. Data Eng., 2020

Memorized sparse backpropagation.
Neurocomputing, 2020

Accelerating Pre-trained Language Models via Calibrated Cascade.
CoRR, 2020

Learning Robust Representation for Clustering through Locality Preserving Variational Discriminative Network.
CoRR, 2020

Distance-wise Graph Contrastive Learning.
CoRR, 2020

CAPT: Contrastive Pre-Training for Learning Denoised Sequence Representations.
CoRR, 2020

Graph-based Multi-hop Reasoning for Long Text Generation.
CoRR, 2020

Layer-Wise Cross-View Decoding for Sequence-to-Sequence Learning.
CoRR, 2020

Query-Variant Advertisement Text Generation with Association Knowledge.
CoRR, 2020

Jointly Modeling Aspect and Sentiment with Dynamic Heterogeneous Graph Neural Networks.
CoRR, 2020

Prophet Attention: Predicting Attention with Future Attention.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Pretrain-KGE: Learning Knowledge Representation from Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Regularizing Dialogue Generation by Imitating Implicit Scenarios.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Rethinking Skip Connection with Layer Normalization.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Parallel Data Augmentation for Formality Style Transfer.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

How to Ask Good Questions? Try to Leverage Paraphrases.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Visual Agreement Regularized Training for Multi-Modal Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Measuring and Relieving the Over-Smoothing Problem for Graph Neural Networks from the Topological View.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019

Towards easier and faster sequence labeling for natural language processing: A search-based probabilistic online learning framework (SAPO).
Inf. Sci., 2019

Explicit Sparse Transformer: Concentrated Attention Through Explicit Selection.
CoRR, 2019

MUSE: Parallel Multi-Scale Attention for Sequence to Sequence Learning.
CoRR, 2019

Improving Node Classification by Co-training Node Pair Classification: A Novel Training Framework for General Graph Neural Networks.
CoRR, 2019

An Adaptive and Momental Bound Method for Stochastic Learning.
CoRR, 2019

Incorporating Fine-grained Events in Stock Movement Prediction.
CoRR, 2019

Group, Extract and Aggregate: Summarizing a Large Amount of Finance News for Forex Movement Prediction.
CoRR, 2019

Recursive Graphical Neural Networks for Text Classification.
CoRR, 2019

Sequence-to-sequence Pre-training with Data Augmentation for Sentence Rewriting.
CoRR, 2019

PKUSEG: A Toolkit for Multi-Domain Chinese Word Segmentation.
CoRR, 2019

Coherent Comment Generation for Chinese Articles with a Graph-to-Sequence Model.
CoRR, 2019

Memorized Sparse Backpropagation.
CoRR, 2019

Aligning Visual Regions and Textual Concepts: Learning Fine-Grained Image Representations for Image Captioning.
CoRR, 2019

Understanding and Improving Layer Normalization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Review-Driven Multi-Label Music Style Classification by Exploiting Style Correlations.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Exploring and Distilling Cross-Modal Information for Image Captioning.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Adaptive Gradient Methods with Dynamic Bound of Learning Rate.
Proceedings of the 7th International Conference on Learning Representations, 2019

Aligning Cross-Lingual Entities with Multi-Aspect Information.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Specificity-Driven Cascading Approach for Unsupervised Sentiment Modification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Asking Clarification Questions in Knowledge-Based Question Answering.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Pun-GAN: Generative Adversarial Network for Pun Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Cross-Modal Commentator: Automatic Machine Commenting Based on Cross-Modal Information.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Deep Reinforced Sequence-to-Set Model for Multi-Label Classification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Enhancing Topic-to-Essay Generation with External Commonsense Knowledge.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

MAAM: A Morphology-Aware Alignment Model for Unsupervised Bilingual Lexicon Induction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Imitation Learning for Non-Autoregressive Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Key Fact as Pivot: A Two-Stage Model for Low Resource Table-to-Text Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Towards Fine-grained Text Sentiment Transfer.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Learning to Control the Fine-grained Sentiment for Story Ending Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Coherent Comments Generation for Chinese Articles with a Graph-to-Sequence Model.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning Personalized End-to-End Goal-Oriented Dialog.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Cross-Domain and Semisupervised Named Entity Recognition in Chinese Social Media: A Unified Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Learning Unsupervised Word Mapping by Maximizing Mean Discrepancy.
CoRR, 2018

Unsupervised Machine Commenting with Neural Variational Topic Model.
CoRR, 2018

Evaluating Semantic Rationality of a Sentence: A Sememe-Word-Matching Neural Network based on HowNet.
CoRR, 2018

A Deep Reinforced Sequence-to-Set Model for Multi-Label Text Classification.
CoRR, 2018

Identifying High-Quality Chinese News Comments Based on Multi-Target Text Matching Model.
CoRR, 2018

Sememe Prediction: Learning Semantic Knowledge from Unstructured Textual Wiki Descriptions.
CoRR, 2018

Automatic Transferring between Ancient Chinese and Contemporary Chinese.
CoRR, 2018

Decoding-History-Based Adaptive Control of Attention for Neural Machine Translation.
CoRR, 2018

DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified Text.
CoRR, 2018

Exploration on Generating Traditional Chinese Medicine Prescription from Symptoms with an End-to-End method.
CoRR, 2018

Accelerating Graph-Based Dependency Parsing with Lock-Free Parallel Perceptron.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Structure Regularized Neural Network for Entity Relation Classification for Chinese Literature Text.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Chinese Dataset with Negative Full Forms for General Abbreviation Prediction.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Building an Ellipsis-aware Chinese Dependency Treebank for Web Text.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A Hierarchical End-to-End Model for Jointly Improving Text Summarization and Sentiment Classification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Learning Sentiment Memories for Sentiment Modification without Parallel Data.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Skeleton-Based Model for Promoting Coherence Among Sentences in Narrative Story Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Diversity-Promoting GAN: A Cross-Entropy Based Generative Adversarial Network for Diversified Text Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Auto-Dialabel: Labeling Dialogue Data with Unsupervised Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Learning When to Concentrate or Divert Attention: Self-Adaptive Attention Temperature for Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Does Higher Order LSTM Have Better Accuracy for Segmenting and Labeling Sequence Data?
Proceedings of the 27th International Conference on Computational Linguistics, 2018

SGM: Sequence Generation Model for Multi-label Classification.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

A Neural Question Answering Model Based on Semi-Structured Tables.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Deconvolution-Based Global Decoding for Neural Machine Translation.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Automatic Academic Paper Rating Based on Modularized Hierarchical Convolutional Neural Network.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Question Condensing Networks for Answer Selection in Community Question Answering.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Bag-of-Words as Target for Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Autoencoder as Assistant Supervisor: Improving Text Representation for Chinese Social Media Text Summarization.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Global Encoding for Abstractive Summarization.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Unpaired Sentiment-to-Sentiment Translation: A Cycled Reinforcement Learning Approach.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Duplicate Question Identification by Integrating FrameNet With Neural Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Modeling Scientific Influence for Research Trending Topic Prediction.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Complex Structure Leads to Overfitting: A Structure Regularization Decoding Method for Natural Language Processing.
CoRR, 2017

Hybrid Oracle: Making Use of Ambiguity in Transition-based Chinese Dependency Parsing.
CoRR, 2017

Does Higher Order LSTM Have Better Accuracy in Chunking and Named Entity Recognition?
CoRR, 2017

A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text.
CoRR, 2017

Training Simplification and Model Simplification for Deep Learning: A Minimal Effort Back Propagation Method.
CoRR, 2017

Deep Stacking Networks for Low-Resource Chinese Word Segmentation with Transfer Learning.
CoRR, 2017

Label Embedding Network: Learning Label Representation for Soft Training of Deep Networks.
CoRR, 2017

A Semantic Relevance Based Neural Network for Text Summarization and Text Simplification.
CoRR, 2017

Minimal Effort Back Propagation for Convolutional Neural Networks.
CoRR, 2017

Transfer Learning for Low-Resource Chinese Word Segmentation with a Novel Neural Network.
CoRR, 2017

Lock-Free Parallel Perceptron for Graph-based Dependency Parsing.
CoRR, 2017

A Generic Online Parallel Learning Framework for Large Margin Models.
CoRR, 2017

Transfer Deep Learning for Low-Resource Chinese Word Segmentation with a Novel Neural Network.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Tag-Enhanced Tree-Structured Neural Networks for Implicit Discourse Relation Classification.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Cascading Multiway Attentions for Document-level Sentiment Classification.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Addressing Domain Adaptation for Chinese Word Segmentation with Global Recurrent Structure.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting.
Proceedings of the 34th International Conference on Machine Learning, 2017

F-Score Driven Max Margin Neural Network for Named Entity Recognition in Chinese Social Media.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Improving Semantic Relevance for Sequence-to-Sequence Learning of Chinese Social Media Text Summarization.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

A Unified Model for Cross-Domain and Semi-Supervised Named Entity Recognition in Chinese Social Media.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
A New Recurrent Neural CRF for Learning Non-linear Edge Features.
CoRR, 2016

Asynchronous Parallel Learning for Neural Networks and Structured Models with Dense Features.
Proceedings of the COLING 2016, 2016

Dependency-based Gated Recursive Neural Network for Chinese Word Segmentation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Knowledge-Based Semantic Embedding for Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Towards Shockingly Easy Structured Classification: A Search-based Probabilistic Online Learning Framework.
CoRR, 2015

Multi-label Text Categorization with Joint Learning Predictions-as-Features Method.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
Structure Regularization for Structured Prediction: Theories and Experiments.
CoRR, 2014

Exact Decoding on Latent Variable Conditional Models is NP-Hard.
CoRR, 2014

Feature-Frequency-Adaptive On-line Training for Fast and Accurate Natural Language Processing.
Comput. Linguistics, 2014

Structure Regularization for Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Predicting Chinese Abbreviations with Minimum Semantic Unit and Global Constraints.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Coarse-grained Candidate Generation and Fine-grained Re-ranking for Chinese Abbreviation Prediction.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Latent Structured Perceptrons for Large-Scale Learning with Hidden Information.
IEEE Trans. Knowl. Data Eng., 2013

Large-Scale Personalized Human Activity Recognition Using Online Multitask Learning.
IEEE Trans. Knowl. Data Eng., 2013

Learning Abbreviations from Chinese and English Terms by Modeling Non-Local Information.
ACM Trans. Asian Lang. Inf. Process., 2013

Probabilistic Chinese word segmentation with non-local information and stochastic training.
Inf. Process. Manag., 2013

Generalized Abbreviation Prediction with Negative Full Forms and Its Application on Improving Chinese Web Search.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Exploring Representations from Unlabeled Data with Co-training for Chinese Word Segmentation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

A unified graph model for personalized query-oriented reference paper recommendation.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Query spelling correction using multi-task learning.
Proceedings of the 21st World Wide Web Conference, 2012

Fast multi-task learning for query spelling correction.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Fast Online Training with Frequency-Adaptive Learning Rates for Chinese Word Segmentation and New Word Detection.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Large Scale Real-Life Action Recognition Using Conditional Random Fields with Stochastic Training.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2011

A New Multi-task Learning Method for Personalized Activity Recognition.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

2010
Averaged Stochastic Gradient Descent with Feedback: An Accurate, Robust, and Fast Training Method.
Proceedings of the ICDM 2010, 2010

A Large Scale Ranker-Based System for Search Query Spelling Correction.
Proceedings of the COLING 2010, 2010

Learning Phrase-Based Spelling Error Models from Clickthrough Data.
Proceedings of the ACL 2010, 2010

2009
A Discriminative Latent Variable Chinese Segmenter with Hybrid Word/Character Information.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Latent Variable Perceptron Algorithm for Structured Classification.
Proceedings of the IJCAI 2009, 2009

Sequential Labeling with Latent Variables: An Exact Inference Algorithm and its Efficient Approximation.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

Robust Approach to Abbreviating Terms: A Discriminative Latent Variable Model with Global Information.
Proceedings of the ACL 2009, 2009

2008
Predicting Chinese Abbreviations from Definitions: An Empirical Learning Approach Using Support Vector Regression.
J. Comput. Sci. Technol., 2008

Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Imrpoved Inference.
Proceedings of the COLING 2008, 2008

2007
Word Clustering for Collocation-Based Word Sense Disambiguation.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2007

2006
Chinese Abbreviation-Definition Identification: A SVM Approach Using Context Information.
Proceedings of the PRICAI 2006: Trends in Artificial Intelligence, 2006

Chinese Abbreviation Identification Using Abbreviation-Template Features and Context Information.
Proceedings of the Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, 2006


  Loading...