Jie Zhou

Orcid: 0000-0002-5899-5165

Affiliations:
  • Tencent Inc., WeChat AI, Pattern Recognition Center, Beijing, China


According to our database1, Jie Zhou authored at least 313 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Triple Sequence Learning for Cross-domain Recommendation.
ACM Trans. Inf. Syst., July, 2024

Decentralized Decoupled Training for Federated Long-Tailed Learning.
Trans. Mach. Learn. Res., 2024

Complex Question Enhanced Transfer Learning for Zero-Shot Joint Information Extraction.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Overview of the Tenth Dialog System Technology Challenge: DSTC10.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Exploring Universal Intrinsic Task Subspace for Few-Shot Learning via Prompt Tuning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Hyperbolic Pre-Trained Language Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory.
CoRR, 2024

DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models.
CoRR, 2024

AVG-LLaVA: A Large Multimodal Model with Adaptive Visual Granularity.
CoRR, 2024

MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation.
CoRR, 2024

A Survey on the Honesty of Large Language Models.
CoRR, 2024

POINTS: Improving Your Vision-language Model with Affordable Strategies.
CoRR, 2024

MiniCPM-V: A GPT-4V Level MLLM on Your Phone.
CoRR, 2024

Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words.
CoRR, 2024

Patch-Level Training for Large Language Models.
CoRR, 2024

Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task.
CoRR, 2024

Multilingual Knowledge Editing with Language-Agnostic Factual Neurons.
CoRR, 2024

Think out Loud: Emotion Deducing Explanation in Dialogues.
CoRR, 2024

Touch100k: A Large-Scale Touch-Language-Vision Dataset for Touch-Centric Multimodal Representation.
CoRR, 2024

Outdated Issue Aware Decoding for Reasoning Questions on Edited Knowledge.
CoRR, 2024

Rethinking Overlooked Aspects in Vision-Language Models.
CoRR, 2024

Accelerating Inference in Large Language Models with a Unified Layer Skipping Strategy.
CoRR, 2024

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies.
CoRR, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.
CoRR, 2024

Graph Representation of Narrative Context: Coherence Dependency via Retrospective Questions.
CoRR, 2024

Identifying Factual Inconsistency in Summaries: Towards Effective Utilization of Large Language Model.
CoRR, 2024

Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents.
CoRR, 2024

Previously on the Stories: Recap Snippet Identification for Story Reading.
CoRR, 2024

Prompt-Driven LLM Safeguarding via Directed Representation Optimization.
CoRR, 2024

XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

On Large Language Models' Hallucination with Regard to Known Facts.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

On Prompt-Driven Safeguarding for Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Exploring the Benefit of Activation Sparsity in Pre-training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Generation with Strictly Proper Scoring Rules.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Large Language Models Are Not Robust Multiple Choice Selectors.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Towards Codable Watermarking for Injecting Multi-Bits Information to LLMs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Enhancing Note-Level Singing Transcription Model with Unlabeled and Weakly Labeled Data.
Proceedings of the IEEE International Conference on Acoustics, 2024

Identifying Factual Inconsistencies in Summaries: Grounding LLM Inference via Task Taxonomy.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

C-LLM: Learn to Check Chinese Spelling Errors Character by Character.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

On the token distance modeling ability of higher RoPE attention dimension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Enhancing Byzantine-Resistant Aggregations with Client Embedding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

UMTIT: Unifying Recognition, Translation, and Generation for Multimodal Text Image Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Plot Retrieval as an Assessment of Abstract Semantic Association.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Fine-Grained Modeling of Narrative Context: A Coherence Perspective via Retrospective Questions.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TasTe: Teaching Large Language Models to Translate through Self-Reflection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MAVEN-ARG: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Outdated Issue Aware Decoding for Factual Knowledge Editing.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Trust in Internal or External Knowledge? Generative Multi-Modal Entity Linking with Knowledge Retriever.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Instruction Position Matters in Sequence Generation with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Continual Learning with Semi-supervised Contrastive Distillation for Incremental Neural Machine Translation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Tree-of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Teaching Large Language Models to Translate with Comparison.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Generative Multi-Modal Knowledge Retrieval with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Multi-modal graph contrastive encoding for neural machine translation.
Artif. Intell., October, 2023

A Multi-Task Multi-Stage Transitional Training Framework for Neural Chat Translation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Parallel Training of Pre-Trained Models via Chunk-Based Dynamic Memory Management.
IEEE Trans. Parallel Distributed Syst., 2023

When to Trust Aggregated Gradients: Addressing Negative Client Sampling in Federated Learning.
Trans. Mach. Learn. Res., 2023

Enabling Large Language Models to Learn from Rules.
CoRR, 2023

Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction.
CoRR, 2023

RECALL: A Benchmark for LLMs Robustness against External Counterfactual Knowledge.
CoRR, 2023

TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models.
CoRR, 2023

Improving Machine Translation with Large Language Models: A Preliminary Study with Cooperative Decoding.
CoRR, 2023

Universal Multi-modal Multi-domain Pre-trained Recommendation.
CoRR, 2023

Thoroughly Modeling Multi-domain Pre-trained Recommendation as Language.
CoRR, 2023

Improving Translation Faithfulness of Large Language Models via Augmenting Instructions.
CoRR, 2023

Instruction Position Matters in Sequence Generation with Large Language Models.
CoRR, 2023

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents.
CoRR, 2023

An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning.
CoRR, 2023

Towards Multiple References Era - Addressing Data Leakage and Limited Reference Diversity in NLG Evaluation.
CoRR, 2023

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs.
CoRR, 2023

Towards Codable Text Watermarking for Large Language Models.
CoRR, 2023

TIM: Teaching Large Language Models to Translate with Comparison.
CoRR, 2023

D<sup>2</sup>TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization.
CoRR, 2023

Mitigating Catastrophic Forgetting in Task-Incremental Continual Learning with Adaptive Classification Criterion.
CoRR, 2023

WeLayout: WeChat Layout Analysis System for the ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents.
CoRR, 2023

Investigating Forgetting in Pre-Trained Representations Through Continual Learning.
CoRR, 2023

BranchNorm: Robustly Scaling Extremely Deep Transformers.
CoRR, 2023

Unified Model Learning for Various Neural Machine Translation.
CoRR, 2023

Is ChatGPT a Good NLG Evaluator? A Preliminary Study.
CoRR, 2023

Cross-Lingual Summarization via ChatGPT.
CoRR, 2023

Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning.
CoRR, 2023

Attacking Pre-trained Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Exploring False Hard Negative Sample in Cross-Domain Recommendation.
Proceedings of the 17th ACM Conference on Recommender Systems, 2023

Interpretable User Retention Modeling in Recommendation.
Proceedings of the 17th ACM Conference on Recommender Systems, 2023

Fed-FA: Theoretically Modeling Client Data Divergence for Federated Language Backdoor Defense.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LingGe: An Automatic Ancient Chinese Poem-to-Song Generation System.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Humming2Music: Being A Composer As Long As You Can Humming.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

HyperNetwork-based Decoupling to Improve Model Generalization for Few-Shot Relation Extraction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Re³Dial: Retrieve, Reorganize and Rescale Conversations for Long-Turn Open-Domain Dialogue Pre-training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Understanding Translationese in Cross-Lingual Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enhancing Argument Structure Extraction with Efficient Leverage of Contextual Information.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

D²TV: Dual Knowledge Distillation and Target-oriented Vision Modeling for Many-to-Many Multimodal Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Multi-Source Probing for Open-Domain Conversational Understanding.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Enhancing Dialogue Generation with Conversational Concept Flows.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Emergent Modularity in Pre-trained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Plug-and-Play Knowledge Injection for Pre-trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Diffusion Theory as a Scalpel: Detecting and Purifying Poisonous Dimensions in Pre-trained Language Models Caused by Backdoor or Bias.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Soft Language Clustering for Multilingual Model Pre-training.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Personality Understanding of Fictional Characters during Book Reading.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Consistency Regularization Training for Compositional Generalization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Rethinking the Word-level Quality Estimation for Machine Translation from Human Judgement.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Towards Unifying Multi-Lingual and Cross-Lingual Summarization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Recyclable Tuning for Continual Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

WebCPM: Interactive Web Search for Chinese Long-form Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Question-Interlocutor Scope Realized Graph Modeling over Key Utterances for Dialogue Reading Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Stochastic Bridges as Effective Regularizers for Parameter-Efficient Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Rephrasing the Reference for Non-autoregressive Machine Translation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
A Survey on Cross-Lingual Summarization.
Trans. Assoc. Comput. Linguistics, 2022

State scheduling method of redundant nodes in power communication network based on least square method.
Int. J. Auton. Adapt. Commun. Syst., 2022

MKGN: A Multi-Dimensional Knowledge Enhanced Graph Network for Multi-Hop Question and Answering.
IEICE Trans. Inf. Syst., 2022

AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning.
CoRR, 2022

CSCD-IME: Correcting Spelling Errors Generated by Pinyin IME.
CoRR, 2022

Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind.
CoRR, 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Delta Tuning.
CoRR, 2022

WeLM: A Well-Read Pre-trained Language Model for Chinese.
CoRR, 2022

Rethink about the Word-level Quality Estimation for Machine Translation from Human Judgement.
CoRR, 2022

Manual-Guided Dialogue for Flexible Conversational Agents.
CoRR, 2022

Probing Causes of Hallucinations in Neural Machine Translations.
CoRR, 2022

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene.
CoRR, 2022

Towards Robust Online Dialogue Response Generation.
CoRR, 2022

Mental Health Assessment for the Chatbots.
CoRR, 2022

Emotional conversation generation with heterogeneous graph neural network.
Artif. Intell., 2022

Findings of the WMT 2022 Shared Task on Translation Suggestion.
Proceedings of the Seventh Conference on Machine Translation, 2022

BJTU-WeChat's Systems for the WMT22 Chat Translation Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Summer: WeChat Neural Machine Translation Systems for the WMT22 Biomedical Translation Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Transferability of Prompt Tuning for Natural Language Processing.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Knowledge Inheritance for Pre-trained Language Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Generating Authentic Adversarial Examples beyond Meaning-preserving with Doubly Round-trip Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Visual Dialog for Spotting the Differences between Pairs of Similar Images.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Structure-Enhanced Pop Music Generation via Harmony-Aware Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

High-Resolution and Arbitrary-Sized Chinese Landscape Painting Creation Based on Generative Adversarial Networks.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Neutral Utterances are Also Causes: Enhancing Conversational Causal Emotion Entailment with Social Commonsense Knowledge.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Rethinking the Promotion Brought by Contrastive Learning to Semi-Supervised Node Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Different Tunes Played with Equal Skill: Exploring a Unified Optimization Subspace for Parameter-Efficient Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

WeTS: A Benchmark for Translation Suggestion.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Digging Errors in NMT: Evaluating and Understanding Model Errors from Partial Hypothesis Space.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

AutoCAD: Automatically Generate Counterfactuals for Mitigating Shortcut Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible Knowledge Selection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

MAVEN-ERE: A Unified Large-scale Dataset for Event Coreference, Temporal, Causal, and Subevent Relation Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Exploring Mode Connectivity for Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ROSE: Robust Selective Fine-tuning for Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Robust k-Nearest-Neighbor Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

TSAM: A Two-Stream Attention Model for Causal Emotion Entailment.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Categorizing Semantic Representations for Neural Machine Translation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

TAKE: Topic-shift Aware Knowledge sElection for Dialogue Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Selecting Stickers in Open-Domain Dialogue through Multitask Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Conditional Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

EAG: Extract and Generate Multi-way Aligned Corpus for Complete Multi-lingual Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

RoCBert: Robust Chinese Bert with Multimodal Contrastive Pretraining.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

ELLE: Efficient Lifelong Pre-training for Emerging Data.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

A Variational Hierarchical Model for Neural Cross-Lingual Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MSCTD: A Multimodal Sentiment Chat Translation Dataset.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Scheduled Multi-task Learning for Neural Chat Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

BMInf: An Efficient Toolkit for Big Model Inference and Tuning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Fully Hyperbolic Neural Networks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Dual Context-Guided Continuous Prompt Tuning for Few-Shot Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Dual Context-Guided Continuous Prompt Tuning for Few-Shot Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
CSS-LM: A Contrastive Framework for Semi-Supervised Fine-Tuning of Pre-Trained Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Bridging Text and Video: A Universal Multimodal Transformer for Audio-Visual Scene-Aware Dialog.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

MS-Ranker: Accumulating evidence from potentially correct candidates via reinforcement learning for answer selection.
Neurocomputing, 2021

Simulated annealing for optimization of graphs and sequences.
Neurocomputing, 2021

A dependency syntactic knowledge augmented interactive architecture for end-to-end aspect-based sentiment analysis.
Neurocomputing, 2021

Research on protection of vulnerable nodes in power communication network based on the model of invalid proximity of edge.
Int. J. Auton. Adapt. Commun. Syst., 2021

Model Uncertainty-Aware Knowledge Amalgamation for Pre-Trained Language Models.
CoRR, 2021

On Transferability of Prompt Tuning for Natural Language Understanding.
CoRR, 2021

Exploring Low-dimensional Intrinsic Task Subspace via Prompt Tuning.
CoRR, 2021

MoEfication: Conditional Computation of Transformer Models for Efficient Inference.
CoRR, 2021

Constructing Emotion Consensus and Utilizing Unpaired Data for Empathetic Dialogue Generation.
CoRR, 2021

Challenging Instances are Worth Learning: Generating Valuable Negative Samples for Response Selection Training.
CoRR, 2021

Towards Expressive Communication with Internet Memes: A New Multimodal Conversation Dataset and Benchmark.
CoRR, 2021

PatrickStar: Parallel Training of Pre-trained Models via a Chunk-based Memory Management.
CoRR, 2021

Rethinking the Evaluation of Neural Machine Translation.
CoRR, 2021

WeChat AI's Submission for DSTC9 Interactive Dialogue Evaluation Track.
CoRR, 2021

Sequence-Level Training for Non-Autoregressive Neural Machine Translation.
Comput. Linguistics, 2021

CokeBERT: Contextual knowledge selection and embedding towards enhanced pre-trained language models.
AI Open, 2021

WeChat Neural Machine Translation Systems for WMT21.
Proceedings of the Sixth Conference on Machine Translation, 2021

TurboTransformers: an efficient GPU serving system for transformer models.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

Topology-Imbalance Learning for Semi-Supervised Node Classification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Context Tracking Network: Graph-based Context Modeling for Implicit Discourse Relation Recognition.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Enhancing Visual Dialog Questioner with Entity-based Strategy Learning and Augmented Guesser.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Competence-based Curriculum Learning for Multilingual Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Constructing Emotional Consensus and Utilizing Unpaired Data for Empathetic Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Different Strokes for Different Folks: Investigating Appropriate Further Pre-training Approaches for Diverse Dialogue Tasks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Scheduled Sampling Based on Decoding Steps for Neural Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Towards Making the Most of Dialogue Characteristics for Neural Chat Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

An Iterative Multi-Knowledge Transfer Network for Aspect-Based Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Dynamic Knowledge Distillation for Pre-trained Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Improving Graph-based Sentence Ordering with Iteratively Predicted Pairwise Orderings.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MOOCCubeX: A Large Knowledge-centered Repository for Adaptive Learning in MOOCs.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Modeling Explicit Concerning States for Reinforcement Learning in Visual Dialogue.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Rethinking Stealthiness of Backdoor Attack against NLP Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

CLEVE: Contrastive Pre-training for Event Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GTM: A Generative Triple-wise Model for Conversational Question Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving Gradient-based Adversarial Training for Text Classification by Contrastive Learning and Auto-Encoder.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Prevent the Language Model from being Overconfident in Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Scheduled Dialog Policy Learning: An Automatic Curriculum Learning Framework for Task-oriented Dialog System.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Confidence-Aware Scheduled Sampling for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Modeling Bilingual Conversational Characteristics for Neural Chat Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot Consistency.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Exploring Dynamic Selection of Branch Expansion Orders for Code Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Toward Fully Exploiting Heterogeneous Corpus: A Decoupled Named Entity Recognition Model with Two-stage Training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Unsupervised Knowledge Selection for Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

GoG: Relation-aware Graph-over-Graph Network for Visual Dialog.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Selective Knowledge Distillation for Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Guiding Non-Autoregressive Neural Machine Translation Decoding with Reordering Information.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Faster Depth-Adaptive Transformers.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Infusing Multi-Source Knowledge with Heterogeneous Graph Neural Network for Emotional Conversation Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Aspect-Level Sentiment-Controllable Review Generation with Mutual Learning Framework.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Accelerating Pre-trained Language Models via Calibrated Cascade.
CoRR, 2020

Distance-wise Graph Contrastive Learning.
CoRR, 2020

MS-Ranker: Accumulating Evidence from Potentially Correct Candidates for Answer Selection.
CoRR, 2020

Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models.
CoRR, 2020

Modeling Inter-Aspect Dependencies with a Non-temporal Mechanism for Aspect-Based Sentiment Analysis.
CoRR, 2020

Dual Past and Future for Neural Machine Translation.
CoRR, 2020

Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence.
CoRR, 2020

Explicitly Modeling Adaptive Depths for Transformer.
CoRR, 2020

Towards Multimodal Response Generation with Exemplar Augmentation and Curriculum Optimization.
CoRR, 2020

Learning to Encode Evolutionary Knowledge for Automatic Commenting Long Novels.
CoRR, 2020

An Iterative Knowledge Transfer Network with Routing for Aspect-based Sentiment Analysis.
CoRR, 2020

Depth-Adaptive Graph Recurrent Network for Text Classification.
CoRR, 2020

Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog.
CoRR, 2020

WeChat Neural Machine Translation Systems for WMT20.
Proceedings of the Fifth Conference on Machine Translation, 2020

Dynamic Context-guided Capsule Network for Multimodal Machine Translation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

SenseMood: Depression Detection on Social Media.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Neural Gibbs Sampling for Joint Event Argument Extraction.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Multi-Unit Transformers for Neural Machine Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

MAVEN: A Massive General Domain Event Detection Dataset.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

MovieChats: Chat like Humans in a Closed Domain.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

A Sentiment-Controllable Topic-to-Essay Generator with Topic Knowledge Graph.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Learning from Context or Names? An Empirical Study on Neural Relation Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Disentangle-based Continual Graph Representation Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Token-level Adaptive Training for Neural Machine Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

One Comment from One Perspective: An Effective Strategy for Enhancing Automatic Music Comment.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Contrastive Zero-Shot Learning for Cross-Domain Slot Filling with Adversarial Attack.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Diversifying Dialogue Generation with Non-Conversational Text.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Contextual Hierarchical Attention Network with Adaptive Objective for Dialogue State Tracking.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Unsupervised Paraphrasing by Simulated Annealing.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Continual Relation Learning via Episodic Memory Activation and Reconsolidation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Enhancing Pointer Network for Sentence Ordering with Pairwise Ordering Predictions.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Zone Unit for Recurrent Neural Networks.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

DMRM: A Dual-Channel Multi-Hop Reasoning Model for Visual Dialog.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Measuring and Relieving the Over-Smoothing Problem for Graph Neural Networks from the Topological View.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Improving Node Classification by Co-training Node Pair Classification: A Novel Training Framework for General Graph Neural Networks.
CoRR, 2019

Semantic Graph Convolutional Network for Implicit Discourse Relation Classification.
CoRR, 2019

An end-to-end Neural Network Framework for Text Clustering.
CoRR, 2019

Option Comparison Network for Multiple-choice Reading Comprehension.
CoRR, 2019

A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Enhancing Context Modeling with a Query-Guided Capsule Network for Document-level Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

HMEAE: Hierarchical Modular Event Argument Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

NumNet: Machine Reading Comprehension with Numerical Reasoning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Novel Aspect-Guided Deep Transition Model for Aspect Based Sentiment Analysis.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

FewRel 2.0: Towards More Challenging Few-Shot Relation Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Metaphorical Analysis of Five Senses and Emotions in Mandarin Chinese.
Proceedings of the Chinese Lexical Semantics - 20th Workshop, 2019

DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Improving Multi-turn Dialogue Modelling with Utterance ReWriter.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Key Fact as Pivot: A Two-Stage Model for Low Resource Table-to-Text Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Towards Fine-grained Text Sentiment Transfer.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Rhetorically Controlled Encoder-Decoder for Modern Chinese Poetry Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Incremental Transformer with Deliberation Decoder for Document Grounded Conversations.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2017
ME-MD: An Effective Framework for Neural Machine Translation with Multiple Encoders and Decoders.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Incorporating Word Reordering Knowledge into Attention-based Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Deep Neural Machine Translation with Linear Associative Unit.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017


  Loading...