Jiajun Zhang

ORCID: 0000-0001-5293-7434

Affiliations:
  • Chinese Academy of Sciences (CAS), Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China (PhD 2011)


According to our database, Jiajun Zhang authored at least 151 papers between 2008 and 2024.

Bibliography

2024
Language Imbalance Driven Rewarding for Multilingual Self-improving.
CoRR, 2024

BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation.
CoRR, 2024

Bridging the Gap between Different Vocabularies for LLM Ensemble.
CoRR, 2024

DPPA: Pruning Method for Large Language Model to Model Merging.
CoRR, 2024

Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Improving Unsupervised Neural Machine Translation via Training Data Self-Correction.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Transformer: A General Framework from Machine Translation to Others.
Mach. Intell. Res., August, 2023

Contrastive Adversarial Training for Multi-Modal Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., June, 2023

Towards Unified Multi-Domain Machine Translation With Mixture of Domain Experts.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Topic-Oriented Dialogue Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Instance-Aware Prompt Learning for Language Understanding and Generation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2023

MoDS: Model-oriented Data Selection for Instruction Tuning.
CoRR, 2023

Align after Pre-train: Improving Multilingual Generative Models with Cross-lingual Alignment.
CoRR, 2023

ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model.
CoRR, 2023

BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing.
CoRR, 2023

BigTrans: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages.
CoRR, 2023

Language Cognition and Language Computation - Human and Machine Language Understanding.
CoRR, 2023

Adapter Tuning With Task-Aware Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2023

Unified Prompt Learning Makes Pre-Trained Language Models Better Few-Shot Learners.
Proceedings of the IEEE International Conference on Acoustics, 2023

Take a Closer Look at Multilinguality! Improve Multilingual Pre-Training Using Monolingual Corpora Only.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Parameter-efficient Tuning for Large Language Model without Calculating Its Gradients.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Synchronous Inference for Multilingual Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Attention Analysis and Calibration for Transformer in Natural Language Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Dual-View Conditional Variational Auto-Encoder for Emotional Dialogue Generation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022

Enhancing Lexical Translation Consistency for Document-Level Neural Machine Translation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2022

Life-long Learning for Multilingual Neural Machine Translation with Knowledge Distillation.
CoRR, 2022

Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Norm-based Noisy Corpora Filtering and Refurbishing in Neural Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Cross-Modal Cloze Task: A New Task to Brain-to-Word Decoding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Learning Confidence for Transformer-based Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Other Roles Matter! Enhancing Role-Oriented Dialogue Summarization via Role Interactions.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Probing Word Syntactic Representations in the Brain by a Feature Elimination Method.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Neural Encoding and Decoding With Distributed Sentence Representations.
IEEE Trans. Neural Networks Learn. Syst., 2021

Graph-based Multimodal Ranking Models for Multimodal Summarization.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

Chinese Spelling Error Detection Using a Fusion Lattice LSTM.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark.
CoRR, 2021

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation.
CoRR, 2021

Augmenting Slot Values and Contexts for Spoken Language Understanding with Pretrained Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Exploiting Curriculum Learning in Unsupervised Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Entity-level Cross-modal Learning Improves Multi-modal Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Attention Calibration for Transformer in Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Synchronous Interactive Decoding for Multilingual Neural Machine Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Text Data Mining.
Springer Singapore, ISBN: 978-981-16-0100-2, 2021

2020
Deep Neural Network-based Machine Translation System Combination.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Structurally Comparative Hinge Loss for Dependency-Based Neural Text Representation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Fine-grained neural decoding with distributed word representations.
Inf. Sci., 2020

Bridging the Modality Gap for Speech-to-Text Translation.
CoRR, 2020

Neural Machine Translation: Challenges, Progress and Future.
CoRR, 2020

Synchronous bidirectional inference for neural sequence generation.
Artif. Intell., 2020

Non-autoregressive Neural Machine Translation with Distortion Model.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

CASIA's System for IWSLT 2020 Open Domain Translation.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Touch Editing: A Flexible One-Time Interaction Approach for Translation.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Knowledge Graphs Enhanced Neural Machine Translation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Knowledge Graph Enhanced Neural Machine Translation via Multi-task Learning on Sub-entity Granularity.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Distill and Replay for Continual Language Learning.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Multimodal Sentence Summarization via Multimodal Selective Encoding.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Attend, Translate and Summarize: An Efficient Method for Neural Cross-Lingual Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Multimodal Summarization with Guidance of Multimodal Reference.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Probing Brain Activation Patterns by Dissociating Semantics and Syntax in Sentences.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Keywords-Guided Abstractive Sentence Summarization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video.
IEEE Trans. Knowl. Data Eng., 2019

Attention With Sparsity Regularization for Neural Machine Translation and Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Experience-based Causality Learning for Intelligent Agents.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019

Input Method for Human Translators: A Novel Approach to Integrate Machine Translation Effectively and Imperceptibly.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019

Synchronous Bidirectional Neural Machine Translation.
Trans. Assoc. Comput. Linguistics, 2019

Chinese Spelling Error Detection Using a Fusion Lattice LSTM.
CoRR, 2019

Understanding Memory Modules on Learning Simple Algorithms.
CoRR, 2019

Select the Best Translation from Different Systems Without Reference.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

End-to-End Speech Translation with Knowledge Distillation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Sequence Generation: From Both Sides to the Middle.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

NCLS: Neural Cross-Lingual Summarization.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Synchronously Generating Two Languages with Interactive Decoding.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Are You for Real? Detecting Identity Fraud via Dialogue Interactions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Compact and Language-Sensitive Multilingual Translation Method.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Incremental Learning from Scratch for Task-Oriented Dialogue Systems.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Memory Consolidation for Contextual Spoken Language Understanding with Dialogue Logistic Inference.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Addressing the Under-Translation Problem from the Entropy Perspective.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Towards Sentence-Level Brain Decoding with Distributed Representations.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Empirical Exploring Word-Character Relationship for Chinese Sentence Representation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2018

Language-Independent Representor for Neural Machine Translation.
CoRR, 2018

A Comparable Study on Model Averaging, Ensembling and Reranking in NMT.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Exploiting Pre-Ordering for Neural Machine Translation.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

One Sentence One Model for Neural Machine Translation.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Phrase Table as Recommendation Memory for Neural Machine Translation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Multi-modal Sentence Summarization with Modality Attention and Image Filtering.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

MSMO: Multimodal Summarization with Multimodal Output.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Addressing Troublesome Words in Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Three Strategies to Improve One-to-Many Multilingual Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Teacher-Student Framework for Maintainable Dialog Manager.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Associative Multichannel Autoencoder for Multimodal Word Representation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Ensure the Correctness of the Summary: Incorporate Entailment Knowledge into Abstractive Sentence Summarization.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Source Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Review Headline Generation with User Embedding.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

Learning Multimodal Word Representation via Dynamic Fusion Methods.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Investigating Inner Properties of Multimodal Representation and Semantic Compositionality With Brain-Based Componential Semantics.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Implicit Discourse Relation Recognition for English and Chinese with Multiview Modeling and Effective Representation Learning.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2017

Augmenting Neural Sentence Summarization Through Extractive Summarization.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Look-Ahead Attention for Generation in Neural Machine Translation.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Shortcut Sequence Tagging.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

Towards Neural Machine Translation with Partially Aligned Corpora.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Learning Sentence Representation with Guidance of Human Attention.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Exploiting Word Internal Structures for Generic Chinese Sentence Representation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Word, Subword or Character? An Empirical Study of Granularity in Chinese-English NMT.
Proceedings of the Machine Translation - 13th China Workshop, 2017

Cost-Aware Learning Rate for Neural Machine Translation.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

Memory Augmented Attention Model for Chinese Implicit Discourse Relation Recognition.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

Multichannel LSTM-CRF for Named Entity Recognition in Chinese Social Media.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

Neural System Combination for Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Learning from Parenthetical Sentences for Term Translation in Machine Translation.
Proceedings of the 9th SIGHAN Workshop on Chinese Language Processing, 2017

A Dynamic Window Neural Network for CCG Supertagging.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Abstractive Cross-Language Summarization via Translation Model Enhanced Predicate Argument Structure Fusing.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Bridging Neural Machine Translation and Bilingual Dictionaries.
CoRR, 2016

Neural Name Translation Improves Neural Machine Translation.
CoRR, 2016

GuideRank: A Guided Ranking Graph Model for Multilingual Multi-document Summarization.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Predicting Implicit Discourse Relation with Multi-view Modeling and Effective Representation Learning.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Learning from User Feedback for Machine Translation in Real-Time.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

A Simple, Straightforward and Effective Model for Joint Bilingual Terms Detection and Word Alignment in SMT.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

A Bilingual Discourse Corpus and Its Applications.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Towards Zero Unknown Word in Neural Machine Translation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Exploiting Source-side Monolingual Data in Neural Machine Translation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

MinKSR: A Novel MT Evaluation Metric for Coordinating Human Translators with the CAT-Oriented Input Method.
Proceedings of the Machine Translation - 12th China Workshop, 2016

An End-to-End Chinese Discourse Parser with Adaptation to Explicit and Non-explicit Relation Recognition.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning: Shared Task, 2016

An Empirical Exploration of Skip Connections for Sequential Tagging.
Proceedings of the COLING 2016, 2016

2015
Towards Machine Translation in Semantic Vector Space.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2015

Deep Neural Networks in Machine Translation: An Overview.
IEEE Intell. Syst., 2015

Beyond Word-based Language Model in Statistical Machine Translation.
CoRR, 2015

A New Input Method for Human Translators: Integrating Machine Translation Effectively and Imperceptibly.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Predicting Implicit Discourse Relations with Purely Distributed Representations.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2015

2014
Bilingually-constrained Phrase Embeddings for Machine Translation.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

RNN-based Derivation Structure Prediction for SMT.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Mind the Gap: Machine Translation by Minimizing the Semantic Gap in Embedding Space.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Syntax-Based Translation With Bilingually Lexicalized Synchronous Tree Substitution Grammars.
IEEE Trans. Speech Audio Process., 2013

Unsupervised Tree Induction for Tree-based Translation.
Trans. Assoc. Comput. Linguistics, 2013

A unified approach for effectively integrating source-side syntactic reordering rules into phrase-based translation.
Lang. Resour. Evaluation, 2013

A Substitution-Translation-Restoration Framework for Handling Unknown Words in Statistical Machine Translation.
J. Comput. Sci. Technol., 2013

Learning a Phrase-based Translation Model from Monolingual Data with Application to Domain Adaptation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Handling Ambiguities of Bilingual Predicate-Argument Structures for Statistical Machine Translation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Towards a Chinese Common and Common Sense Knowledge Base for Sentiment Analysis.
Proceedings of the Advanced Research in Applied Artificial Intelligence, 2012

Tree-based Translation without using Parse Trees.
Proceedings of the COLING 2012, 2012

Machine Translation by Modeling Predicate-Argument Structure Transformation.
Proceedings of the COLING 2012, 2012

2011
Simple but Effective Approaches to Improving Tree-to-tree Model.
Proceedings of Machine Translation Summit XIII: Papers, 2011

Augmenting String-to-Tree Translation Models with Fuzzy Use of Source-side Syntax.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2009
A Framework for Effectively Integrating Hard and Soft Syntactic Rules into Phrase Based Translation.
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009

The CASIA statistical machine translation system for IWSLT 2009.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

2008
The CASIA statistical machine translation system for IWSLT 2008.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

Sentence Type Based Reordering Model for Statistical Machine Translation.
Proceedings of the COLING 2008, 2008

