Qun Liu

Orcid: 0000-0002-7000-1792

Affiliations:
  • Huawei Noah's Ark Lab
  • Dublin City University (former)
  • ADAPT Center (former)
  • CNGL (former)
  • Institute of Computing Technology, Chinese Academy of Sciences (former)
  • University of Chinese Academy of Sciences (former)


According to our database1, Qun Liu authored at least 434 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset.
IEEE Trans. Vis. Comput. Graph., July, 2024

ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis.
CoRR, 2024

Roadmap towards Superhuman Speech Understanding using Large Language Models.
CoRR, 2024

Subtle Errors Matter: Preference Learning via Error-injected Self-editing.
CoRR, 2024

Privacy-Preserving Quantum Annealing for Quadratic Unconstrained Binary Optimization (QUBO) Problems.
CoRR, 2024

DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels.
CoRR, 2024

ToolACE: Winning the Points of LLM Function Calling.
CoRR, 2024

Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope.
CoRR, 2024

Chain-of-Probe: Examing the Necessity and Accuracy of CoT Step-by-Step.
CoRR, 2024

Evaluating the External and Parametric Knowledge Fusion of Large Language Models.
CoRR, 2024

Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment.
CoRR, 2024

Visually Guided Generative Text-Layout Pre-training for Document Intelligence.
CoRR, 2024

Retrieval-based Full-length Wikipedia Generation for Emergent Events.
CoRR, 2024

Findings of the First Workshop on Simulating Conversational Intelligence in Chat.
CoRR, 2024

YODA: Teacher-Student Progressive Learning for Language Models.
CoRR, 2024

Visually Guided Generative Text-Layout Pre-training for Document Intelligence.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Retrieval-based Disentangled Representation Learning with Natural Language Supervision.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LFED: A Literary Fiction Evaluation Dataset for Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Prompt-Based Length Controlled Generation with Multiple Control Types.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Learning to Edit: Aligning LLMs with Knowledge Editing.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

EWEK-QA : Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Unsupervised Extractive Summarization with Learnable Length Control Strategies.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Preparing Lessons for Progressive Training on Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Aspiration drives adaptive switching between two different payoff matrices.
Appl. Math. Comput., June, 2023

Sub-Character Tokenization for Chinese Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2023

MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages.
Trans. Assoc. Comput. Linguistics, 2023

A Survey of Reasoning with Foundation Models.
CoRR, 2023

NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation.
CoRR, 2023

Data Management For Large Language Models: A Survey.
CoRR, 2023

M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models.
CoRR, 2023

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis.
CoRR, 2023

SELF: Language-Driven Self-Evolution for Large Language Model.
CoRR, 2023

FIMO: A Challenge Formal Dataset for Automated Theorem Proving.
CoRR, 2023

Prompt-Based Length Controlled Generation with Reinforcement Learning.
CoRR, 2023

Aligning Large Language Models with Human: A Survey.
CoRR, 2023

Enhancing Coherence of Extractive Summarization with Multitask Learning.
CoRR, 2023

M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models.
CoRR, 2023

Evaluating the Efficacy of Length-Controllable Machine Translation.
CoRR, 2023

End-to-end Training and Decoding for Pivot-based Cascaded Translation Model.
CoRR, 2023

Learning Homographic Disambiguation Representation for Neural Machine Translation.
CoRR, 2023

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing.
CoRR, 2023

Adapting Pre-trained Language Models for Quantum Natural Language Processing.
CoRR, 2023

Reusing Pretrained Models by Multi-linear Operators for Efficient Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Summary-Worthy Visual Representation for Abstractive Summarization in Video.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

History, Present and Future: Enhancing Dialogue Generation with Few-Shot History-Future Prompt.
Proceedings of the IEEE International Conference on Acoustics, 2023

Lexicon-injected Semantic Parsing for Task-Oriented Dialog.
Proceedings of the IEEE International Conference on Acoustics, 2023

AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Gradually Excavating External Knowledge for Implicit Complex Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Structured Pruning for Efficient Generative Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

SongRewriter: A Chinese Song Rewriting System with Controllable Content and Rhyme Scheme.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue Systems.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

NewsDialogues: Towards Proactive News Grounded Conversation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

mCLIP: Multilingual CLIP via Cross-lingual Transfer.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

KPT: Keyword-Guided Pre-training for Grounded Dialog Generation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Dynamic Multi-Branch Layers for On-Device Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Constructing Moral Discussions.
CoRR, 2022

Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding.
CoRR, 2022

Retrieval-based Disentanglement with Distant Supervision.
CoRR, 2022

Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages.
CoRR, 2022

PanGu-Coder: Program Synthesis with Function-Level Language Modeling.
CoRR, 2022

PERT: A New Solution to Pinyin to Character Conversion Task.
CoRR, 2022

Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding.
CoRR, 2022

PANGUBOT: Efficient Generative Dialogue Pre-training from Pre-trained Language Model.
CoRR, 2022

Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks.
CoRR, 2022

TGEA 2.0: A Large-Scale Diagnostically Annotated Dataset with Benchmark Tasks for Text Generation of Pretrained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

FreeTransfer-X: Safe and Label-Free Cross-Lingual Transfer from Off-the-Shelf Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Exploring extreme parameter compression for pre-trained language models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards Identifying Social Bias in Dialog Systems: Framework, Dataset, and Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

COPEN: Probing Conceptual Knowledge in Pre-trained Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Pre-training Language Models with Deterministic Factual Knowledge.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

FPT: Improving Prompt Tuning Efficiency via Progressive Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Processing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

UTC: A Unified Transformer with Inter-Task Contrastive Learning for Visual Dialog.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

CStory: A Chinese Large-scale News Storyline Dataset.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Triangular Transfer: Freezing the Pivot for Triangular Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Compilable Neural Code Generation with Compiler Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

ClusterFormer: Neural Clustering Attention for Efficient and Effective Transformer.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Compression of Generative Pre-trained Language Models via Quantization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MINER: Multi-Interest Matching Network for News Recommendation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Universal Conditional Masked Language Pre-training for Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Achieving Reliable Human Assessment of Open-Domain Dialogue Systems.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Controlled Text Generation Using Dictionary Prior in Variational Autoencoders.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

bert2BERT: Towards Reusable Pretrained Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MTRec: Multi-Task Learning over BERT for News Recommendation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Read before Generate! Faithful Long Form Question Answering with Machine Reading.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

UniDS: A Unified Dialogue System for Chit-Chat and Task-oriented Dialogues.
Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

From Fully Trained to Fully Random Embeddings: Improving Neural Machine Translation with Compact Word Embedding Tables.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Learning to Generate Explainable Plots for Neural Story Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Improving task-agnostic BERT distillation with layer mapping search.
Neurocomputing, 2021

LMTurk: Few-Shot Learners as Crowdsourcing Workers.
CoRR, 2021

JABER: Junior Arabic BERt.
CoRR, 2021

CCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation detection and diagnosis.
CoRR, 2021

CINS: Comprehensive Instruction for Few-shot Learning in Task-oriented Dialog Systems.
CoRR, 2021

KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs.
CoRR, 2021

NumGPT: Improving Numeracy Ability of Generative Pre-trained Models.
CoRR, 2021

Integrating Regular Expressions with Neural Networks via DFA.
CoRR, 2021

Learning Multilingual Representation for Natural Language Understanding with Enhanced Cross-Lingual Supervision.
CoRR, 2021

Improved OOD Generalization via Adversarial Training and Pre-training.
CoRR, 2021

PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation.
CoRR, 2021

Improving Neural Machine Translation with Compact Word Embedding Tables.
CoRR, 2021

An Approach to Improve Robustness of NLP Systems against ASR Errors.
CoRR, 2021

Dependency Graph-to-String Statistical Machine Translation.
CoRR, 2021

LightMBERT: A Simple Yet Effective Method for Multilingual BERT Distillation.
CoRR, 2021

Training Multilingual Pre-trained Language Model with Byte-level Subwords.
CoRR, 2021

Know what you don't need: Single-Shot Meta-Pruning for attention heads.
AI Open, 2021

NoahNMT at WMT 2021: Dual Transfer for Very Low Resource Supervised Machine Translation.
Proceedings of the Sixth Conference on Machine Translation, 2021

Huawei AARC's Submissions to the WMT21 Biomedical Translation Task: Domain Adaption from a Practical Perspective.
Proceedings of the Sixth Conference on Machine Translation, 2021

Multilingual Speech Translation with Unified Transformer: Huawei Noah's Ark Lab at IWSLT 2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Improved OOD Generalization via Adversarial Training and Pretraing.
Proceedings of the 38th International Conference on Machine Learning, 2021

Reweighting Augmented Samples by Minimizing the Maximal Expected Loss.
Proceedings of the 9th International Conference on Learning Representations, 2021

On Position Embeddings in BERT.
Proceedings of the 9th International Conference on Learning Representations, 2021

Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

Self-Supervised Quality Estimation for Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Document Graph for Neural Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation Training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

DyLex: Incorporating Dynamic Lexicons into BERT for Sequence Labeling.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Generate & Rank: A Multi-task Framework for Math Word Problems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Revisiting Robust Neural Machine Translation: A Transformer Case Study.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Improving Unsupervised Question Answering via Summarization-Informed Question Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Chinese WPLC: A Chinese Dataset for Evaluating Pretrained Language Models on Word Prediction Given Long-Range Context.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Two Parents, One Child: Dual Transfer for Low-Resource Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

RealTranS: End-to-End Simultaneous Speech Translation with Convolutional Weighted-Shrinking Transformer.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GhostBERT: Generate More Features with Cheap Operations for BERT.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

TGEA: An Error-Annotated Dataset and Benchmark Tasks for TextGeneration from Pretrained Language Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

HyKnow: End-to-End Task-Oriented Dialog Modeling with Hybrid Knowledge Management.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

BinaryBERT: Pushing the Limit of BERT Quantization.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings?
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

ALP-KD: Attention-Based Layer Projection for Knowledge Distillation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

BinaryBERT: Pushing the Limit of BERT Quantization.
CoRR, 2020

Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning.
CoRR, 2020

PPKE: Knowledge Representation Learning by Path-based Pre-training.
CoRR, 2020

KgPLM: Knowledge-guided Language Model Pre-training via Generative and Discriminative Learning.
CoRR, 2020

SparTerm: Learning Term-based Sparse Representation for Fast Text Retrieval.
CoRR, 2020

TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling.
CoRR, 2020

Learning to Detect Unacceptable Machine Translations for Downstream Tasks.
CoRR, 2020

DynaBERT: Dynamic BERT with Adaptive Width and Depth.
CoRR, 2020

Dictionary-based Data Augmentation for Cross-Domain Neural Machine Translation.
CoRR, 2020

PERQ: Predicting, Explaining, and Rectifying Failed Questions in KB-QA Systems.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Huawei's Submissions to the WMT20 Biomedical Translation Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

The Solution of Huawei Cloud & Noah's Ark Lab to the NLPCC-2020 Challenge: Light Pre-Training Chinese Language Model for NLP Task.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

DynaBERT: Dynamic BERT with Adaptive Width and Depth.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A General Framework for Adaptation of Neural Machine Translation to Simultaneous Translation.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Bridging the Gap between Training and Inference for Neural Machine Translation (Extended Abstract).
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

From Unsupervised Machine Translation to Adversarial Text Generation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

HyperText: Endowing FastText with Hyperbolic Geometry.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

TernaryBERT: Distillation-aware Ultra-low Bit BERT.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Why Skip If You Can Combine: A Simple Knowledge Distillation Technique for Intermediate Layers.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TinyBERT: Distilling BERT for Natural Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Integrating Graph Contextualized Knowledge into Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

The Box is in the Pen: Evaluating Commonsense Reasoning in Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Accurate Word Alignment Induction from Neural Machine Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Word-level Textual Adversarial Attacking as Combinatorial Optimization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Probabilistically Masked Language Model Capable of Autoregressive Generation in Arbitrary Word Order.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Multi-Channel Reverse Dictionary Model.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Dialog State Tracking with Reinforced Data Augmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Machine Translation Evaluation Metric Based on Dependency Parsing Model.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019

An error analysis for image-based multi-modal neural machine translation.
Mach. Transl., 2019

Multi-channel Reverse Dictionary Model.
CoRR, 2019

Learning to Predict Explainable Plots for Neural Story Generation.
CoRR, 2019

Zero-Shot Paraphrase Generation with Multilingual Language Models.
CoRR, 2019

How to Do Simultaneous Translation Better with Consecutive Neural Machine Translation?
CoRR, 2019

Pretrained Language Models for Document-Level Neural Machine Translation.
CoRR, 2019

Open the Boxes of Words: Incorporating Sememes into Textual Adversarial Attack.
CoRR, 2019

Enhancing Recurrent Neural Networks with Sememes.
CoRR, 2019

NEZHA: Neural Contextualized Representation for Chinese Language Understanding.
CoRR, 2019

GPT-based Generation for Classical Chinese Poetry.
CoRR, 2019

Bilingual-GAN: A Step Towards Parallel Text Generation.
CoRR, 2019

Huawei's NMT Systems for the WMT 2019 Biomedical Translation Task.
Proceedings of the Fourth Conference on Machine Translation, 2019

Improving Domain Adaptation Translation with Domain Invariant and Specific Information.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

ERNIE: Enhanced Language Representation with Informative Entities.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Bridging the Gap between Training and Inference for Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Modeling Semantic Compositionality with Sememe Knowledge.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Decomposable Neural Paraphrase Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Improving the Robustness of Speech Translation.
CoRR, 2018

Understanding Meanings in Multilingual Customer Feedback.
CoRR, 2018

Improving Character-Based Decoding Using Target-Side Morphological Information for Neural Machine Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

E2E NLG Challenge Submission: Towards Controllable Generation of Diverse Natural Language.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

Learning Tag Dependencies for Sequence Tagging.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Speeding Up Neural Machine Translation Decoding by Cube Pruning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Learning to Jointly Translate and Predict Dropped Pronouns with a Shared Reconstruction Mechanism.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Refining Source Representations with Relation Networks for Neural Machine Translation.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Tailoring Neural Architectures for Translating from Morphologically Rich Languages.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Knowledge Diffusion for Neural Dialogue Generation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data.
Proceedings of the Workshop on Deep Learning Approaches for Low-Resource NLP, 2018

Translating Pro-Drop Languages With Reconstruction Models.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Translating Low-Resource Languages by Vocabulary Adaptation from Close Counterparts.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2017

Providing Morphological Information for SMT Using Neural Networks.
Prague Bull. Math. Linguistics, 2017

A novel and robust approach for pro-drop language translation.
Mach. Transl., 2017

Editorial.
Mach. Transl., 2017

CASICT Tibetan Word Segmentation System for MLWS2017.
CoRR, 2017

Multilingual Multi-modal Embeddings for Natural Language Processing.
CoRR, 2017

Incorporating Global Visual Features into Attention-Based Neural Machine Translation.
CoRR, 2017

CASICT-DCU Neural Machine Translation Systems for WMT17.
Proceedings of the Second Conference on Machine Translation, 2017

Blend: a Novel Combined MT Metric Based on Direct Assessment - CASICT-DCU submission to WMT17 Metrics Task.
Proceedings of the Second Conference on Machine Translation, 2017

DCU System Report on the WMT 2017 Multi-modal Machine Translation Task.
Proceedings of the Second Conference on Machine Translation, 2017

Findings of the 2017 Conference on Machine Translation (WMT17).
Proceedings of the Second Conference on Machine Translation, 2017

Sentence-Level Multilingual Multi-modal Embedding for Natural Language Processing.
Proceedings of the International Conference Recent Advances in Natural Language Processing, 2017

Detection of Verbal Multi-Word Expressions via Conditional Random Fields with Syntactic Dependency Features and Semantic Re-Ranking.
Proceedings of the 13th Workshop on Multiword Expressions, 2017

Semantics-Enhanced Task-Oriented Dialogue Translation: A Case Study on Hotel Booking.
Proceedings of the IJCNLP 2017, Tapei, Taiwan, November 27, 2017

ADAPT Centre Cone Team at IJCNLP-2017 Task 5: A Similarity-Based Logistic Regression Approach to Multi-choice Question Answering in an Examinations Shared Task.
Proceedings of the IJCNLP 2017, Shared Tasks, Taipei, Taiwan, November 27, 2017

ME-MD: An Effective Framework for Neural Machine Translation with Multiple Encoders and Decoders.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Exploiting Cross-Sentence Context for Neural Machine Translation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Further Investigation into Reference Bias in Monolingual Evaluation of Machine Translation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Incorporating Global Visual Features into Attention-based Neural Machine Translation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Context-Aware Graph Segmentation for Graph-Based Translation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Neural Automatic Post-Editing Using Prior Alignment and Reranking.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

If You Can't Beat Them Join Them: Handcrafted Features Complement Neural Nets for Non-Factoid Answer Reranking.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Improving Evaluation of Document-level Machine Translation Quality Estimation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

ENTF: An Entropy-Based MT Evaluation Metric.
Proceedings of the Machine Translation - 13th China Workshop, 2017

Incorporating Word Reordering Knowledge into Attention-based Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Deep Neural Machine Translation with Linear Associative Unit.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Lexically Constrained Decoding for Sequence Generation Using Grid Beam Search.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Doubly-Attentive Decoder for Multi-modal Neural Machine Translation.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
A Semisupervised Tag-Transition-Based Markovian Model for Uyghur Morphology Analysis.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016

Boosting Neural POS Tagger for Farsi Using Morphological Information.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016

Combining translation memories and statistical machine translation using sparse features.
Mach. Transl., 2016

Topic-based term translation models for statistical machine translation.
Artif. Intell., 2016

A Dynamic Programming Approach to Improving Translation Memory Matching and Retrieval Using Paraphrases.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

MaxSD: A Neural Machine Translation Evaluation Metric Optimized by Maximizing Similarity Distance.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

A Novel Approach to Dropped Pronoun Translation.
Proceedings of the NAACL HLT 2016, 2016

Extending Phrase-Based Translation with Dependencies by Using Graphs.
Proceedings of the 2nd Workshop on Semantics-Driven Machine Translation, 2016

Achieving Accurate Conclusions in Evaluation of Automatic Machine Translation Metrics.
Proceedings of the NAACL HLT 2016, 2016

ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Automatic Construction of Discourse Corpora for Dialogue Translation.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Dropped pronoun generation for dialogue machine translation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Valence-arousal ratings prediction of Chinese words using similarity measures based on Word2Vec.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Variational Neural Discourse Relation Recognizer.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Memory-enhanced Decoder for Neural Machine Translation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Neural Network for Heterogeneous Annotations.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Improving Phrase-Based SMT Using Cross-Granularity Embedding Similarity.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation, 2016

Combining Translation Memories and Syntax-Based SMT: Experiments with Real Industrial Data.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation, 2016

A subtree-based factorization of dependency parsing.
Proceedings of the COLING 2016, 2016

Fast Gated Neural Domain Adaptation: Language Model as a Case Study.
Proceedings of the COLING 2016, 2016

Topic-Informed Neural Machine Translation.
Proceedings of the COLING 2016, 2016

Enriching Phrase Tables for Statistical Machine Translation Using Mixed Embeddings.
Proceedings of the COLING 2016, 2016

Interactive Attention for Neural Machine Translation.
Proceedings of the COLING 2016, 2016

Is Local Window Essential for Neural Network Based Chinese Word Segmentation?
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2016

Phrase-Level Combination of SMT and TM Using Constrained Word Lattice.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Graph-Based Translation Via Graph Segmentation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Improve the Evaluation of Translation Fluency by Using Entropy of Matched Sub-segments.
CoRR, 2015

An Automatic Machine Translation Evaluation Metric Based on Dependency Parsing Model.
CoRR, 2015

<i>gen</i>CNN: A Convolutional Architecture for Word Sequence Prediction.
CoRR, 2015

Neural Transformation Machine: A New Architecture for Sequence-to-Sequence Learning.
CoRR, 2015

Automatic Adaptation of Annotations.
Comput. Linguistics, 2015

A Comparative Study of Online Translation Services for Cross Language Information Retrieval.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

CASICT-DCU Participation in WMT2015 Metrics Task.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

MT Tuning on RED: A Dependency-Based Evaluation Metric.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Referential Translation Machines for Predicting Translation Quality and Related Statistics.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

ParFDA for Fast Deployment of Accurate Statistical Machine Translation Systems, Benchmarks, and Statistics.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Bilingual distributed phrase representations for statistical machin translation.
Proceedings of Machine Translation Summit XV: Papers, 2015

Syntax-Based Deep Matching of Short Texts.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Joint Learning of Constituency and Dependency Grammars by Decomposed Cross-Lingual Induction.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Dependency Graph-to-String Translation.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Benchmarking SMT Performance for Farsi Using the TEP++ Corpus.
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

HandyCAT - An Open-Source Platform for CAT Tool Research.
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

The DCU Discourse Parser for Connective, Argument Identification and Explicit Sense Classification.
Proceedings of the 19th Conference on Computational Natural Language Learning: Shared Task, 2015

The DCU Discourse Parser: A Sense Classification Task.
Proceedings of the 19th Conference on Computational Natural Language Learning: Shared Task, 2015

genCNN: A Convolutional Architecture for Word Sequence Prediction.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Encoding Source Language with Convolutional Neural Network for Machine Translation.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Topic-Based Dissimilarity and Sensitivity Models for Translation Rule Selection.
J. Artif. Intell. Res., 2014

RED, The DCU-CASICT Submission of Metrics Tasks.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

DCU-Lingo24 Participation in WMT 2014 Hindi-English Translation task.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

The DCU-ICTCAS MT system at WMT 2014 on German-English Translation Task.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Parallel FDA5 for Fast Deployment of Accurate Statistical Machine Translation Systems.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

DCU Terminology Translation System for Medical Query Subtask at WMT14.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Transformation and Decomposition for Efficiently Implementing and Improving Dependency-to-String Model In Moses.
Proceedings of SSST@EMNLP 2014, 2014

The Question Answering System of DCUMT in NTCIR-11 QALab.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

A NLP System of DCUMT in NTCIR-11 MedNLP-2: RNN for ICD/Time Entity Recognition and ICD Classification Tasks.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

A Novel Rule Refinement Method for SMT through Simulated Post-Editing.
Proceedings of the Natural Language Processing and Chinese Computing, 2014

Syntactic SMT Using a Discriminative Text Generation Model.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Modeling Term Translation for Document-informed Machine Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Active Learning for Post-Editing Based Incrementally Retrained MT.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

RED: A Reference Dependency Based MT Evaluation Metric.
Proceedings of the COLING 2014, 2014

A Structured Language Model for Incremental Tree-to-String Translation.
Proceedings of the COLING 2014, 2014

Augment Dependency-to-String Translation with Fixed and Floating Structures.
Proceedings of the COLING 2014, 2014

Annotation Adaptation and Language Adaptation in NLP.
Proceedings of the COLING 2014, 2014

A Dependency Edge-based Transfer Model for Statistical Machine Translation.
Proceedings of the COLING 2014, 2014

Unsupervised Joint Monolingual Character Alignment and Word Segmentation.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2014

A probabilistic feature-based fill-up for SMT.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

Review and analysis of China workshop on machine translation 2013 evaluation.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

A discriminative framework of integrating translation memory features into SMT.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

Joint Morphological Generation and Syntactic Linearization.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
DCU Participation in WMT2013 Metrics Task.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

The CNGL-DCU-Prompsit Translation Systems for WMT13.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Shallow Semantically-Informed PBSMT and HPBSMT.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

A Simple, Fast Strategy for Weighted Alignment Hypergraph.
Proceedings of the Natural Language Processing and Chinese Computing, 2013

Machine Translation in CNGL II.
Proceedings of Machine Translation Summit XIV: European projects, 2013

A Topic-Triggered Language Model for Statistical Machine Translation.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Modeling Lexical Cohesion for Document-Level Machine Translation.
Proceedings of the IJCAI 2013, 2013

Rule Refinement for Spoken Language Translation by Retrieving the Missing Translation of Content Words.
Proceedings of the 2013 International Conference on Asian Language Processing, 2013

Improving Alignment of System Combination by Using Multi-objective Optimization.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Translation with Source Constituency and Dependency Trees.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Stem Translation with Affix-Based Rule Selection for Agglutinative Languages.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

A Novel Graph-based Compact Representation of Word Alignment.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Bilingually-Guided Monolingual Dependency Grammar Induction.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Iterative Transformation of Annotation Guidelines for Constituency Parsing.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Discriminative Learning with Natural Annotations: Word Segmentation as a Case Study.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Bilingual Lexical Cohesion Trigger Model for Document-Level Machine Translation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
ICT: A System Combination for Chinese Semantic Dependency Parsing.
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

ICT: A Translation based Method for Cross-lingual Textual Entailment.
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

An Approach to N-Gram Language Model Evaluation in Phrase-Based Statistical Machine Translation.
Proceedings of the 2012 International Conference on Asian Language Processing, 2012

Iterative Annotation Transformation with Predict-Self Reestimation for Chinese Word Segmentation.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Left-to-Right Tree-to-String Decoding with Prediction.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

ICT: System Description for CoNLL-2012.
Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Unsupervised Discriminative Induction of Synchronous Grammar for Machine Translation.
Proceedings of the COLING 2012, 2012

System Combination with Extra Alignment Information.
Proceedings of the Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT@COLING 2012, 2012

Combining Multiple Alignments to Improve Machine Translation.
Proceedings of the COLING 2012, 2012

Discriminative Boosting from Dictionary and Raw Text - A Novel Approach to Build A Chinese Word Segmenter.
Proceedings of the COLING 2012, 2012

A Topic Similarity Model for Hierarchical Phrase-based Translation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Identifying High-Impact Sub-Structures for Convolution Kernels in Document-level Sentiment Classification.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Hierarchical Chunk-to-String Translation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Introduction to the Special Issue on Chinese Language Processing.
ACM Trans. Asian Lang. Inf. Process., 2011

Conditional Random Fields for Machine Translation System Combination.
Int. J. Asian Lang. Process., 2011

The <i>ICT</i>'s Patent MT System Description for NTCIR-9.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

Maximum Rank Correlation Training for Statistical Machine Translation.
Proceedings of Machine Translation Summit XIII: Papers, 2011

Multi-granularity Word Alignment and Decoding for Agglutinative Language Translation.
Proceedings of Machine Translation Summit XIII: Papers, 2011

Bagging-based System Combination for Domain Adaption.
Proceedings of Machine Translation Summit XIII: Papers, 2011

Extracting Hierarchical Rules from a Weighted Alignment Matrix.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

An Orientation Model for Hierarchical Phrase-Based Translation.
Proceedings of the International Conference on Asian Language Processing, 2011

A novel dependency-to-string model for statistical machine translation.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Fast Generation of Translation Forest for Large-Scale SMT Discriminative Training.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Relaxed Cross-lingual Projection of Constituent Syntax.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

ETS: An Error Tolerable System for Coreference Resolution.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, 2011

Adjoining Tree-to-String Translation.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Discriminative Word Alignment by Linear Modeling.
Comput. Linguistics, 2010

Statistical Translation Model Based On Source Syntax Structure.
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, 2010

The ICT statistical machine translation system for IWSLT 2010.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Multilingual Machine Translation system.
Proceedings of the 4th International Universal Communication Symposium, 2010

A fixed-point decoding approach for statistical machine translation on mobile terminals.
Proceedings of the 4th International Universal Communication Symposium, 2010

A Combination of Statistical and Rule-Based Approach for Mongolian Lexical Analysis.
Proceedings of the International Conference on Asian Language Processing, 2010

Conditional Random Fields for Machine Translation System Combination.
Proceedings of the International Conference on Asian Language Processing, 2010

Joint Tokenization and Translation.
Proceedings of the COLING 2010, 2010

Dependency Forest for Statistical Machine Translation.
Proceedings of the COLING 2010, 2010

Dependency-Based Bracketing Transduction Grammar for Statistical Machine Translation.
Proceedings of the COLING 2010, 2010

Machine Translation with Lattices and Forests.
Proceedings of the COLING 2010, 2010

Joint Parsing and Translation.
Proceedings of the COLING 2010, 2010

Effective Constituent Projection across Languages.
Proceedings of the COLING 2010, 2010

An Efficient Shift-Reduce Decoding Algorithm for Phrased-Based Machine Translation.
Proceedings of the COLING 2010, 2010

Better Filtration and Augmentation for Hierarchical Phrase-Based Translation Rules.
Proceedings of the ACL 2010, 2010

Learning Lexicalized Reordering Models from Reordering Graphs.
Proceedings of the ACL 2010, 2010

Constituency to Dependency Translation with Forests.
Proceedings of the ACL 2010, 2010

Dependency Parsing and Projection Based on Word-Pair Classification.
Proceedings of the ACL 2010, 2010

Forest-Based Semantic Role Labeling.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Improving Statistical Machine Translation Using Domain Bilingual Multiword Expressions.
Proceedings of the Workshop on Multiword Expressions: Identification, 2009

Introduction to China's CWMT2008 Machine Translation Evaluation.
Proceedings of Machine Translation Summit XII: Papers, 2009

The ICT statistical machine translation system for the IWSLT 2009.
Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2009, 2009

The ICT statistical machine translation systems for the IWSLT 2009.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

Automatic Adaptation of Annotation Standards for Dependency Parsing ? Using Projected Treebank as Source Corpus.
Proceedings of the 11th International Workshop on Parsing Technologies (IWPT-2009), 2009

Improved Reordering Rules for Hierarchical Phrase-Based Translation.
Proceedings of the 2009 International Conference on Asian Language Processing, 2009

Weighted Alignment Matrices for Statistical Machine Translation.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Bilingually-Constrained (Monolingual) Shift-Reduce Parsing.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Lattice-based System Combination for Statistical Machine Translation.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Sub-Sentence Division for Tree-Based Machine Translation.
Proceedings of the ACL 2009, 2009

Joint Decoding with Multiple Translation Models.
Proceedings of the ACL 2009, 2009

Improving Tree-to-Tree Translation with Packed Forests.
Proceedings of the ACL 2009, 2009

Automatic Adaptation of Annotation Standards: Chinese Word Segmentation and POS Tagging - A Case Study.
Proceedings of the ACL 2009, 2009

Reducing SMT Rule Table with Monolingual Key Phrase.
Proceedings of the ACL 2009, 2009

2008
Robust Multi-Target Tracking Using Mean Shift and Particle Filter with Target Model Update.
Proceedings of the VISAPP 2008: Proceedings of the Third International Conference on Computer Vision Theory and Applications, Funchal, Madeira, Portugal, January 22-25, 2008, 2008

ICT-Crossn: The System of Cross-lingual Information Retrieval of ICT in NTCIR-7.
Proceedings of the 7th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2008

The ICT system description for IWSLT 2008.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

The Maximum Entropy based Rule Selection Model for Statistical Machine Translation (Invited Paper).
Proceedings of the ISUC 2008, 2008

Refinements in BTG-based Statistical Machine Translation.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Fast commercial detection based on audio retrieval.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Maximum Entropy based Rule Selection Model for Syntax-based Statistical Machine Translation.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Word Lattice Reranking for Chinese Word Segmentation and Part-of-Speech Tagging.
Proceedings of the COLING 2008, 2008

Improving Statistical Machine Translation using Lexicalized Rule Selection.
Proceedings of the COLING 2008, 2008

Forest-Based Translation.
Proceedings of the ACL 2008, 2008

A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging.
Proceedings of the ACL 2008, 2008

Partial Matching Strategy for Phrase-based Statistical Machine Translation.
Proceedings of the ACL 2008, 2008

2007
HTRDP evaluations on Chinese information processing and intelligent human-machine interface.
Frontiers Comput. Sci. China, 2007

A Dependency Treelet String Correspondence Model for Statistical Machine Translation.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007

Shadow Elimination in Traffic Video Segmentation.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2007), 2007

The ICT statistical machine translation systems for IWSLT 2007.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007

The PICA Framework for Performance Analysis of Pattern Recognition Systems and Its Application in Broadcast News Segmentation.
Proceedings of the New Trends in Applied Artificial Intelligence, 2007

Improving Statistical Machine Translation Performance by Training Data Selection and Optimization.
Proceedings of the EMNLP-CoNLL 2007, 2007

Forest-to-String Statistical Translation Rules.
Proceedings of the ACL 2007, 2007

2006
Storage challenge - HUSt: a heterogeneous unified storage system for GIS grid.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

A Framework for Accessing General Object Storage.
Proceedings of the 2006 International Workshop on Networking, 2006

Motion and Gray Based Automatic Road Segment Method MGARS in Urban Traffic Surveillance.
Proceedings of the Advances in Machine Vision, 2006

DOE and ANOVA based Performance Influencing Factor Analysis for Evaluation of Speech Recognition Systems.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

An Approximate Analytic Performance Model of Object-Based Storage.
Proceedings of the Computational Science and Its Applications, 2006

Acquiring Translational Equivalence from a Japanese-Chinese Parallel Corpus.
Proceedings of the Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead, 2006

Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation.
Proceedings of the ACL 2006, 2006

Tree-to-String Alignment Template for Statistical Machine Translation.
Proceedings of the ACL 2006, 2006

2005
Holistic Load Balancing Strategy and Practice in Object Storage System.
Proceedings of the 2005 International Conference on Semantics, 2005

A Multi-aligner for Japanese-Chinese Parallel Corpora.
Proceedings of Machine Translation Summit X: Papers, 2005

Parsing the Penn Chinese Treebank with Semantic Knowledge.
Proceedings of the Natural Language Processing, 2005

Geographic Information Systems Grid.
Proceedings of the Advances in Grid Computing, 2005

Lexicalized Beam Thresholding Parsing with Prior and Boundary Estimates.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2005

Adaptive Policy Trigger Mechanism for OBSS.
Proceedings of the 19th International Conference on Advanced Information Networking and Applications (AINA 2005), 2005

Log-Linear Models for Word Alignment.
Proceedings of the ACL 2005, 2005

2004
An EBMT system based on word alignment.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

Tagging Complex NEs with MaxEnt Models: Layered Structures Versus Extended Tagset.
Proceedings of the Natural Language Processing, 2004

2003
Chinese Named Entity Recognition Using Role Model.
Int. J. Comput. Linguistics Chin. Lang. Process., 2003

HHMM-based Chinese Lexical Analyzer ICTCLAS.
Proceedings of the Second Workshop on Chinese Language Processing, 2003

Chinese Lexical Analysis Using Hierarchical Hidden Markov Model.
Proceedings of the Second Workshop on Chinese Language Processing, 2003

2002
Semantic Computation in a Chinese Question-Answering System.
J. Comput. Sci. Technol., 2002

基於《知網》的辭彙語義相似度計算 (Word Similarity Computing Based on How-net).
Int. J. Comput. Linguistics Chin. Lang. Process., 2002

Automatic Recognition of Chinese Unknown Words Based on Roles Tagging.
Proceedings of the First Workshop on Chinese Language Processing, 2002

2001
Automatic extraction of lexical relations from Chinese machine readable dictionary.
Proceedings of the IEEE International Conference on Systems, 2001

1998
TransEasy: A Chinese-English Machine Translation System Based on Hybrid Approach.
Proceedings of the Machine Translation and the Information Soup, 1998

1995
Efficient realization of frequently used bijections on cube-connected cycles.
J. Comput. Sci. Technol., 1995


  Loading...