2025
Advantageous Parameter Expansion Training Makes Better Large Language Models.
CoRR, May, 2025
Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation.
CoRR, May, 2025
Unveiling Knowledge Utilization Mechanisms in LLM-based Retrieval-Augmented Generation.
CoRR, May, 2025
Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking.
CoRR, February, 2025
BeamLoRA: Beam-Constraint Low-Rank Adaptation.
CoRR, February, 2025
Curiosity-Driven Reinforcement Learning from Human Feedback.
CoRR, January, 2025
Towards few-shot mixed-type dialogue generation.
Sci. China Inf. Sci., 2025
A simple yet effective self-debiasing framework for transformer models.
Artif. Intell., 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
Learning to Select External Knowledge With Multi-Scale Negative Sampling.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Mixture of Hidden-Dimensions Transformer.
CoRR, 2024
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions.
CoRR, 2024
Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging.
CoRR, 2024
S²AG-Vid: Enhancing Multi-Motion Alignment in Video Diffusion Models via Spatial and Syntactic Attention-Based Guidance.
CoRR, 2024
Orthogonal Finetuning for Direct Preference Optimization.
CoRR, 2024
NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time.
CoRR, 2024
Exploring the Causality of End-to-End Autonomous Driving.
CoRR, 2024
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space.
CoRR, 2024
HFT: Half Fine-Tuning for Large Language Models.
CoRR, 2024
Dual Modalities of Text: Visual and Textual Generative Pre-training.
CoRR, 2024
On Training Data Influence of GPT Models.
CoRR, 2024
Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models.
CoRR, 2024
An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Tool-Augmented Reward Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
BASES: Large-scale Web Search User Simulation with Large Language Model based Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Autoregressive Pre-Training on Pixels and Texts.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
On Training Data Influence of GPT Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
QDMR-based Planning-and-Solving Prompting for Complex Reasoning Tasks.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
A High-Order Curvature Compensated CMOS Bandgap Reference Without Resistors.
Circuits Syst. Signal Process., November, 2023
A method for multiple-sequence-alignment-free protein structure prediction using a protein language model.
Nat. Mach. Intell., October, 2023
RLCharge: Imitative Multi-Agent Spatiotemporal Reinforcement Learning for Electric Vehicle Charging Station Recommendation.
IEEE Trans. Knowl. Data Eng., June, 2023
Graph-Grounded Goal Planning for Conversational Recommendation.
IEEE Trans. Knowl. Data Eng., May, 2023
Controllable Dialogue Generation With Disentangled Multi-Grained Style Specification and Attribute Consistency Reward.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Tool-Augmented Reward Modeling.
CoRR, 2023
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue.
CoRR, 2023
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation.
CoRR, 2023
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models.
CoRR, 2023
ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization.
CoRR, 2023
SeSQL: A High-Quality Large-Scale Session-Level Chinese Text-to-SQL Dataset.
Proceedings of the Natural Language Processing and Chinese Computing, 2023
Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Modeling for Low Power Bypass Window SAR ADC Based on Highest Weight Capacitor Splitting.
Proceedings of the 30th IEEE International Conference on Electronics, Circuits and Systems, 2023
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
A Thorough Examination on Zero-shot Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Towards Zero-Shot Persona Dialogue Generation with In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
TOME: A Two-stage Approach for Model-based Retrieval.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Learning In-context Learning for Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Towards Boosting the Open-Domain Chatbot with Human Feedback.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Universal Information Extraction as Unified Semantic Matching.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Evolving Decomposed Plasticity Rules for Information-Bottlenecked Meta-Learning.
Trans. Mach. Learn. Res., 2022
Towards Knowledge-Aware Video Captioning via Transitive Visual Relationship Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022
Geometry-enhanced molecular representation learning for property prediction.
Nat. Mach. Intell., 2022
ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation.
CoRR, 2022
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech.
CoRR, 2022
PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation.
CoRR, 2022
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance.
CoRR, 2022
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts.
CoRR, 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding.
CoRR, 2022
ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training.
CoRR, 2022
SeSQL: Yet Another Large-scale Session-level Chinese Text-to-SQL Dataset.
CoRR, 2022
GEM-2: Next Generation Molecular Property Prediction Network with Many-body and Full-range Interaction Modeling.
CoRR, 2022
An Interpretability Evaluation Benchmark for Pre-trained Language Models.
CoRR, 2022
HelixFold-Single: MSA-free Protein Structure Prediction by Using Protein Language Model as an Alternative.
CoRR, 2022
SINC: Service Information Augmented Open-Domain Conversation.
CoRR, 2022
Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation.
CoRR, 2022
ERNIE-Search: Bridging Cross-Encoder with Dual-Encoder via Self On-the-fly Distillation for Dense Passage Retrieval.
CoRR, 2022
Towards Multi-Turn Empathetic Dialogs with Positive Emotion Elicitation.
CoRR, 2022
ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention.
CoRR, 2022
DuReader_retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine.
CoRR, 2022
Faithfulness in Natural Language Generation: A Systematic Survey of Analysis, Evaluation and Optimization Methods.
CoRR, 2022
HelixADMET: a robust and endpoint extensible ADMET system incorporating self-supervised knowledge transfer.
Bioinform., 2022
BatchDTA: implicit batch alignment enhances deep learning-based drug-target affinity estimation.
Briefings Bioinform., 2022
DTSyn: a dual-transformer-based neural network to predict synergistic drug combinations.
Briefings Bioinform., 2022
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Non-Autoregressive Chinese ASR Error Correction with Phonological Training.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
CLOP: Video-and-Language Pre-Training with Knowledge Regularizations.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022
DuQM: A Chinese Dataset of Linguistically Perturbed Natural Questions for Evaluating the Robustness of Question Matching Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
CDConv: A Benchmark for Contradiction Detection in Chinese Conversations.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Q-TOD: A Query-driven Task-oriented Dialogue System.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
DuReader-Retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
PLATO-Ad: A Unified Advertisement Text Generation Framework with Multi-Task Prompt Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022
Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
A Fine-grained Interpretability Evaluation Benchmark for Neural NLP.
Proceedings of the 26th Conference on Computational Natural Language Learning, 2022
HelixMO: Sample-Efficient Molecular Optimization in Scene-Sensitive Latent Space.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022
Syntax-guided Contrastive Learning for Pre-trained Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Long Time No See! Open-Domain Conversation with Long-Term Persona Memory.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
DuReader_vis: A Chinese Dataset for Open-domain Document Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Where to Go for the Holidays: Towards Mixed-Type Dialogs for Clarification of User Goals.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
UNIMO-2: End-to-End Unified Vision-Language Grounded Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
PLANET: Dynamic Content Planning in Autoregressive Transformers for Long-form Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Unified Structure Generation for Universal Information Extraction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
Coherent Dialog Generation with Query Graph.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation.
CoRR, 2021
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.
CoRR, 2021
TOD-DA: Towards Boosting the Robustness of Task-oriented Dialogue Modeling on Spoken Conversations.
CoRR, 2021
CELLS: Cost-Effective Evolution in Latent Space for Goal-Directed Molecular Generation.
CoRR, 2021
Amendable Generation for Dialogue State Tracking.
CoRR, 2021
Building Chinese Biomedical Language Models via Multi-Level Text Discrimination.
CoRR, 2021
A Multimodal Sentiment Dataset for Video Recommendation.
CoRR, 2021
Do What Nature Did To Us: Evolving Plastic Recurrent Neural Networks For Task Generalization.
CoRR, 2021
A Sentiment Analysis Dataset for Trustworthiness Evaluation.
CoRR, 2021
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.
CoRR, 2021
ChemRL-GEM: Geometry Enhanced Molecular Representation Learning for Property Prediction.
CoRR, 2021
ERNIE-Tiny: A Progressive Distillation Framework for Pretrained Transformer Compression.
CoRR, 2021
A Unified Pre-training Framework for Conversational AI.
CoRR, 2021
BSTC: A Large-Scale Chinese-English Speech Translation Dataset.
CoRR, 2021
Learning with Noisy Correspondence for Cross-modal Matching.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Data Augmentation with Hierarchical SQL-to-Question Generation for Cross-domain Text-to-SQL Parsing.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Fine-grained Entity Typing via Label Reasoning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Mixup Decoding for Diverse Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
SgSum: Transforming Multi-document Summarization into Sub-graph Selection.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Familia: A Configurable Topic Modeling Framework for Industrial Text Engineering.
Proceedings of the Database Systems for Advanced Applications, 2021
From Learning-to-Match to Learning-to-Discriminate: Global Prototype Learning for Few-shot Relation Classification.
Proceedings of the Chinese Computational Linguistics - 20th China National Conference, 2021
Docking-based Virtual Screening with Multi-Task Learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021
Correcting Chinese Spelling Errors with Phonetic Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Discovering Dialog Structure Graph for Coherent Dialog Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora.
CoRR, 2020
Discovering Dialog Structure Graph for Open-Domain Dialog Generation.
CoRR, 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding.
CoRR, 2020
ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph.
CoRR, 2020
DuReader_robust: A Chinese Dataset Towards Evaluating the Robustness of Machine Reading Comprehension Models.
CoRR, 2020
Enhancing Dialog Coherence with Event Graph Grounded Content Planning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
TopicOcean: An Ever-Increasing Topic Model With Meta-learning.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020
Learning Adaptive Segmentation Policy for Simultaneous Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Syntactic and Semantic-driven Learning for Open Information Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Diversified Multiple Instance Learning for Document-Level Multi-Aspect Sentiment Classification.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Exploring Contextual Word-level Style Relevance for Unsupervised Style Transfer.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Towards Conversational Recommendation over Multi-Type Dialogs.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Leveraging Graph to Improve Abstractive Multi-Document Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Knowledge Graph Grounded Goal Planning for Open-Domain Conversation Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
CoKE: Contextualized Knowledge Graph Embedding.
CoRR, 2019
DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting.
CoRR, 2019
Proactive Human-Machine Conversation with Explicit Conversation Goals.
CoRR, 2019
ERNIE: Enhanced Representation through Knowledge Integration.
CoRR, 2019
Knowledge Aware Conversation Generation with Reasoning on Augmented Graph.
CoRR, 2019
Integrating Bayesian and Neural Networks for Discourse Coherence.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019
Baidu Neural Machine Translation Systems for WMT19.
Proceedings of the Fourth Conference on Machine Translation, 2019
A Key-Phrase Aware End2end Neural Response Generation Model.
Proceedings of the Natural Language Processing and Chinese Computing, 2019
An Overview of the 2019 Language and Intelligence Challenge.
Proceedings of the Natural Language Processing and Chinese Computing, 2019
End-to-End Speech Translation with Knowledge Distillation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Learning to Select Knowledge for Response Generation in Dialog Systems.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Enhancing Local Feature Extraction with Global Representation for Neural Text Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Multi-agent Learning for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proactive Human-Machine Conversation with Explicit Conversation Goal.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
ARNOR: Attention Regularization based Noise Reduction for Distant Supervision Relation Classification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Know More about Each Other: Evolving Dialogue Strategy via Compound Assessment.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019
Addressing the Under-Translation Problem from the Entropy Perspective.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Modeling Coherence for Discourse Neural Machine Translation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Familia: A Configurable Topic Modeling Framework for Industrial Text Engineering.
CoRR, 2018
A New Method of Region Embedding for Text Classification.
Proceedings of the 6th International Conference on Learning Representations, 2018
Addressing Troublesome Words in Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications.
Proceedings of the Workshop on Machine Reading for Question Answering@ACL 2018, 2018
Multi-Channel Encoder for Neural Machine Translation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications.
CoRR, 2017
An End-to-End Model for Question Answering over Knowledge Base with Cross-Attention Combining Global Knowledge.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Question Answering over Knowledge Base with Neural Attention Combining Global Knowledge Information.
CoRR, 2016
Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation System.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016
Agreement-Based Joint Training for Bidirectional Attention-Based Neural Machine Translation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Multi-view Response Selection for Human-Computer Conversation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Chinese Poetry Generation with Planning based Neural Network.
Proceedings of the COLING 2016, 2016
"Shall I Be Your Chat Companion?": Towards an Online Human-Computer Conversation System.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016
Minimum Risk Training for Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Active Learning for Dependency Parsing with Partial Annotation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Semi-Supervised Learning for Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
Improved Neural Machine Translation with SMT Features.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Exploiting Collective Hidden Structures in Webpage Titles for Open Domain Entity Extraction.
Proceedings of the 24th International Conference on World Wide Web, 2015
Improved beam search with constrained softmax for NMT.
Proceedings of Machine Translation Summit XV: Papers, 2015
Multi-Task Learning for Multiple Language Translation.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
2014
Improving Pivot-Based Statistical Machine Translation by Pivoting the Co-occurrence Count of Phrase Pairs.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
2013
Generalization of Words for Chinese Dependency Parsing.
Proceedings of The 13th International Conference on Parsing Technologies, 2013
Improving Pivot-Based Statistical Machine Translation Using Random Walk.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013
2012
Opening Machine Translation Black Box for Cross-Language Information Retrieval.
Proceedings of the Information Retrieval Technology, 2012
Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012
Improve SMT Quality with Automatically Extracted Paraphrase Rules.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012
2011
Two-Word Collocation Extraction Using Monolingual Word Alignment Method.
ACM Trans. Intell. Syst. Technol., 2011
Reordering with Source Language Collocations.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011
2010
Improving Statistical Machine Translation with Monolingual Collocation.
Proceedings of the ACL 2010, 2010
2009
Collocation Extraction Using Monolingual Word Alignment Method.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009
Revisiting Pivot Language Approach for Machine Translation.
Proceedings of the ACL 2009, 2009
Exploiting Heterogeneous Treebanks for Parsing.
Proceedings of the ACL 2009, 2009
2008
The TCH machine translation system for IWSLT 2008.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008
Predicting and Tagging Dialog-Act Using MDP and SVM.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora.
Proceedings of the COLING 2008, 2008
2007
Pivot language approach for phrase-based statistical machine translation.
Mach. Transl., 2007
Improving statistical word alignment with various clues.
Proceedings of Machine Translation Summit XI: Papers, 2007
Log-linear generation models for example-based machine translation.
Proceedings of Machine Translation Summit XI: Papers, 2007
Comparative study of word alignment heuristics and phrase-based SMT.
Proceedings of Machine Translation Summit XI: Papers, 2007
Using RBMT Systems to Produce Bilingual Corpus for SMT.
Proceedings of the EMNLP-CoNLL 2007, 2007
2006
Example-based machine translation based on tree-string correspondence and statistical generation.
Mach. Transl., 2006
Boosting Statistical Word Alignment Using Labeled and Unlabeled Data.
Proceedings of the ACL 2006, 2006
Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs.
Proceedings of the ACL 2006, 2006
2005
Boosting Statistical Word Alignment.
Proceedings of Machine Translation Summit X: Papers, 2005
Example-based Machine Translation Based on TSC and Statistical Generation.
Proceedings of Machine Translation Summit X: Papers, 2005
Improving Statistical Word Alignment with Ensemble Methods.
Proceedings of the Natural Language Processing, 2005
Alignment Model Adaptation for Domain-Specific Word Alignment.
Proceedings of the ACL 2005, 2005
2004
Improving Statistical Word Alignment with a Rule-Based Machine Translation System.
Proceedings of the COLING 2004, 2004
Improving Domain-Specific Word Alignment with a General Bilingual Corpus.
Proceedings of the Machine Translation: From Real Users to Research, 2004
Improving Domain-Specific Word Alignment for Computer Assisted Translation.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, July 21-26, 2004, 2004