Hua Wu

Affiliations:
  • Baidu Inc., Beijing, China
  • Toshiba (China) Research and Development Center, Beijing, China (former)


According to our database, Hua Wu authored at least 233 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Learning to Select External Knowledge With Multi-Scale Negative Sampling.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions.
CoRR, 2024

Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging.
CoRR, 2024

S<sup>2</sup>AG-Vid: Enhancing Multi-Motion Alignment in Video Diffusion Models via Spatial and Syntactic Attention-Based Guidance.
CoRR, 2024

Orthogonal Finetuning for Direct Preference Optimization.
CoRR, 2024

NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time.
CoRR, 2024

Exploring the Causality of End-to-End Autonomous Driving.
CoRR, 2024

BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space.
CoRR, 2024

HFT: Half Fine-Tuning for Large Language Models.
CoRR, 2024

Dual Modalities of Text: Visual and Textual Generative Pre-training.
CoRR, 2024

On Training Data Influence of GPT Models.
CoRR, 2024

Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models.
CoRR, 2024

An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Tool-Augmented Reward Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

BASES: Large-scale Web Search User Simulation with Large Language Model based Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Autoregressive Pre-Training on Pixels and Texts.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

On Training Data Influence of GPT Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

QDMR-based Planning-and-Solving Prompting for Complex Reasoning Tasks.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
A High-Order Curvature Compensated CMOS Bandgap Reference Without Resistors.
Circuits Syst. Signal Process., November, 2023

A method for multiple-sequence-alignment-free protein structure prediction using a protein language model.
Nat. Mach. Intell., October, 2023

RLCharge: Imitative Multi-Agent Spatiotemporal Reinforcement Learning for Electric Vehicle Charging Station Recommendation.
IEEE Trans. Knowl. Data Eng., June, 2023

Graph-Grounded Goal Planning for Conversational Recommendation.
IEEE Trans. Knowl. Data Eng., May, 2023

Controllable Dialogue Generation With Disentangled Multi-Grained Style Specification and Attribute Consistency Reward.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Tool-Augmented Reward Modeling.
CoRR, 2023

GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue.
CoRR, 2023

Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation.
CoRR, 2023

A Simple yet Effective Self-Debiasing Framework for Transformer Models.
CoRR, 2023

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models.
CoRR, 2023

ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization.
CoRR, 2023

SeSQL: A High-Quality Large-Scale Session-Level Chinese Text-to-SQL Dataset.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Modeling for Low Power Bypass Window SAR ADC Based on Highest Weight Capacitor Splitting.
Proceedings of the 30th IEEE International Conference on Electronics, Circuits and Systems, 2023

IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Thorough Examination on Zero-shot Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Zero-Shot Persona Dialogue Generation with In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

TOME: A Two-stage Approach for Model-based Retrieval.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Learning In-context Learning for Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Towards Boosting the Open-Domain Chatbot with Human Feedback.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Universal Information Extraction as Unified Semantic Matching.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Evolving Decomposed Plasticity Rules for Information-Bottlenecked Meta-Learning.
Trans. Mach. Learn. Res., 2022

Towards Knowledge-Aware Video Captioning via Transitive Visual Relationship Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Geometry-enhanced molecular representation learning for property prediction.
Nat. Mach. Intell., 2022

ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation.
CoRR, 2022

ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech.
CoRR, 2022

PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation.
CoRR, 2022

UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance.
CoRR, 2022

ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts.
CoRR, 2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding.
CoRR, 2022

ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training.
CoRR, 2022

SeSQL: Yet Another Large-scale Session-level Chinese Text-to-SQL Dataset.
CoRR, 2022

GEM-2: Next Generation Molecular Property Prediction Network with Many-body and Full-range Interaction Modeling.
CoRR, 2022

An Interpretability Evaluation Benchmark for Pre-trained Language Models.
CoRR, 2022

HelixFold-Single: MSA-free Protein Structure Prediction by Using Protein Language Model as an Alternative.
CoRR, 2022

SINC: Service Information Augmented Open-Domain Conversation.
CoRR, 2022

Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation.
CoRR, 2022

ERNIE-Search: Bridging Cross-Encoder with Dual-Encoder via Self On-the-fly Distillation for Dense Passage Retrieval.
CoRR, 2022

Towards Multi-Turn Empathetic Dialogs with Positive Emotion Elicitation.
CoRR, 2022

ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention.
CoRR, 2022

DuReader_retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine.
CoRR, 2022

Faithfulness in Natural Language Generation: A Systematic Survey of Analysis, Evaluation and Optimization Methods.
CoRR, 2022

HelixADMET: a robust and endpoint extensible ADMET system incorporating self-supervised knowledge transfer.
Bioinform., 2022

BatchDTA: implicit batch alignment enhances deep learning-based drug-target affinity estimation.
Briefings Bioinform., 2022

DTSyn: a dual-transformer-based neural network to predict synergistic drug combinations.
Briefings Bioinform., 2022

Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Non-Autoregressive Chinese ASR Error Correction with Phonological Training.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

CLOP: Video-and-Language Pre-Training with Knowledge Regularizations.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

DuQM: A Chinese Dataset of Linguistically Perturbed Natural Questions for Evaluating the Robustness of Question Matching Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

CDConv: A Benchmark for Contradiction Detection in Chinese Conversations.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

FRSUM: Towards Faithful Abstractive Summarization via Enhancing Factual Robustness.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Q-TOD: A Query-driven Task-oriented Dialogue System.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DuReader-Retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

PLATO-Ad: A Unified Advertisement Text Generation Framework with Multi-Task Prompt Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

Clip-Tuning: Towards Derivative-free Prompt Learning with a Mixture of Rewards.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

A Fine-grained Interpretability Evaluation Benchmark for Neural NLP.
Proceedings of the 26th Conference on Computational Natural Language Learning, 2022

HelixMO: Sample-Efficient Molecular Optimization in Scene-Sensitive Latent Space.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

Syntax-guided Contrastive Learning for Pre-trained Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Long Time No See! Open-Domain Conversation with Long-Term Persona Memory.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

DuReader<sub>vis</sub>: A Chinese Dataset for Open-domain Document Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Where to Go for the Holidays: Towards Mixed-Type Dialogs for Clarification of User Goals.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

UNIMO-2: End-to-End Unified Vision-Language Grounded Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

DU-VLG: Unifying Vision-and-Language Generation via Dual Sequence-to-Sequence Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

PLANET: Dynamic Content Planning in Autoregressive Transformers for Long-form Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Unified Structure Generation for Universal Information Extraction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Coherent Dialog Generation with Query Graph.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation.
CoRR, 2021

ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.
CoRR, 2021

TOD-DA: Towards Boosting the Robustness of Task-oriented Dialogue Modeling on Spoken Conversations.
CoRR, 2021

CELLS: Cost-Effective Evolution in Latent Space for Goal-Directed Molecular Generation.
CoRR, 2021

Amendable Generation for Dialogue State Tracking.
CoRR, 2021

Building Chinese Biomedical Language Models via Multi-Level Text Discrimination.
CoRR, 2021

A Multimodal Sentiment Dataset for Video Recommendation.
CoRR, 2021

Do What Nature Did To Us: Evolving Plastic Recurrent Neural Networks For Task Generalization.
CoRR, 2021

A Sentiment Analysis Dataset for Trustworthiness Evaluation.
CoRR, 2021

ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.
CoRR, 2021

ChemRL-GEM: Geometry Enhanced Molecular Representation Learning for Property Prediction.
CoRR, 2021

ERNIE-Tiny: A Progressive Distillation Framework for Pretrained Transformer Compression.
CoRR, 2021

A Unified Pre-training Framework for Conversational AI.
CoRR, 2021

BSTC: A Large-Scale Chinese-English Speech Translation Dataset.
CoRR, 2021

Learning with Noisy Correspondence for Cross-modal Matching.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Data Augmentation with Hierarchical SQL-to-Question Generation for Cross-domain Text-to-SQL Parsing.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

RocketQAv2: A Joint Training Method for Dense Passage Retrieval and Passage Re-ranking.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Fine-grained Entity Typing via Label Reasoning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Mixup Decoding for Diverse Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

SgSum: Transforming Multi-document Summarization into Sub-graph Selection.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Familia: A Configurable Topic Modeling Framework for Industrial Text Engineering.
Proceedings of the Database Systems for Advanced Applications, 2021

From Learning-to-Match to Learning-to-Discriminate: Global Prototype Learning for Few-shot Relation Classification.
Proceedings of the Chinese Computational Linguistics - 20th China National Conference, 2021

Docking-based Virtual Screening with Multi-Task Learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

Correcting Chinese Spelling Errors with Phonetic Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

BASS: Boosting Abstractive Summarization with Unified Semantic Graph.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ERNIE-Doc: A Retrospective Long-Document Modeling Transformer.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Discovering Dialog Structure Graph for Coherent Dialog Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora.
CoRR, 2020

Discovering Dialog Structure Graph for Open-Domain Dialog Generation.
CoRR, 2020

ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding.
CoRR, 2020

ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph.
CoRR, 2020

DuReader_robust: A Chinese Dataset Towards Evaluating the Robustness of Machine Reading Comprehension Models.
CoRR, 2020

Enhancing Dialog Coherence with Event Graph Grounded Content Planning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

TopicOcean: An Ever-Increasing Topic Model With Meta-learning.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Learning Adaptive Segmentation Policy for Simultaneous Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

DuSQL: A Large-Scale and Pragmatic Chinese Text-to-SQL Dataset.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Syntactic and Semantic-driven Learning for Open Information Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Diversified Multiple Instance Learning for Document-Level Multi-Aspect Sentiment Classification.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Exploring Contextual Word-level Style Relevance for Unsupervised Style Transfer.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Towards Conversational Recommendation over Multi-Type Dialogs.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Leveraging Graph to Improve Abstractive Multi-Document Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Knowledge Graph Grounded Goal Planning for Open-Domain Conversation Generation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CoKE: Contextualized Knowledge Graph Embedding.
CoRR, 2019

PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable.
CoRR, 2019

DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting.
CoRR, 2019

Proactive Human-Machine Conversation with Explicit Conversation Goals.
CoRR, 2019

ERNIE: Enhanced Representation through Knowledge Integration.
CoRR, 2019

Knowledge Aware Conversation Generation with Reasoning on Augmented Graph.
CoRR, 2019

Integrating Bayesian and Neural Networks for Discourse Coherence.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

Baidu Neural Machine Translation Systems for WMT19.
Proceedings of the Fourth Conference on Machine Translation, 2019

A Key-Phrase Aware End2end Neural Response Generation Model.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

An Overview of the 2019 Language and Intelligence Challenge.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

End-to-End Speech Translation with Knowledge Distillation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Learning to Select Knowledge for Response Generation in Dialog Systems.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Enhancing Local Feature Extraction with Global Representation for Neural Text Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Knowledge Aware Conversation Generation with Explainable Reasoning over Augmented Graphs.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Multi-agent Learning for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Proactive Human-Machine Conversation with Explicit Conversation Goal.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

ARNOR: Attention Regularization based Noise Reduction for Distant Supervision Relation Classification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Know More about Each Other: Evolving Dialogue Strategy via Compound Assessment.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

Addressing the Under-Translation Problem from the Entropy Perspective.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Modeling Coherence for Discourse Neural Machine Translation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Familia: A Configurable Topic Modeling Framework for Industrial Text Engineering.
CoRR, 2018

A New Method of Region Embedding for Text Classification.
Proceedings of the 6th International Conference on Learning Representations, 2018

Addressing Troublesome Words in Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications.
Proceedings of the Workshop on Machine Reading for Question Answering@ACL 2018, 2018

Multi-Channel Encoder for Neural Machine Translation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications.
CoRR, 2017

An End-to-End Model for Question Answering over Knowledge Base with Cross-Attention Combining Global Knowledge.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Question Answering over Knowledge Base with Neural Attention Combining Global Knowledge Information.
CoRR, 2016

Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation System.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Agreement-Based Joint Training for Bidirectional Attention-Based Neural Machine Translation.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Multi-view Response Selection for Human-Computer Conversation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Chinese Poetry Generation with Planning based Neural Network.
Proceedings of the COLING 2016, 2016

Latent Topic Embedding.
Proceedings of the COLING 2016, 2016

"Shall I Be Your Chat Companion?": Towards an Online Human-Computer Conversation System.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Minimum Risk Training for Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Active Learning for Dependency Parsing with Partial Annotation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Semi-Supervised Learning for Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Improved Neural Machine Translation with SMT Features.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Exploiting Collective Hidden Structures in Webpage Titles for Open Domain Entity Extraction.
Proceedings of the 24th International Conference on World Wide Web, 2015

Improved beam search with constrained softmax for NMT.
Proceedings of Machine Translation Summit XV: Papers, 2015

Multi-Task Learning for Multiple Language Translation.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Improving Pivot-Based Statistical Machine Translation by Pivoting the Co-occurrence Count of Phrase Pairs.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Improve Statistical Machine Translation with Context-Sensitive Bilingual Semantic Embedding Model.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Generalization of Words for Chinese Dependency Parsing.
Proceedings of The 13th International Conference on Parsing Technologies, 2013

Improving Pivot-Based Statistical Machine Translation Using Random Walk.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
Opening Machine Translation Black Box for Cross-Language Information Retrieval.
Proceedings of the Information Retrieval Technology, 2012

Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Improve SMT Quality with Automatically Extracted Paraphrase Rules.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Two-Word Collocation Extraction Using Monolingual Word Alignment Method.
ACM Trans. Intell. Syst. Technol., 2011

Reordering with Source Language Collocations.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Improving Statistical Machine Translation with Monolingual Collocation.
Proceedings of the ACL 2010, 2010

2009
Collocation Extraction Using Monolingual Word Alignment Method.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Revisiting Pivot Language Approach for Machine Translation.
Proceedings of the ACL 2009, 2009

Exploiting Heterogeneous Treebanks for Parsing.
Proceedings of the ACL 2009, 2009

2008
The TCH machine translation system for IWSLT 2008.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

Predicting and Tagging Dialog-Act Using MDP and SVM.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Domain Adaptation for Statistical Machine Translation with Domain Dictionary and Monolingual Corpora.
Proceedings of the COLING 2008, 2008

2007
Pivot language approach for phrase-based statistical machine translation.
Mach. Transl., 2007

Improving statistical word alignment with various clues.
Proceedings of Machine Translation Summit XI: Papers, 2007

Log-linear generation models for example-based machine translation.
Proceedings of Machine Translation Summit XI: Papers, 2007

Comparative study of word alignment heuristics and phrase-based SMT.
Proceedings of Machine Translation Summit XI: Papers, 2007

Using RBMT Systems to Produce Bilingual Corpus for SMT.
Proceedings of the EMNLP-CoNLL 2007, 2007

2006
Example-based machine translation based on tree-string correspondence and statistical generation.
Mach. Transl., 2006

Boosting Statistical Word Alignment Using Labeled and Unlabeled Data.
Proceedings of the ACL 2006, 2006

Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs.
Proceedings of the ACL 2006, 2006

2005
Boosting Statistical Word Alignment.
Proceedings of Machine Translation Summit X: Papers, 2005

Example-based Machine Translation Based on TSC and Statistical Generation.
Proceedings of Machine Translation Summit X: Papers, 2005

Improving Statistical Word Alignment with Ensemble Methods.
Proceedings of the Natural Language Processing, 2005

Alignment Model Adaptation for Domain-Specific Word Alignment.
Proceedings of the ACL 2005, 2005

2004
Improving Statistical Word Alignment with a Rule-Based Machine Translation System.
Proceedings of the COLING 2004, 2004

Improving Domain-Specific Word Alignment with a General Bilingual Corpus.
Proceedings of the Machine Translation: From Real Users to Research, 2004

Improving Domain-Specific Word Alignment for Computer Assisted Translation.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, July 21-26, 2004, 2004

