Dayiheng Liu

Orcid: 0000-0002-8755-8941

According to our database1, Dayiheng Liu authored at least 86 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Rethinking the Exploitation of Monolingual Data for Low-Resource Neural Machine Translation.
Comput. Linguistics, March, 2024

Language Models can Self-Lengthen to Generate Long Texts.
CoRR, 2024

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution.
CoRR, 2024

Qwen2.5-Coder Technical Report.
CoRR, 2024

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement.
CoRR, 2024

Towards a Unified View of Preference Learning for Large Language Models: A Survey.
CoRR, 2024

Qwen2 Technical Report.
CoRR, 2024

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning.
CoRR, 2024

An Empirical Study of Parameter Efficient Fine-tuning on Vision-Language Pre-train Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Knowledge Enhanced Pre-training for Cross-lingual Dense Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MoNMT: Modularly Leveraging Monolingual and Bilingual Knowledge for Neural Machine Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Rationales for Answers to Simple Math Word Problems Confuse Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Talk Funny! A Large-Scale Humor Response Dataset with Chain-of-Humor Interpretation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models.
CoRR, 2023

Qwen Technical Report.
CoRR, 2023

PolyLM: An Open Source Polyglot Large Language Model.
CoRR, 2023

Interactive Natural Language Processing.
CoRR, 2023

Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors.
CoRR, 2023

EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dynamic Voting for Efficient Reasoning in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Unifying Discrete and Continuous Representations for Unsupervised Paraphrase Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Noisy Pair Corrector for Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Fantastic Expressions and Where to Find Them: Chinese Simile Generation with Multiple Constraints.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Bridging the Domain Gaps in Context Representations for k-Nearest Neighbor Neural Machine Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Tailor: A Soft-Prompt-Based Approach to Attribute-Based Controlled Text Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
CoupGAN: Chinese couplet generation via encoder-decoder model and adversarial training under global control.
Soft Comput., 2022

Prediction, selection, and generation: a knowledge-driven conversation system.
Neural Comput. Appl., 2022

Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task.
CoRR, 2022

Draft, Command, and Edit: Controllable Text Editing in E-Commerce.
CoRR, 2022

Tailor: A Prompt-Based Approach to Attribute-Based Controlled Text Generation.
CoRR, 2022

RMBR: A Regularized Minimum Bayes Risk Reranking Framework for Machine Translation.
CoRR, 2022

Effective Approaches to Neural Query Language Identification.
Comput. Linguistics, 2022

Alibaba-Translate China's Submission for WMT2022 Metrics Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Self-supervised Product Title Rewrite for Product Listing Ads.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Dangling-Aware Entity Alignment with Mixed High-Order Proximities.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Bridging the Gap between Training and Inference: Multi-Candidate Optimization for Diverse Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Competency-Aware Neural Machine Translation: Can Machine Translation Know its Own Translation Quality?
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Unsupervised Preference-Aware Language Identification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Attention Mechanism with Energy-Friendly Operations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

UniTE: Unified Translation Evaluation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

GCPG: A General Framework for Controllable Paraphrase Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Frequency-Aware Contrastive Learning for Neural Machine Translation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

KGR4: Retrieval, Retrospect, Refine and Rethink for Commonsense Generation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
An automatic evaluation metric for Ancient-Modern Chinese translation.
Neural Comput. Appl., 2021

KGR^4: Retrieval, Retrospect, Refine and Rethink for Commonsense Generation.
CoRR, 2021

Leveraging Advantages of Interactive and Non-Interactive Models for Vector-Based Cross-Lingual Information Retrieval.
CoRR, 2021

Prediction, Selection, and Generation: Exploration of Knowledge-Driven Conversation System.
CoRR, 2021

RoBLEURT Submission for WMT2021 Metrics Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Mask Attention Networks: Rethinking and Strengthen Transformer.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

AnchiBERT: A Pre-Trained Model for Ancient Chinese Language Understanding and Generation.
Proceedings of the International Joint Conference on Neural Networks, 2021

BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining.
Proceedings of the 38th International Conference on Machine Learning, 2021

Evolving transformer architecture for neural machine translation.
Proceedings of the GECCO '21: Genetic and Evolutionary Computation Conference, 2021

POS-Constrained Parallel Decoding for Non-autoregressive Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

GLGE: A New General Language Generation Evaluation Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Towards User-Driven Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Ancient-Modern Chinese Translation with a New Large Training Dataset.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

μ-Forcing: Training Variational Recurrent Autoencoders for Text Generation.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

AnchiBERT: A Pre-Trained Model for Ancient ChineseLanguage Understanding and Generation.
CoRR, 2020

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.
CoRR, 2020

Generating Chinese Poetry from Images via Concrete and Abstract Information.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Exploration on the Generation of Chinese Palindrome Poetry.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Diverse, Controllable, and Keyphrase-Aware: A Corpus and Method for News Multi-Headline Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Herb-Know: Knowledge Enhanced Prescription Generation for Traditional Chinese Medicine.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020

Let's be Humorous: Knowledge Enhanced Humor Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020

RikiNet: Reading Wikipedia Pages for Natural Question Answering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Deep Poetry: A Chinese Classical Poetry Generation System.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Revision in Continuous Space: Unsupervised Text Style Transfer without Adversarial Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Generating Style-Specific Chinese Tang Poetry With a Simple Actor-Critic Model.
IEEE Trans. Emerg. Top. Comput. Intell., 2019

BFGAN: Backward and Forward Generative Adversarial Networks for Lexically Constrained Sentence Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Deep learning-based automatic downbeat tracking: a brief review.
Multim. Syst., 2019

Revision in Continuous Space: Fine-Grained Control of Text Style Transfer.
CoRR, 2019

mu-Forcing: Training Variational Recurrent Autoencoders for Text Generation.
CoRR, 2019

TIGS: An Inference Algorithm for Text Infilling with Gradient Search.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Ancient-Modern Chinese Translation with a Large Training Dataset.
CoRR, 2018

BFGAN: Backward and Forward Generative Adversarial Networks for Lexically Constrained Sentence Generation.
CoRR, 2018

Method to Improve the Performance of Restricted Boltzmann Machines.
Proceedings of the Advances in Neural Networks - ISNN 2018, 2018

A Multi-Modal Chinese Poetry Generation Model.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

2016
A neural words encoding model.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016


  Loading...