Zihan Liu

Affiliations:
  • Nvidia, Santa Clara, CA, USA
  • Hong Kong University of Science and Technology, Center for Artificial Intelligence Research (CAiRE), Clear Water Bay, Hong Kong (PhD 2022)


According to our database1, Zihan Liu authored at least 54 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
NVLM: Open Frontier-Class Multimodal LLMs.
CoRR, 2024

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities.
CoRR, 2024

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs.
CoRR, 2024

Nemotron-4 340B Technical Report.
CoRR, 2024

ChatQA: Building GPT-4 Level Conversational QA Models.
CoRR, 2024

Retrieval meets Long Context Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition.
CoRR, 2023

Cross-Lingual Cross-Age Adaptation for Low-Resource Elderly Speech Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

RHO: Reducing Hallucination in Open-domain Dialogues with Knowledge Grounding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
RHO ($ρ$): Reducing Hallucination in Open-domain Dialogues with Knowledge Grounding.
CoRR, 2022

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study.
Proceedings of the 21st Workshop on Biomedical Language Processing, 2022

Multi-Stage Prompting for Knowledgeable Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters.
Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

2021
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation.
CoRR, 2021

NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging.
CoRR, 2021

Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters.
CoRR, 2021

X2Parser: Cross-Lingual and Cross-Domain Framework for Task-Oriented Compositional Semantic Parsing.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Preserving Cross-Linguality of Pre-trained Models via Continual Learning.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Multimodal End-to-End Sparse Model for Emotion Recognition.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue System.
Proceedings of the 1st Workshop on Document-grounded Dialogue and Conversational Question Answering, 2021

Are Multilingual Models Effective in Code-Switching?
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, 2021

On the Importance of Word Order Information in Cross-lingual Sequence Labeling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

CrossNER: Evaluating Cross-Domain Named Entity Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
EmoGraph: Capturing Emotion Correlations using Graph Networks.
CoRR, 2020

Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems.
CoRR, 2020

Exploring Fine-tuning Techniques for Pre-trained Cross-lingual Models via Continual Learning.
CoRR, 2020

Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection.
CoRR, 2020

Variational Transformers for Diverse Response Generation.
CoRR, 2020

XPersona: Evaluating Multilingual Personalized Chatbot.
CoRR, 2020

Do We Need Word Order Information for Cross-lingual Sequence Labeling.
CoRR, 2020

Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-TaskLearning for Offensive Language Detection.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Zero-Resource Cross-Domain Named Entity Recognition.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Learning Fast Adaptation on Cross-Accented Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Modality-Transferable Emotion Embeddings for Low-Resource Multimodal Emotion Recognition.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Lightweight and Efficient End-To-End Speech Recognition Using Low-Rank Transformer.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Cross-lingual Spoken Language Understanding with Regularized Representation Alignment.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Meta-Transfer Learning for Code-Switched Speech Recognition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Attention-Informed Mixed-Language Training for Zero-Shot Cross-Lingual Task-Oriented Dialogue Systems.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CAiRE: An End-to-End Empathetic Chatbot.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CAiRE: An End-to-End Empathetic Chatbot.
CoRR, 2019

Incorporating Word and Subword Units in Unsupervised Machine Translation Using Language Model Rescoring.
Proceedings of the Fourth Conference on Machine Translation, 2019

Team yeon-zi at SemEval-2019 Task 4: Hyperpartisan News Detection by De-noising Weakly-labeled Data.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Hierarchical Meta-Embeddings for Code-Switching Named Entity Recognition.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Zero-shot Cross-lingual Dialogue Systems with Transferable Latent Variables.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Generalizing Question Answering System with Pre-trained Language Model Fine-tuning.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019


  Loading...