Hao Cheng

Affiliations:
  • Microsoft Research
  • University of Washington, Seattle, WA, USA (PhD)
  • University of Alberta, Canada (former)


According to our database1, Hao Cheng authored at least 66 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
ReasonAgain: Using Extractable Symbolic Programs to Evaluate Mathematical Reasoning.
CoRR, 2024

Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities.
CoRR, 2024

CorrectionLM: Self-Corrections with SLM for Dialogue State Tracking.
CoRR, 2024

ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning.
CoRR, 2024

GRIN: GRadient-INformed MoE.
CoRR, 2024

Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering.
CoRR, 2024

Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts.
CoRR, 2024

Encode Once and Decode in Parallel: Efficient Transformer Decoding.
CoRR, 2024

ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Fast-ELECTRA for Efficient Pre-training.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents.
Proceedings of the Computer Vision - ECCV 2024, 2024

Language Models as Inductive Reasoners.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

DocLens: Multi-aspect Fine-grained Medical Text Evaluation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Fine-tuning large neural language models for biomedical natural language processing.
Patterns, April, 2023

InSCIt: Information-Seeking Conversations with Mixed-Initiative Interactions.
Trans. Assoc. Comput. Linguistics, 2023

Enhancing Medical Text Evaluation with GPT-4.
CoRR, 2023

Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks.
CoRR, 2023

MathVista: Evaluating Math Reasoning in Visual Contexts with GPT-4V, Bard, and Other Large Multimodal Models.
CoRR, 2023

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations.
CoRR, 2023

Self-Verification Improves Few-Shot Clinical Information Extraction.
CoRR, 2023

Chain-of-Skills: A Configurable Model for Open-domain Question Answering.
CoRR, 2023

Pre-training Transformers for Knowledge Graph Completion.
CoRR, 2023

Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback.
CoRR, 2023

Augmenting Language Models with Long-Term Memory.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Understand and Modularize Generator Optimization in ELECTRA-style Pretraining.
Proceedings of the International Conference on Machine Learning, 2023

Visually-Augmented Language Modeling.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Chain-of-Skills: A Configurable Model for Open-Domain Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing.
ACM Trans. Comput. Heal., 2022

A Survey of Knowledge-Intensive NLP with Pre-Trained Language Models.
CoRR, 2022

Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Open-domain Question Answering via Chain of Reasoning over Heterogeneous Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Unsupervised Learning of Hierarchical Conversation Structure.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Knowledge-Rich Self-Supervision for Biomedical Entity Linking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Open Domain Question Answering with A Unified Knowledge Interface.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Knowledge-Rich Self-Supervised Entity Linking.
CoRR, 2021

CLUES: Few-Shot Learning Evaluation in Natural Language Understanding.
CoRR, 2021

Open Domain Question Answering over Virtual Documents: A Unified Approach for Data and Text.
CoRR, 2021

Few-Shot Learning Evaluation in Natural Language Understanding.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Targeted Adversarial Training for Natural Language Understanding.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Posterior Differential Regularization with f-divergence for Improving Model Robustness.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Dialogue State Tracking with a Language Model using Schema-Driven Prompting.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

UnitedQA: A Hybrid Approach for Open Domain Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Adversarial Training for Large Neural Language Models.
CoRR, 2020


The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

Probabilistic Assumptions Matter: Improved Models for Distantly-Supervised Document-Level Question Answering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
A Dynamic Speaker Model for Conversational Interactions.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
Improving Span-based Question Answering Systems with Coarsely Labeled Data.
CoRR, 2018

Sounding Board: A User-Centric and Content-Driven Social Chatbot.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

2017
A Factored Neural Network Model for Characterizing Online Discussions in Vector Space.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Learning Latent Local Conversation Modes for Predicting Community Endorsement in Online Discussions.
CoRR, 2016

Bi-directional Attention with Agreement for Dependency Parsing.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Scalable and Sound Low-Rank Tensor Learning.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

Learning Latent Local Conversation Modes for Predicting Comment Endorsement in Online Discussions.
Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, 2016

2015
Open-Domain Name Error Detection using a Multi-Task RNN.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Language Models for Image Captioning: The Quirks and What Works.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2013
Convex Relaxations of Bregman Divergence Clustering.
Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Convex Two-Layer Modeling.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Characterizing the Representer Theorem.
Proceedings of the 30th International Conference on Machine Learning, 2013


  Loading...