Huan Sun

Orcid: 0000-0001-6436-4813

Affiliations:
  • Ohio State University, OH, USA
  • University of California, Santa Barbara, CA, USA (former)


According to our database, Huan Sun authored at least 109 papers between 2011 and 2024.

Bibliography

2024
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs.
CoRR, 2024

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents.
CoRR, 2024

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery.
CoRR, 2024

EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage.
CoRR, 2024

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark.
CoRR, 2024

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization.
CoRR, 2024

AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs.
CoRR, 2024

A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents.
CoRR, 2024

LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset.
CoRR, 2024

TableLlama: Towards Open Large Generalist Models for Tables.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

GPT-4V(ision) is a Generalist Web Agent, if Grounded.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AgentBench: Evaluating LLMs as Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AttributionBench: How Hard is Automatic Attribution Evaluation?
Proceedings of the Findings of the Association for Computational Linguistics, 2024

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Mind2Web: Towards a Generalist Agent for the Web.
CoRR, 2023

Exploring Chain-of-Thought Style Prompting for Text-to-SQL.
CoRR, 2023

Can ChatGPT Defend the Truth? Automatic Dialectical Evaluation Elicits LLMs' Deficiencies in Reasoning.
CoRR, 2023

Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy.
Proceedings of the ACM Web Conference 2023, 2023

Models and Practice of Neural Table Representations.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

Roll Up Your Sleeves: Working with a Collaborative and Engaging Task-Oriented Dialogue System.
Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mind2Web: Towards a Generalist Agent for the Web.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Automatic Evaluation of Attribution by Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Exploring Chain of Thought Style Prompting for Text-to-SQL.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Error Detection for Text-to-SQL Semantic Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DP-Forward: Fine-tuning and Inference on Language Models with Differential Privacy in Forward Pass.
Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security, 2023

Biomedical Language Models are Robust to Sub-optimal Tokenization.
Proceedings of the 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, 2023

Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Text-to-SQL Error Correction with Language Models of Code.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
TURL: Table Understanding through Representation Learning.
SIGMOD Rec., 2022

Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe.
CoRR, 2022

Bootstrapping a User-Centered Task-Oriented Dialogue System.
CoRR, 2022

Shepherd Pre-trained Language Models to Develop a Train of Thought: An Iterative Prompting Approach.
CoRR, 2022

DOM-LM: Learning Generalizable Representations for HTML Documents.
CoRR, 2022

Iteratively Prompt Pre-trained Language Models for Chain of Thought.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Synthetic Question Value Estimation for Domain Adaptation of Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Modeling Context Pair Interaction for Pairwise Tasks on Graphs.
Proceedings of the WSDM '21, 2021

Structure-Grounded Pretraining for Text-to-SQL.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

TopNet: Learning from Neural Topic Model to Generate Long Stories.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

From Tables to Knowledge: Recent Advances in Table Understanding.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Learning Structural Edits via Incremental Tree Transformations.
Proceedings of the 9th International Conference on Learning Representations, 2021

COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ReasonBERT: Pre-trained to Reason with Distant Supervision.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

Differential Privacy for Text Analytics via Natural Text Sanitization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Discerning Influence Patterns with Beta-Poisson Factorization in Microblogging Environments.
IEEE Trans. Knowl. Data Eng., 2020

COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval.
CoRR, 2020

Practical Annotation Strategies for Question Answering Datasets.
CoRR, 2020

Graph embedding on biomedical networks: methods, applications and evaluations.
Bioinform., 2020

EndCold: An End-to-End Framework for Cold Question Routing in Community Question Answering Services.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Adversarial Training for Code Retrieval with Question-Description Relevance Regularization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

An Imitation Game for Learning Semantic Parsers from User Interaction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Learning a Cost-Effective Annotation Policy for Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Clinical Phrase Mining with Language Models.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020

Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Rationalizing Medical Relation Prediction from Corpus-level Statistics.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Question-Driven Purchasing Propensity Analysis for Recommendation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Easy-to-Hard: Leveraging Simple Questions for Complex Question Generation.
CoRR, 2019

An End-to-End Framework for Cold Question Routing in Community Question Answering Services.
CoRR, 2019

Automatic Table completion using Knowledge Base.
CoRR, 2019

CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning.
Proceedings of the World Wide Web Conference, 2019

Riker: Mining Rich Keyword Representations for Interpretable Product Question Answering.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

SurfCon: Synonym Discovery on Privacy-Aware Clinical Data.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Dynamic Bayesian Metric Learning for Personalized Product Search.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Reinforced Dynamic Reasoning for Conversational Question Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Answer Identification from Product Reviews for User Questions by Multi-Task Attentive Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
StaQC: A Systematically Mined Question-Code Dataset from Stack Overflow.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Global Relation Embedding for Relation Extraction.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

2017
Overlapped Subarray Based Hybrid Beamforming for Millimeter Wave Multiuser Massive MIMO.
IEEE Signal Process. Lett., 2017

Reliable Medical Diagnosis from Crowdsourcing: Discover Trustworthy Answers from Non-Experts.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Multi-Panel Based Hybrid Beamforming for Multi-User Massive MIMO.
Proceedings of the 2017 IEEE Global Communications Conference, 2017

An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Mining Disparate Sources for Question Answering.
PhD thesis, 2016

Table Cell Search for Question Answering.
Proceedings of the 25th International Conference on World Wide Web, 2016

Entity Disambiguation with Linkless Knowledge Bases.
Proceedings of the 25th International Conference on World Wide Web, 2016

Design of a Wideband and Dual-Polarized CPW-Fed Monopole Antenna for Future 5G Communications.
Proceedings of the IEEE 84th Vehicular Technology Conference, 2016

Distributed Representations of Expertise.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

Augmented LSTM Framework to Construct Medical Self-Diagnosis Android.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Coordinated Hybrid Beamforming for Millimeter Wave Multi-User Massive MIMO Systems.
Proceedings of the 2016 IEEE Global Communications Conference, 2016

On Generating Characteristic-rich Question Sets for QA Evaluation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
Fine-Grained Knowledge Sharing in Collaborative Environments.
IEEE Trans. Knowl. Data Eng., 2015

Open Domain Question Answering via Semantic Enrichment.
Proceedings of the 24th International Conference on World Wide Web, 2015

Performance Evaluation of Distributed Scheduling for Downlink Coherent Joint Transmission.
Proceedings of the IEEE 82nd Vehicular Technology Conference, 2015

A low complexity scheme for realistic multi-cell downlink coherent joint transmission.
Proceedings of the 26th IEEE Annual International Symposium on Personal, 2015

Exploiting Relevance Feedback in Knowledge Graph Search.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

2014
Interpreting the Public Sentiment Variations on Twitter.
IEEE Trans. Knowl. Data Eng., 2014

Schemaless and Structureless Graph Querying.
Proc. VLDB Endow., 2014

SLQ: a user-friendly graph querying system.
Proceedings of the International Conference on Management of Data, 2014

A Probabilistic Approach to Uncovering Attributed Graph Anomalies.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Network mining and analysis for social applications.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Analyzing expert behaviors in collaborative networks.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

2013
Synthetic review spamming and defense.
Proceedings of the 22nd International World Wide Web Conference, 2013

Noise-Resistant Bicluster Recognition.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

2012
Enhanced Multiuser Eigenmode Transmission for Joint Frequency-Spatial Resource Allocation in OFDM-MIMO Downlink Systems.
Proceedings of the 75th IEEE Vehicular Technology Conference, 2012

2011
A priority-aware hybrid multi-hop energy saving strategy for inter-eNB scenario 2.
Proceedings of the IEEE 22nd International Symposium on Personal, 2011

