2024
Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving.
CoRR, 2024
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents.
CoRR, 2024
AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts.
CoRR, 2024
AdvWeb: Controllable Black-box Attacks on VLM-powered Web Agents.
CoRR, 2024
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs.
CoRR, 2024
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents.
CoRR, 2024
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage.
CoRR, 2024
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization.
CoRR, 2024
AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs.
CoRR, 2024
A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents.
CoRR, 2024
LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset.
CoRR, 2024
Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
TableLlama: Towards Open Large Generalist Models for Tables.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024
GPT-4V(ision) is a Generalist Web Agent, if Grounded.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
AgentBench: Evaluating LLMs as Agents.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Twelfth International Conference on Learning Representations, 2024
MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
AttributionBench: How Hard is Automatic Attribution Evaluation?
Proceedings of the Findings of the Association for Computational Linguistics, 2024
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Mind2Web: Towards a Generalist Agent for the Web.
CoRR, 2023
Exploring Chain-of-Thought Style Prompting for Text-to-SQL.
CoRR, 2023
Can ChatGPT Defend the Truth? Automatic Dialectical Evaluation Elicits LLMs' Deficiencies in Reasoning.
CoRR, 2023
Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy.
Proceedings of the ACM Web Conference 2023, 2023
Models and Practice of Neural Table Representations.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023
Roll Up Your Sleeves: Working with a Collaborative and Engaging Task-Oriented Dialogue System.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Mind2Web: Towards a Generalist Agent for the Web.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Automatic Evaluation of Attribution by Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Exploring Chain of Thought Style Prompting for Text-to-SQL.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Error Detection for Text-to-SQL Semantic Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
DP-Forward: Fine-tuning and Inference on Language Models with Differential Privacy in Forward Pass.
Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security, 2023
Biomedical Language Models are Robust to Sub-optimal Tokenization.
Proceedings of the 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, 2023
Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Text-to-SQL Error Correction with Language Models of Code.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
2022
TURL: Table Understanding through Representation Learning.
SIGMOD Rec., 2022
Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe.
CoRR, 2022
Bootstrapping a User-Centered Task-Oriented Dialogue System.
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
Shepherd Pre-trained Language Models to Develop a Train of Thought: An Iterative Prompting Approach.
CoRR, 2022
DOM-LM: Learning Generalizable Representations for HTML Documents.
CoRR, 2022
Iteratively Prompt Pre-trained Language Models for Chain of Thought.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Synthetic Question Value Estimation for Domain Adaptation of Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
Modeling Context Pair Interaction for Pairwise Tasks on Graphs.
Proceedings of the WSDM '21, 2021
Structure-Grounded Pretraining for Text-to-SQL.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
TopNet: Learning from Neural Topic Model to Generate Long Stories.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
From Tables to Knowledge: Recent Advances in Table Understanding.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
Learning Structural Edits via Incremental Tree Transformations.
Proceedings of the 9th International Conference on Learning Representations, 2021
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
ReasonBERT: Pre-trained to Reason with Distant Supervision.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021
Differential Privacy for Text Analytics via Natural Text Sanitization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
2020
Discerning Influence Patterns with Beta-Poisson Factorization in Microblogging Environments.
IEEE Trans. Knowl. Data Eng., 2020
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval.
CoRR, 2020
Practical Annotation Strategies for Question Answering Datasets.
CoRR, 2020
Graph embedding on biomedical networks: methods, applications and evaluations.
Bioinform., 2020
EndCold: An End-to-End Framework for Cold Question Routing in Community Question Answering Services.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Adversarial Training for Code Retrieval with Question-Description Relevance Regularization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
An Imitation Game for Learning Semantic Parsers from User Interaction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Learning a Cost-Effective Annotation Policy for Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Clinical Phrase Mining with Language Models.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Rationalizing Medical Relation Prediction from Corpus-level Statistics.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Question-Driven Purchasing Propensity Analysis for Recommendation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Easy-to-Hard: Leveraging Simple Questions for Complex Question Generation.
CoRR, 2019
An End-to-End Framework for Cold Question Routing in Community Question Answering Services.
CoRR, 2019
Automatic Table completion using Knowledge Base.
CoRR, 2019
CoaCor: Code Annotation for Code Retrieval with Reinforcement Learning.
Proceedings of the World Wide Web Conference, 2019
Riker: Mining Rich Keyword Representations for Interpretable Product Question Answering.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019
SurfCon: Synonym Discovery on Privacy-Aware Clinical Data.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019
Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Dynamic Bayesian Metric Learning for Personalized Product Search.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019
Reinforced Dynamic Reasoning for Conversational Question Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Answer Identification from Product Reviews for User Questions by Multi-Task Attentive Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
StaQC: A Systematically Mined Question-Code Dataset from Stack Overflow.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018
Global Relation Embedding for Relation Extraction.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
2017
Overlapped Subarray Based Hybrid Beamforming for Millimeter Wave Multiuser Massive MIMO.
IEEE Signal Process. Lett., 2017
Reliable Medical Diagnosis from Crowdsourcing: Discover Trustworthy Answers from Non-Experts.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017
Multi-Panel Based Hybrid Beamforming for Multi-User Massive MIMO.
Proceedings of the 2017 IEEE Global Communications Conference, 2017
An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
2016
Mining Disparate Sources for Question Answering.
PhD thesis, 2016
Table Cell Search for Question Answering.
Proceedings of the 25th International Conference on World Wide Web, 2016
Entity Disambiguation with Linkless Knowledge Bases.
Proceedings of the 25th International Conference on World Wide Web, 2016
Design of a Wideband and Dual-Polarized CPW-Fed Monopole Antenna for Future 5G Communications.
Proceedings of the IEEE 84th Vehicular Technology Conference, 2016
Distributed Representations of Expertise.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016
Augmented LSTM Framework to Construct Medical Self-Diagnosis Android.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016
Coordinated Hybrid Beamforming for Millimeter Wave Multi-User Massive MIMO Systems.
Proceedings of the 2016 IEEE Global Communications Conference, 2016
On Generating Characteristic-rich Question Sets for QA Evaluation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
2015
Fine-Grained Knowledge Sharing in Collaborative Environments.
IEEE Trans. Knowl. Data Eng., 2015
Open Domain Question Answering via Semantic Enrichment.
Proceedings of the 24th International Conference on World Wide Web, 2015
Performance Evaluation of Distributed Scheduling for Downlink Coherent Joint Transmission.
Proceedings of the IEEE 82nd Vehicular Technology Conference, 2015
A low complexity scheme for realistic multi-cell downlink coherent joint transmission.
Proceedings of the 26th IEEE Annual International Symposium on Personal, 2015
Exploiting Relevance Feedback in Knowledge Graph Search.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015
2014
Interpreting the Public Sentiment Variations on Twitter.
IEEE Trans. Knowl. Data Eng., 2014
Schemaless and Structureless Graph Querying.
Proc. VLDB Endow., 2014
SLQ: a user-friendly graph querying system.
Proceedings of the International Conference on Management of Data, 2014
A Probabilistic Approach to Uncovering Attributed Graph Anomalies.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014
Network mining and analysis for social applications.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014
Analyzing expert behaviors in collaborative networks.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014
2013
Synthetic review spamming and defense.
Proceedings of the 22nd International World Wide Web Conference, 2013
Noise-Resistant Bicluster Recognition.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013
2012
Enhanced Multiuser Eigenmode Transmission for Joint Frequency-Spatial Resource Allocation in OFDM-MIMO Downlink Systems.
Proceedings of the 75th IEEE Vehicular Technology Conference, 2012
2011
A priority-aware hybrid multi-hop energy saving strategy for inter-eNB scenario 2.
Proceedings of the IEEE 22nd International Symposium on Personal, 2011