Caiming Xiong

Orcid: 0000-0003-0349-8628

According to our database1, Caiming Xiong authored at least 348 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
<tt>L2CEval</tt>: Evaluating Language-to-Code Generation Capabilities of Large Language Models.
Trans. Assoc. Comput. Linguistics, 2024

JudgeRank: Leveraging Large Language Models for Reasoning-Intensive Reranking.
CoRR, 2024

Asynchronous Tool Usage for Real-Time Agents.
CoRR, 2024

PRACT: Optimizing Principled Reasoning and Acting of LLM Agent.
CoRR, 2024

Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency.
CoRR, 2024

xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs.
CoRR, 2024

Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage.
CoRR, 2024

XForecast: Evaluating Natural Language Explanations for Time Series Forecasting.
CoRR, 2024

Trust but Verify: Programmatic VLM Evaluation in the Wild.
CoRR, 2024

Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts.
CoRR, 2024

GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation.
CoRR, 2024

Automatic Curriculum Expert Iteration for Reliable LLM Reasoning.
CoRR, 2024

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs.
CoRR, 2024

FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows".
CoRR, 2024

ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement.
CoRR, 2024

Direct Judgement Preference Optimization.
CoRR, 2024

xLAM: A Family of Large Action Models to Empower AI Agent Systems.
CoRR, 2024

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations.
CoRR, 2024

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models.
CoRR, 2024

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents.
CoRR, 2024

Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research.
CoRR, 2024

Personalized Multi-task Training for Recommender System.
CoRR, 2024

ThinK: Thinner Key Cache by Query-Driven Pruning.
CoRR, 2024

Shared Imagination: LLMs Hallucinate Alike.
CoRR, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons.
CoRR, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
CoRR, 2024

INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness.
CoRR, 2024

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets.
CoRR, 2024

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens.
CoRR, 2024

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases.
CoRR, 2024

UniTST: Effectively Modeling Inter-Series and Intra-Series Dependencies for Multivariate Time Series Forecasting.
CoRR, 2024

RLHF Workflow: From Reward Modeling to Online RLHF.
CoRR, 2024

Investigating the prompt leakage effect and black-box defenses for multi-turn LLM interactions.
CoRR, 2024

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments.
CoRR, 2024

How Much are LLMs Contaminated? A Comprehensive Survey and the LLMSanitize Library.
CoRR, 2024

AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System.
CoRR, 2024

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning.
CoRR, 2024

Text2Data: Low-Resource Data Generation with Textual Control.
CoRR, 2024

Editing Arbitrary Propositions in LLMs without Subject Labels.
CoRR, 2024

Parameter-Efficient Detoxification with Contrastive Decoding.
CoRR, 2024

TrustLLM: Trustworthiness in Large Language Models.
CoRR, 2024

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions.
CoRR, 2024

Beyond the Chat: Executable and Verifiable Text-Editing with LLMs.
Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024

Fair Abstractive Summarization of Diverse Perspectives.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

ARM: Alignment with Residual Energy-Based Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Hierarchical Point Attention for Indoor 3D Object Detection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Unified Training of Universal Time Series Forecasting Transformers.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Lemur: Harmonizing Natural Language and Code for Language Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer.
Proceedings of the Computer Vision - ECCV 2024, 2024

X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning.
Proceedings of the Computer Vision - ECCV 2024, 2024

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

ULIP-2: Towards Scalable Multimodal Pre-Training for 3D Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Diffusion Model Alignment Using Direct Preference Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HIVE: Harnessing Human Feedback for Instructional Visual Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Causal Layering via Conditional Entropy.
Proceedings of the Causal Learning and Reasoning, 2024

FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Marvista: Exploring the Design of a Human-AI Collaborative News Reading Tool.
ACM Trans. Comput. Hum. Interact., December, 2023

Improving Tail-Class Representation with Centroid Contrastive Learning.
Pattern Recognit. Lett., April, 2023

Merlion: End-to-End Machine Learning for Time Series.
J. Mach. Learn. Res., 2023

Unlocking Anticipatory Text Generation: A Constrained Approach for Faithful Decoding with Large Language Models.
CoRR, 2023

X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning.
CoRR, 2023

ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?
CoRR, 2023

Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment.
CoRR, 2023

OpenAgents: An Open Platform for Language Agents in the Wild.
CoRR, 2023

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models.
CoRR, 2023

XGen-7B Technical Report.
CoRR, 2023

Exploring the Integration Strategies of Retriever and Large Language Models.
CoRR, 2023

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents.
CoRR, 2023

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.
CoRR, 2023

REX: Rapid Exploration and eXploitation for AI Agents.
CoRR, 2023

LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond.
CoRR, 2023

ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding.
CoRR, 2023

Answering Complex Questions over Text by Hybrid Question Parsing and Execution.
CoRR, 2023

CodeGen2: Lessons for Training LLMs on Programming and Natural Languages.
CoRR, 2023

On the Unlikelihood of D-Separation.
CoRR, 2023

A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT.
CoRR, 2023

Salesforce CausalAI Library: A Fast and Scalable Framework for Causal Analysis of Time Series and Tabular Data.
CoRR, 2023

Model-Agnostic Hierarchical Attention for 3D Object Detection.
CoRR, 2023

Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System.
Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

Preference-grounded Token-level Guidance for Language Model Fine-tuning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Lower Bounds for Learning in Revealing POMDPs.
Proceedings of the International Conference on Machine Learning, 2023

Improved Online Conformal Prediction via Strongly Adaptive Online Learning.
Proceedings of the International Conference on Machine Learning, 2023

Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Binding Language Models in Symbolic Languages.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Robustness Evaluation of Transformer-Based Form Field Extractors via Form Attacks.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Salespeople vs SalesBot: Exploring the Role of Educational Value in Conversational Recommender Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

HPE: Answering Complex Questions over Text by Hybrid Question Parsing and Execution.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

SharPT: Shared Latent Space Prompt Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Long Document Summarization with Top-down and Bottom-up Inference.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

What's New? Summarizing Contributions in Scientific Literature.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Zero-shot Item-based Recommendation via Multi-task Product Knowledge Graph Pre-Training.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Best-k Search Algorithm for Neural Text Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SWiPE: A Dataset for Document-Level Simplification of Wikipedia Pages.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Improving Gender Fairness of Pre-Trained Language Models without Catastrophic Forgetting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Learning to Play General-Sum Games against Multiple Boundedly Rational Agents.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
FeTaQA: Free-form Table Question Answering.
Trans. Assoc. Comput. Linguistics, 2022

A Dynamic Frame Selection Framework for Fast Video Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding.
CoRR, 2022

FOLIO: Natural Language Reasoning with First-Order Logic.
CoRR, 2022

Generating Negative Samples for Sequential Recommendation.
CoRR, 2022

BigIssue: A Realistic Bug Localization Benchmark.
CoRR, 2022

Marvista: A Human-AI Collaborative Reading Tool.
CoRR, 2022

MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation.
CoRR, 2022

Improving Contrastive Learning with Model Augmentation.
CoRR, 2022

A Conversational Paradigm for Program Synthesis.
CoRR, 2022

Converse: A Tree-Based Modular Task-Oriented Dialogue System.
CoRR, 2022

Structure Extraction in Task-Oriented Dialogues with Slot Clustering.
CoRR, 2022

Intent Contrastive Learning for Sequential Recommendation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

RGRecSys: A Toolkit for Robustness Evaluation of Recommender Systems.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Local calibration: metrics and recalibration.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

ELECRec: Training Sequential Recommenders as Discriminators.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Policy Optimization for Markov Games: Unified Framework and Faster Convergence.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Ensemble of Averages: Improving Model Selection and Boosting Performance in Domain Generalization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MixQG: Neural Question Generation with Mixed Answer Types.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Quiz Design Task: Helping Teachers Create Quizzes with Automated Question Generation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

NaturalCC: An Open-Source Toolkit for Code Intelligence.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering: Companion Proceedings, 2022

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation.
Proceedings of the International Conference on Machine Learning, 2022

Efficient and Differentiable Conformal Prediction with General Function Classes.
Proceedings of the Tenth International Conference on Learning Representations, 2022

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Prompt-Tuning Can Be Much Better Than Fine-Tuning on Cross-lingual Understanding With Multilingual Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Uni-Parser: Unified Semantic Parser for Question Answering on Knowledge Base and Database.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SPE: Symmetrical Prompt Enhancement for Fact Probing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Discord Questions: A Computational Approach To Diversity Analysis in News Coverage.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

BOOKSUM: A Collection of Datasets for Long-form Narrative Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CTRLsum: Towards Generic Controllable Text Summarization.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Improving Factual Consistency in Summarization with Compression-Based Post-Editing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Open Vocabulary Object Detection with Pseudo Bounding-Box Labels.
Proceedings of the Computer Vision - ECCV 2022, 2022

Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Modeling Multi-hop Question Answering as Single Sequence Prediction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

QAConv: Question Answering on Informative Conversations.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

[CASPI] Causal-aware Safe Policy Improvement for Task-oriented Dialogue.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

OneAligner: Zero-shot Cross-lingual Transfer with One Rich-Resource Language Pair for Low-Resource Sentence Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

DialFact: A Benchmark for Fact-Checking in Dialogue.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

ConTinTin: Continual Learning from Task Instructions.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Are Pre-trained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection.
Proceedings of the 4th Workshop on NLP for Conversational AI, 2022

2021
SummEval: Re-evaluating Summarization Evaluation.
Trans. Assoc. Comput. Linguistics, 2021

A Coarse-to-Fine Framework for Resource Efficient Video Recognition.
Int. J. Comput. Vis., 2021

Value Retrieval with Arbitrary Queries for Form-like Documents.
CoRR, 2021

Combining Data-driven Supervision with Human-in-the-loop Feedback for Entity Resolution.
CoRR, 2021

Towards Open Vocabulary Object Detection without Human-provided Bounding Boxes.
CoRR, 2021

Momentum Contrastive Autoencoder: Using Contrastive Learning for Latent Space Distribution Matching in WAE.
CoRR, 2021

Learning Rich Nearest Neighbor Representations from Self-supervised Ensembles.
CoRR, 2021

MixQG: Neural Question Generation with Mixed Answer Types.
CoRR, 2021

Field Extraction from Forms with Unlabeled Data.
CoRR, 2021

Modeling Dynamic Attributes for Next Basket Recommendation.
CoRR, 2021

Merlion: A Machine Learning Library for Time Series.
CoRR, 2021

Contrastive Self-supervised Sequential Recommendation with Robust Augmentation.
CoRR, 2021

ERMAS: Becoming Robust to Reward Function Sim-to-Real Gaps in Multi-Agent Simulations.
CoRR, 2021

Are Pretrained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection.
CoRR, 2021

FeTaQA: Free-form Table Question Answering.
CoRR, 2021

Causal-aware Safe Policy Improvement for Task-oriented dialogue.
CoRR, 2021

Localized Calibration: Metrics and Recalibration.
CoRR, 2021

Robustness Gym: Unifying the NLP Evaluation Landscape.
CoRR, 2021

Proposal Learning for Semi-Supervised Object Detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Task similarity aware meta learning: theory-inspired improvement on MAML.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Pseudo Siamese Network for Few-shot Intent Generation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

A Theory-Driven Self-Labeling Refinement Method for Contrastive Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Evaluating State-of-the-Art Classification Models Against Bayes Optimality.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Understanding the Under-Coverage Bias in Uncertainty Estimation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning to Synthesize Data for Semantic Parsing.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

SCRIPT: Self-Critic PreTraining of Transformers.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

DART: Open-Domain Structured Data Record to Text Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification.
Proceedings of the 38th International Conference on Machine Learning, 2021

How Important is the Train-Validation Split in Meta-Learning?
Proceedings of the 38th International Conference on Machine Learning, 2021

BERTology Meets Biology: Interpreting Attention in Protein Language Models.
Proceedings of the 9th International Conference on Learning Representations, 2021

CoCo: Controllable Counterfactuals for Evaluating Dialogue State Trackers.
Proceedings of the 9th International Conference on Learning Representations, 2021

Representation Learning for Sequence Data with Deep Autoencoding Predictive Components.
Proceedings of the 9th International Conference on Learning Representations, 2021

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing.
Proceedings of the 9th International Conference on Learning Representations, 2021

Prototypical Contrastive Learning of Unsupervised Representations.
Proceedings of the 9th International Conference on Learning Representations, 2021

MoPro: Webly Supervised Learning with Momentum Prototypes.
Proceedings of the 9th International Conference on Learning Representations, 2021

Learning from Noisy Data with Robust Representation Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CoMatch: Semi-supervised Learning with Contrastive Graph Regularization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Paraphrasing with Pretrained Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Dense Hierarchical Retrieval for Open-domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Joint Energy-based Model Training for Better Calibrated Natural Language Understanding Models.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Structured Scene Memory for Vision-Language Navigation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

On the Diversity and Explainability of Recommender Systems: A Practical Framework for Enterprise App Recommendation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

GAEA: Graph Augmentation for Equitable Access via Reinforcement Learning.
Proceedings of the AIES '21: AAAI/ACM Conference on AI, 2021

BatchMixup: Improving Training by Interpolating Hidden States of the Entire Mini-batch.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

DocNLI: A Large-scale Dataset for Document-level Natural Language Inference.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Unsupervised Out-of-Domain Detection via Pre-trained Transformers.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Controllable Abstractive Dialogue Summarization with Sketch Supervision.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Deep Verifier Networks: Verification of Deep Discriminative Models with Deep Generative Models.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
NaturalCC: A Toolkit to Naturalize the Source Code Corpus.
CoRR, 2020

Unsupervised Paraphrase Generation via Dynamic Blocking.
CoRR, 2020

Explaining and Improving Model Behavior with k Nearest Neighbor Representations.
CoRR, 2020

DART: Open-Domain Structured Data Record to Text Generation.
CoRR, 2020

EMT: Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading.
CoRR, 2020

Prototypical Contrastive Learning of Unsupervised Representations.
CoRR, 2020

ToD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogues.
CoRR, 2020

Adv-BERT: BERT is not robust on misspellings! Generating nature adversarial samples on BERT.
CoRR, 2020

Towards Noise-resistant Object Detection with Noisy Annotations.
CoRR, 2020

Differentially Private Deep Learning with Smooth Sensitivity.
CoRR, 2020

Neural Bayes: A Generic Parameterization Method for Unsupervised Representation Learning.
CoRR, 2020

Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width.
CoRR, 2020

Proposal Learning for Semi-Supervised Object Detection.
CoRR, 2020

Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking.
Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics, 2020

Theory-Inspired Path-Regularized Differential Network Architecture Search.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Online Structured Meta-learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Towards Understanding Hierarchical Learning: Benefits of Neural Representations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

An Investigation of Phone-Based Subword Units for End-to-End Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills.
Proceedings of the 37th International Conference on Machine Learning, 2020

Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering.
Proceedings of the 8th International Conference on Learning Representations, 2020

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Simple Data Augmentation with the Mask Token Improves Domain Adaptation for Dialog Act Tagging.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Composed Variational Natural Language Generation for Few-shot Intents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Probing Task-Oriented Dialogue Representation from Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Improving Limited Labeled Dialogue State Tracking with Self-Supervision.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

VD-BERT: A Unified Vision and Dialog Transformer with BERT.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Evaluating the Factual Consistency of Abstractive Text Summarization.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Thieves on Sesame Street are Polyglots - Extracting Multilingual Models from Monolingual APIs.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Learning From Noisy Anchors for One-Stage Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Building Salesforce Neural Machine Translation System.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

Assessing Local Generalization Capability in Deep Models.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Photon: A Robust Cross-Domain Text-to-SQL System.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

ESPRIT: Explaining Solutions to Physical Reasoning Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

ERASER: A Benchmark to Evaluate Rationalized NLP Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Attentive Student Meets Multi-Task Teacher: Improved Knowledge Distillation for Pretrained Models.
CoRR, 2019

Sketch-Fill-A-R: A Persona-Grounded Chit-Chat Generation Framework.
CoRR, 2019

Global Capacity Measures for Deep ReLU Networks via Path Sampling.
CoRR, 2019

Entropy Penalty: Towards Generalization Beyond the IID Assumption.
CoRR, 2019

CTRL: A Conditional Transformer Language Model for Controllable Generation.
CoRR, 2019

Deleter: Leveraging BERT to Perform Unsupervised Successive Text Compression.
CoRR, 2019

Learning World Graphs to Accelerate Hierarchical Reinforcement Learning.
CoRR, 2019

XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering.
CoRR, 2019

Unifying Question Answering and Text Classification via Span Extraction.
CoRR, 2019

A High-Quality Multilingual Dataset for Structured Documentation Translation.
Proceedings of the Fourth Conference on Machine Translation, 2019

LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On the Generalization Gap in Reparameterizable Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Taming MAML: Efficient unbiased meta-reinforcement learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Learn to Grow: A Continual Structure Learning Framework for Overcoming Catastrophic Forgetting.
Proceedings of the 36th International Conference on Machine Learning, 2019

Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering.
Proceedings of the 7th International Conference on Learning Representations, 2019

Global-to-local Memory Pointer Networks for Task-Oriented Dialogue.
Proceedings of the 7th International Conference on Learning Representations, 2019

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation.
Proceedings of the 7th International Conference on Learning Representations, 2019

Competitive experience replay.
Proceedings of the 7th International Conference on Learning Representations, 2019

Augmented Cyclic Adversarial Learning for Low Resource Domain Adaptation.
Proceedings of the 7th International Conference on Learning Representations, 2019

A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation.
Proceedings of the 7th International Conference on Learning Representations, 2019

StartNet: Online Detection of Action Start in Untrimmed Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Neural Text Summarization: A Critical Evaluation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

WSLLN: Weakly Supervised Natural Language Localization Networks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

AdaFrame: Adaptive Frame Selection for Fast Video Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

SParC: Cross-Domain Semantic Parsing in Context.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Explain Yourself! Leveraging Language Models for Commonsense Reasoning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

BERT is Not an Interlingua and the Bias of Tokenization.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

2018
Interactive Agent Modeling by Learning to Probe.
CoRR, 2018

Identifying Generalization Properties in Neural Networks.
CoRR, 2018

Augmented Cyclic Adversarial Learning for Domain Adaptation.
CoRR, 2018

The Natural Language Decathlon: Multitask Learning as Question Answering.
CoRR, 2018

Using Mode Connectivity for Loss Landscape Analysis.
CoRR, 2018

Global-Locally Self-Attentive Dialogue State Tracker.
CoRR, 2018

A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

DCN+: Mixed Objective And Deep Residual Coattention for Question Answering.
Proceedings of the 6th International Conference on Learning Representations, 2018

Interpretable Counting for Visual Question Answering.
Proceedings of the 6th International Conference on Learning Representations, 2018

Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

A Deep Reinforced Model for Abstractive Summarization.
Proceedings of the 6th International Conference on Learning Representations, 2018

Non-Autoregressive Neural Machine Translation.
Proceedings of the 6th International Conference on Learning Representations, 2018

Improving End-to-End Speech Recognition with Policy Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Hop Knowledge Graph Reasoning with Reward Shaping.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Improving Abstraction in Text Summarization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

End-to-End Dense Video Captioning With Masked Transformer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Efficient and Robust Question Answering from Minimal Context over Documents.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Global-Locally Self-Attentive Encoder for Dialogue State Tracking.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Neural Abstract Style Transfer for Chinese Traditional Painting.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Active Clustering with Model-Based Uncertainty Reduction.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Block-diagonal Hessian-free Optimization for Training Neural Networks.
CoRR, 2017

Improved Regularization Techniques for End-to-End Speech Recognition.
CoRR, 2017

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning.
CoRR, 2017

Action Understanding with Multiple Classes of Actors.
CoRR, 2017

Learned in Translation: Contextualized Word Vectors.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Dynamic Coattention Networks For Question Answering.
Proceedings of the 5th International Conference on Learning Representations, 2017

Pointer Sentinel Mixture Models.
Proceedings of the 5th International Conference on Learning Representations, 2017

Quasi-Recurrent Neural Networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Semi-Supervised Nonlinear Distance Metric Learning via Forests of Max-Margin Cluster Hierarchies.
IEEE Trans. Knowl. Data Eng., 2016

A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs.
CoRR, 2016

Grounded Semantic Role Labeling.
Proceedings of the NAACL HLT 2016, 2016

Robot learning with a spatial, temporal, and causal and-or graph.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Dynamic Memory Networks for Visual and Textual Question Answering.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Recognizing Car Fluents from Video.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Maximum Margin Dirichlet Process Mixtures for Clustering.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Accurate Annotation of Remote Sensing Images via Active Spectral Clustering with Little Expert Knowledge.
Remote. Sens., 2015

Can humans fly? Action understanding with multiple classes of actors.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Joint action recognition and pose estimation from video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

A Unified Framework for Human-Robot Knowledge Transfer.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Compositional Structure Learning for Action Understanding.
CoRR, 2014

Adaptive Quantization for Hashing: An Information-Based Approach to Learning Binary Codes.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Spectral active clustering of remote sensing images.
Proceedings of the 2014 IEEE Geoscience and Remote Sensing Symposium, 2014

Seeing is Worse than Believing: Reading People's Minds Better than Computer-Vision Methods Recognize Actions.
Proceedings of the Computer Vision - ECCV 2014, 2014

Actionness Ranking with Lattice Conditional Ordinal Random Fields.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Latent Domains Modeling for Visual Domain Adaptation.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Uncertainty Reduction for Active Image Clustering via a Hybrid Global-Local Uncertainty Model.
Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

Comprehensive Cross-Hierarchy Cluster Agreement Evaluation.
Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

2012
Random forests for metric learning with implicit pairwise position dependence.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Dictionary transfer for image denoising via domain adaptation.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Streaming Hierarchical Video Segmentation.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
AirTouch: Interacting with computer systems at a distance.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Towards a parts-based approach to sub-cortical brain structure parsing.
Proceedings of the Medical Imaging 2011: Image Processing, 2011

2009
From image parsing to painterly rendering.
ACM Trans. Graph., 2009

Marker-less registration based on template tracking for augmented reality.
Multim. Tools Appl., 2009


  Loading...