Percy Liang

Orcid: 0000-0002-0458-6139

Affiliations:
  • Stanford University, Computer Science Department


According to our database, Percy Liang authored at least 278 papers between 2005 and 2024.

Bibliography

2024
Anticipatory Music Transformer.
Trans. Mach. Learn. Res., 2024

Robust Distortion-free Watermarks for Language Models.
Trans. Mach. Learn. Res., 2024

Benchmarking Large Language Models for News Summarization.
Trans. Assoc. Comput. Linguistics, 2024

Lost in the Middle: How Language Models Use Long Contexts.
Trans. Assoc. Comput. Linguistics, 2024

Image2Struct: Benchmarking Structure Extraction for Vision-Language Models.
CoRR, 2024

Model Equality Testing: Which Model Is This API Serving?
CoRR, 2024

VideoAgent: Self-Improving Video Generation.
CoRR, 2024

Language model developers should report train-test overlap.
CoRR, 2024

Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making.
CoRR, 2024

VHELM: A Holistic Evaluation of Vision Language Models.
CoRR, 2024

Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape Perspective.
CoRR, 2024

Instruction Following without Instruction Tuning.
CoRR, 2024

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models.
CoRR, 2024

AIR-Bench 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies.
CoRR, 2024

The Foundation Model Transparency Index v1.1: May 2024.
CoRR, 2024

AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models.
CoRR, 2024

AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies.
CoRR, 2024

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources.
CoRR, 2024

OpenVLA: An Open-Source Vision-Language-Action Model.
CoRR, 2024

BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments.
CoRR, 2024

Introducing v0.5 of the AI Safety Benchmark from MLCommons.
CoRR, 2024

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators.
CoRR, 2024

FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning.
CoRR, 2024

BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text.
CoRR, 2024

On the Societal Impact of Open Foundation Models.
CoRR, 2024

A Safe Harbor for AI Evaluation and Red Teaming.
CoRR, 2024

Foundation Model Transparency Reports.
CoRR, 2024

Model Editing with Canonical Examples.
CoRR, 2024

Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Large Language Models as Analogical Reasoners.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Benchmarking and Improving Generator-Validator Consistency of Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

On the Learnability of Watermarks for Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
Trans. Mach. Learn. Res., 2023

Holistic Evaluation of Language Models.
Trans. Mach. Learn. Res., 2023

Evaluating Human-Language Model Interaction.
Trans. Mach. Learn. Res., 2023

Foundation Models and Fair Use.
J. Mach. Learn. Res., 2023

Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation.
CoRR, 2023

The Foundation Model Transparency Index.
CoRR, 2023

Benchmarking Large Language Models As AI Research Agents.
CoRR, 2023

Just One Byte (per gradient): A Note on Low-Bandwidth Decentralized Language Model Finetuning Using Shared Randomness.
CoRR, 2023

Has the Machine Learning Review Process Become More Arbitrary as the Field Has Grown? The NeurIPS 2021 Consistency Experiment.
CoRR, 2023

Lexinvariant Language Models.
CoRR, 2023

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback.
CoRR, 2023

Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs.
CoRR, 2023

Ecosystem Graphs: The Social Footprint of Foundation Models.
CoRR, 2023

High-throughput Generative Inference of Large Language Models with a Single GPU.
CoRR, 2023

Improving Representational Continuity via Continued Pretraining.
CoRR, 2023

Generative Agents: Interactive Simulacra of Human Behavior.
Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, 2023

Language-Driven Representation Learning for Robotics.
Proceedings of the Robotics: Science and Systems XIX, Daegu, 2023

Data Selection for Language Models via Importance Resampling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Ecosystem-level Analysis of Deployed Machine Learning Reveals Homogeneous Outcomes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cheaply Estimating Inference Efficiency Metrics for Autoregressive Transformer Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Holistic Evaluation of Text-to-Image Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Lexinvariant Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PRODIGY: Enabling In-context Learning Over Graphs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Retrieval-Augmented Multimodal Language Modeling.
Proceedings of the International Conference on Machine Learning, 2023

CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks.
Proceedings of the International Conference on Machine Learning, 2023

Whose Opinions Do Language Models Reflect?
Proceedings of the International Conference on Machine Learning, 2023

Out-of-Domain Robustness via Targeted Augmentations.
Proceedings of the International Conference on Machine Learning, 2023

Evaluating Self-Supervised Learning via Risk Decomposition.
Proceedings of the International Conference on Machine Learning, 2023

One-sided Matrix Completion from Two Observations Per Row.
Proceedings of the International Conference on Machine Learning, 2023

FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU.
Proceedings of the International Conference on Machine Learning, 2023

Is a Caption Worth a Thousand Images? A Study on Representation Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Surgical Fine-Tuning Improves Adaptation to Distribution Shifts.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

No, to the Right: Online Language Corrections for Robotic Manipulation via Shared Autonomy.
Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction, 2023

Evaluating Verifiability in Generative Search Engines.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Do Question Answering Modeling Improvements Hold Across Benchmarks?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Are Sample-Efficient NLP Models More Robust?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Contrastive Decoding: Open-ended Text Generation as Optimization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Backpack Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Emergent Abilities of Large Language Models.
Trans. Mach. Learn. Res., 2022

Stronger data poisoning attacks break data sanitization defenses.
Mach. Learn., 2022

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP.
CoRR, 2022

Trustworthy Social Bias Measurement.
CoRR, 2022

How do Authors' Perceptions of their Papers Compare with Co-authors' Perceptions and Peer-review Decisions?
CoRR, 2022

Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning.
CoRR, 2022

GreaseLM: Graph REASoning Enhanced Language Models for Question Answering.
CoRR, 2022

Social Simulacra: Creating Populated Prototypes for Social Computing Systems.
Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022

Calibrated ensembles can mitigate accuracy tradeoffs under distribution shift.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Decentralized Training of Foundation Models in Heterogeneous Environments.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Deep Bidirectional Language-Knowledge Graph Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Insights into Pre-training via Simpler Synthetic Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Diffusion-LM Improves Controllable Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Self-Supervised Learning by Characterizing Idealized Representations.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Picking on the Same Person: Does Algorithmic Monoculture lead to Outcome Homogenization?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

What Can Transformers Learn In-Context? A Case Study of Simple Function Classes.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Melody transcription via generative pre-training.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Connect, Not Collapse: Explaining Contrastive Learning for Unsupervised Domain Adaptation.
Proceedings of the International Conference on Machine Learning, 2022

An Explanation of In-context Learning as Implicit Bayesian Inference.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Extending the WILDS Benchmark for Unsupervised Adaptation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Large Language Models Can Be Strong Differentially Private Learners.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Fine-Tuning can Distort Pretrained Features and Underperform Out-of-Distribution.
Proceedings of the Tenth International Conference on Learning Representations, 2022

GreaseLM: Graph REASoning Enhanced Language Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Truncation Sampling as Language Model Desmoothing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

LinkBERT: Pretraining Language Models with Document Links.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
No True State-of-the-Art? OOD Detection Methods are Inconsistent across Datasets.
CoRR, 2021

Can Small and Synthetic Benchmarks Drive Modeling Innovation? A Retrospective Study of Question Answering Modeling Approaches.
CoRR, 2021

Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases.
Proceedings of the WWW '21: The Web Conference 2021, 2021

QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Swords: A Benchmark for Lexical Substitution with Improved Data Coverage and Quality.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Codified audio language modeling learns useful representations for music information retrieval.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Break-It-Fix-It: Unsupervised Learning for Program Repair.
Proceedings of the 38th International Conference on Machine Learning, 2021

Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Accuracy on the Line: on the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices.
Proceedings of the 38th International Conference on Machine Learning, 2021

Just Train Twice: Improving Group Robustness without Training Group Information.
Proceedings of the 38th International Conference on Machine Learning, 2021

Catformer: Designing Stable Transformers via Sensitivity Analysis.
Proceedings of the 38th International Conference on Machine Learning, 2021

In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness.
Proceedings of the 9th International Conference on Learning Representations, 2021

Selective Classification Can Magnify Disparities Across Groups.
Proceedings of the 9th International Conference on Learning Representations, 2021

Removing Spurious Features can Hurt Accuracy and Affect Groups Disproportionately.
Proceedings of the FAccT '21: 2021 ACM Conference on Fairness, 2021

LM-Critic: Language Models for Unsupervised Grammatical Error Correction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Conditional probing: measuring usable information beyond a baseline.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

LILA: Language-Informed Latent Actions.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

Prefix-Tuning: Optimizing Continuous Prompts for Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Task-Oriented Dialogue as Dataflow Synthesis.
Trans. Assoc. Comput. Linguistics, 2020

WILDS: A Benchmark of in-the-Wild Distribution Shifts.
CoRR, 2020

Learning Adaptive Language Interfaces through Decomposition.
CoRR, 2020

Explore then Execute: Adapting without Rewards via Factorized Meta-Reinforcement Learning.
CoRR, 2020

Learning Abstract Models for Strategic Exploration and Fast Reward Transfer.
CoRR, 2020

Simplifying Models with Unlabeled Output Data.
CoRR, 2020

A Tight Analysis of Greedy Yields Subexponential Time Approximation for Uniform Decision Tree.
Proceedings of the 2020 ACM-SIAM Symposium on Discrete Algorithms, 2020

Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Graph-based, Self-Supervised Program Repair from Diagnostic Feedback.
Proceedings of the 37th International Conference on Machine Learning, 2020

Robustness to Spurious Correlations via Human Annotations.
Proceedings of the 37th International Conference on Machine Learning, 2020

An Investigation of Why Overparameterization Exacerbates Spurious Correlations.
Proceedings of the 37th International Conference on Machine Learning, 2020

Understanding and Mitigating the Tradeoff between Robustness and Accuracy.
Proceedings of the 37th International Conference on Machine Learning, 2020

Understanding Self-Training for Gradual Domain Adaptation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Concept Bottleneck Models.
Proceedings of the 37th International Conference on Machine Learning, 2020

Feature Noise Induces Loss Discrepancy Across Groups.
Proceedings of the 37th International Conference on Machine Learning, 2020

Distributionally Robust Neural Networks.
Proceedings of the 8th International Conference on Learning Representations, 2020

Strategies for Pre-training Graph Neural Networks.
Proceedings of the 8th International Conference on Learning Representations, 2020

Selection via Proxy: Efficient Data Selection for Deep Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

RNNs can generate bounded hierarchical languages with optimal memory.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The EOS Decision and Length Extrapolation.
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

ExpBERT: Representation Engineering with Natural Language Explanations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Selective Question Answering under Domain Shift.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Robust Encodings: A Framework for Combating Adversarial Typos.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Enabling Language Models to Fill in the Blanks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
FrAngel: component-based synthesis with control structures.
Proc. ACM Program. Lang., 2019

Noise Induces Loss Discrepancy Across Groups for Linear Regression.
CoRR, 2019

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization.
CoRR, 2019

Learning Autocomplete Systems as a Communication Game.
CoRR, 2019

Adversarial Training Can Hurt Generalization.
CoRR, 2019

Maximum Weighted Loss Discrepancy.
CoRR, 2019

Pre-training Graph Neural Networks.
CoRR, 2019

Ambitious Data Science Can Be Painless.
CoRR, 2019

Shaping Visual Representations with Language for Few-shot Classification.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

Verified Uncertainty Calibration.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

SPoC: Search-based Pseudocode to Code.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On the Accuracy of Influence Functions for Measuring Group Effects.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Unlabeled Data Improves Adversarial Robustness.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Pun Generation with Surprise.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Unifying Human and Statistical Evaluation for Natural Language Generation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Learning a SAT Solver from Single-Bit Supervision.
Proceedings of the 7th International Conference on Learning Representations, 2019

Distributionally Robust Language Modeling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Certified Robustness to Adversarial Word Substitutions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Designing and Interpreting Probes with Control Tasks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Defending against Whitebox Adversarial Attacks via Randomized Discretization.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Inferring Multidimensional Rates of Aging from Cross-Sectional Data.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Planning, Inference, and Pragmatics in Sequential Language Games.
Trans. Assoc. Comput. Linguistics, 2018

Generating Sentences by Editing Prototypes.
Trans. Assoc. Comput. Linguistics, 2018

Transforming Question Answering Datasets Into Natural Language Inference Datasets.
CoRR, 2018

Inferring Multi-Dimensional Rates of Aging from Cross-Sectional Data.
CoRR, 2018

The price of debiasing automatic metrics in natural language evaluation.
CoRR, 2018

Prediction with a short memory.
Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, 2018

Active learning of points-to specifications.
Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2018

Semidefinite relaxations for certifying robustness to adversarial examples.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Uncertainty Sampling is Preconditioned Stochastic Gradient Descent on Zero-One Loss.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Retrieve-and-Edit Framework for Predicting Structured Outputs.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Delete, Retrieve, Generate: a Simple Approach to Sentiment and Style Transfer.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

On the Relationship between Data Efficiency and Error for Uncertainty Sampling.
Proceedings of the 35th International Conference on Machine Learning, 2018

Fairness Without Demographics in Repeated Loss Minimization.
Proceedings of the 35th International Conference on Machine Learning, 2018

Certified Defenses against Adversarial Examples.
Proceedings of the 6th International Conference on Learning Representations, 2018

Reinforcement Learning on Web Interfaces using Workflow-Guided Exploration.
Proceedings of the 6th International Conference on Learning Representations, 2018

Mapping natural language commands to web elements.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Textual Analogy Parsing: What's Shared and What's Compared among Analogous Facts.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Decoupling Strategy and Generation in Negotiation Dialogues.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

QuAC: Question Answering in Context.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Generalized Binary Search For Split-Neighborly Problems.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

Know What You Don't Know: Unanswerable Questions for SQuAD.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Training Classifiers with Natural Language Explanations.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

The price of debiasing automatic metrics in natural language evaluation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Synthesizing program input grammars.
Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2017

Certified Defenses for Data Poisoning Attacks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning Overcomplete HMMs.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Unsupervised Transformation Learning via Convex Relaxations.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Convexified Convolutional Neural Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

World of Bits: An Open-Domain Platform for Web-Based Agents.
Proceedings of the 34th International Conference on Machine Learning, 2017

Developing Bug-Free Machine Learning Systems With Formal Mathematics.
Proceedings of the 34th International Conference on Machine Learning, 2017

Understanding Black-box Predictions via Influence Functions.
Proceedings of the 34th International Conference on Machine Learning, 2017

Macro Grammars and Holistic Triggering for Efficient Semantic Parsing.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Adversarial Examples for Evaluating Reading Comprehension Systems.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Importance sampling for unbiased on-demand evaluation of knowledge base population.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Hitting Time Analysis of Stochastic Gradient Langevin Dynamics.
Proceedings of the 30th Conference on Learning Theory, 2017

Naturalizing a Programming Language via Interactive Learning.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Learning Symmetric Collaborative Dialogue Agents with Dynamic Knowledge Graph Embeddings.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Learning executable semantic parsers for natural language understanding.
Commun. ACM, 2016

Unsupervised Risk Estimation Using Only Conditional Independence Structure.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Estimation from Indirect Supervision with Linear Moments.
Proceedings of the 33rd International Conference on Machine Learning, 2016

SQuAD: 100,000+ Questions for Machine Comprehension of Text.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Learning Language Games through Interaction.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Inferring Logical Forms From Denotations.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Simpler Context-Dependent Logical Forms via Model Projections.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Unanimous Prediction for 100% Precision with Application to Learning Semantic Mappings.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Data Recombination for Neural Semantic Parsing.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

How Much is 131 Million Dollars? Putting Numbers in Perspective with Compositional Descriptions.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Imitation Learning of Agenda-based Semantic Parsers.
Trans. Assoc. Comput. Linguistics, 2015

Simultaneous diagonalization: the asymmetric, low-rank, and noisy settings.
CoRR, 2015

On-the-Job Learning with Bayesian Decision Theory.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Estimating Mixture Models via Mixtures of Polynomials.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Learning with Relaxed Supervision.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Calibrated Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Learning Fast-Mixing Models for Structured Prediction.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Reified Context Models.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Traversing Knowledge Graphs in Vector Space.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Learning Where to Sample in Structured Prediction.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Tensor Factorization via Matrix Factorization.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Building a Semantic Parser Overnight.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Compositional Semantic Parsing on Semi-Structured Tables.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Environment-Driven Lexicon Induction for High-Level Instructions.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Talking to computers in natural language.
XRDS, 2014

Relaxations for inference in restricted Boltzmann machines.
Proceedings of the 2nd International Conference on Learning Representations, 2014

The Statistics of Streaming Sparse Regression.
CoRR, 2014

Altitude Training: Strong Bounds for Single-Layer Dropout.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Simple MAP Inference via Low-Rank Relaxations.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Adaptivity and Optimism: An Improved Exponentiated Gradient Algorithm.
Proceedings of the 31st International Conference on Machine Learning, 2014

Filtering with Abstract Particles.
Proceedings of the 31st International Conference on Machine Learning, 2014

Estimating Latent-Variable Graphical Models using Moments and Likelihoods.
Proceedings of the 31st International Conference on Machine Learning, 2014

Linking People in Videos with "Their" Names Using Coreference Resolution.
Proceedings of the Computer Vision - ECCV 2014, 2014

Zero-shot Entity Extraction from Web Pages.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Semantic Parsing via Paraphrasing.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Lambda Dependency-Based Compositional Semantics.
CoRR, 2013

Learning Dependency-Based Compositional Semantics.
Comput. Linguistics, 2013

Dropout Training as Adaptive Regularization.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Spectral Experts for Estimating Mixtures of Linear Regressions.
Proceedings of the 30th International Conference on Machine Learning, 2013

Video Event Understanding Using Natural Language Descriptions.
Proceedings of the IEEE International Conference on Computer Vision, 2013

A Data Driven Approach for Algebraic Loop Invariants.
Proceedings of the Programming Languages and Systems, 2013

Feature Noising for Log-Linear Structured Prediction.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Semantic Parsing on Freebase from Question-Answer Pairs.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
Identifiability and Unmixing of Latent Parse Trees.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011
Learning Dependency-Based Compositional Semantics.
PhD thesis, 2011

Learning minimal abstractions.
Proceedings of the 38th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 2011

Scaling abstraction refinement via pruning.
Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, 2011

2010
A dynamic evaluation of the precision of static heap abstractions.
Proceedings of the 25th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2010

Type-Based MCMC.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

On the Interaction between Norm and Dimensionality: Multiple Regimes in Learning.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Learning Programs: A Hierarchical Bayesian Approach.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

A Game-Theoretic Approach to Generating Spatial Descriptions.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

A Simple Domain-Independent Probabilistic Approach to Generation.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

2009
Asymptotically Optimal Regularization in Smooth Parametric Models.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Online EM for Unsupervised Models.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Learning from measurements in exponential families.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Learning Semantic Correspondences with Less Supervision.
Proceedings of the ACL 2009, 2009

2008
An asymptotic analysis of generative, discriminative, and pseudolikelihood estimators.
Proceedings of the 25th International Conference on Machine Learning, 2008

Structure compilation: trading structure for features.
Proceedings of the 25th International Conference on Machine Learning, 2008

Analyzing the Errors of Unsupervised Learning.
Proceedings of the ACL 2008, 2008

Learning Bilingual Lexicons from Monolingual Corpora.
Proceedings of the ACL 2008, 2008

2007
Agreement-Based Learning.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

A Probabilistic Approach to Language Change.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

A permutation-augmented sampler for DP mixture models.
Proceedings of the 24th International Conference on Machine Learning, 2007

The Infinite PCFG Using Hierarchical Dirichlet Processes.
Proceedings of the EMNLP-CoNLL 2007, 2007

A Probabilistic Approach to Diachronic Phonology.
Proceedings of the EMNLP-CoNLL 2007, 2007

2006
Alignment by Agreement.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

An End-to-End Discriminative Approach to Machine Translation.
Proceedings of the ACL 2006, 2006

2005
Efficient Geometric Algorithms for Parsing in Two Dimensions.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005

Learning Non-Generative Grammatical Models for Document Analysis.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005
