David A. Sontag

Affiliations:
  • MIT, Cambridge, MA, USA


According to our database1, David A. Sontag authored at least 145 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Joint AI-driven event prediction and longitudinal modeling in newly diagnosed and relapsed multiple myeloma.
npj Digit. Medicine, 2024

Machine learning to predict notes for chart review in the oncology setting: a proof of concept strategy for improving clinician note-writing.
J. Am. Medical Informatics Assoc., 2024

Need Help? Designing Proactive AI Assistants for Programming.
CoRR, 2024

Seq-to-Final: A Benchmark for Tuning from Sequential Distributions to a Final Time Point.
CoRR, 2024

Theoretical Analysis of Weak-to-Strong Generalization.
CoRR, 2024

Evaluating Physician-AI Interaction for Cancer Management: Paving the Path towards Precision Oncology.
CoRR, 2024

The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers.
CoRR, 2024

Non-Invasive Medical Digital Twins using Physics-Informed Self-Supervised Learning.
CoRR, 2024

A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models.
CoRR, 2024

Impact of Large Language Model Assistance on Patients Reading Clinical Notes: A Mixed-Methods Study.
CoRR, 2024

Prediction-powered Generalization of Causal Inferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Benchmarking Observational Studies with Experimental Data under Right-Censoring.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Learning to Decode Collaboratively with Multiple Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Towards Verifiable Text Generation with Symbolic References.
CoRR, 2023

Closing the Gap in High-Risk Pregnancy Care Using Machine Learning and Human-AI Collaboration.
CoRR, 2023

Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks.
CoRR, 2023

Effective Human-AI Teams via Learned Natural Language Rules and Onboarding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes.
Proceedings of the Machine Learning for Healthcare Conference, 2023

Large-Scale Study of Temporal Shift in Health Insurance Claims.
Proceedings of the Conference on Health, Inference, and Learning, 2023

Who Should Predict? Exact Algorithms For Learning to Defer to Humans.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Falsification of Internal and External Validity in Observational Studies via Conditional Moment Restrictions.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

TabLLM: Few-shot Classification of Tabular Data with Large Language Models.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Conformalized Unconditional Quantile Regression.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Generalization Bounds and Representation Learning for Estimation of Potential Outcomes and Causal Effects.
J. Mach. Learn. Res., 2022

Large Language Models are Zero-Shot Clinical Information Extractors.
CoRR, 2022

Evaluating Robustness to Dataset Shift via Parametric Robustness Sets.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Training Subset Selection for Weak Supervision.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Falsification before Extrapolation in Causal Effect Estimation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ETAB: A Benchmark Suite for Visual Representation Learning in Echocardiography.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Co-training Improves Prompt-based Learning for Large Language Models.
Proceedings of the International Conference on Machine Learning, 2022

Sample Efficient Learning of Predictors that Complement Humans.
Proceedings of the International Conference on Machine Learning, 2022

Large language models are few-shot clinical information extractors.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Using time-series privileged information for provably efficient learning of prediction models.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Leveraging Time Irreversibility with Order-Contrastive Pre-training.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Teaching Humans When to Defer to a Classifier via Exemplars.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Clustering Interval-Censored Time-Series for Disease Phenotyping.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Pulse of the pandemic: Iterative topic filtering for clinical information extraction from social media.
J. Biomed. Informatics, 2021

Teaching Humans When To Defer to a Classifier via Examplars.
CoRR, 2021

Clustering Left-Censored Multivariate Time-Series.
CoRR, 2021

MedKnowts: Unified Documentation and Information Retrieval for Electronic Health Records.
Proceedings of the UIST '21: The 34th Annual ACM Symposium on User Interface Software and Technology, 2021

Finding Regions of Heterogeneity in Decision-Making via Expected Conditional Covariance.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Directing Human Attention in Event Localization for Clinical Timeline Creation.
Proceedings of the Machine Learning for Healthcare Conference, 2021

Regularizing towards Causal Invariance: Linear Models with Proxies.
Proceedings of the 38th International Conference on Machine Learning, 2021

Graph Cuts Always Find a Global Optimum for Potts Models (With a Catch).
Proceedings of the 38th International Conference on Machine Learning, 2021

Neural Pharmacodynamic State Space Modeling.
Proceedings of the 38th International Conference on Machine Learning, 2021

Assessing the Impact of Automated Suggestions on Decision Making: Domain Experts Mediate Model Errors but Take Less Initiative.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

PClean: Bayesian Data Cleaning at Scale with Domain-Specific Probabilistic Programming.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Beyond Perturbation Stability: LP Recovery Guarantees for MAP Inference on Noisy Stable Instances.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Deep Contextual Clinical Prediction with Reverse Distillation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Derivation and validation of a machine learning record linkage algorithm between emergency medical services and the emergency department.
J. Am. Medical Informatics Assoc., 2020

Graph cuts always find a global optimum (with a catch).
CoRR, 2020

Trajectory Inspection: A Method for Iterative Clinician-Driven Design of Reinforcement Learning Studies.
CoRR, 2020

Deep Contextual Clinical Prediction with Reverse Distillation.
CoRR, 2020

Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health KnowledgeGraph.
Proceedings of the Pacific Symposium on Biocomputing 2020, 2020

Knowledge Base Completion for Constructing Problem-Oriented Medical Records.
Proceedings of the Machine Learning for Healthcare Conference, 2020

Fast, Structured Clinical Documentation via Contextual Autocomplete.
Proceedings of the Machine Learning for Healthcare Conference, 2020

Robust Benchmarking for Machine Learning of Clinical Entity Extraction.
Proceedings of the Machine Learning for Healthcare Conference, 2020

Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Consistent Estimators for Learning to Defer to an Expert.
Proceedings of the 37th International Conference on Machine Learning, 2020

Estimation of Bounds on Potential Outcomes For Decision Making.
Proceedings of the 37th International Conference on Machine Learning, 2020

Empirical Study of the Benefits of Overparameterization in Learning Latent Variable Models.
Proceedings of the 37th International Conference on Machine Learning, 2020

Characterization of Overlap in Observational Studies.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Train and Test Tightness of LP Relaxations in Structured Prediction.
J. Mach. Learn. Res., 2019

Improving documentation of presenting problems in the emergency department using a domain-specific ontology and machine learning-driven user interfaces.
Int. J. Medical Informatics, 2019

Estimation of Utility-Maximizing Bounds on Potential Outcomes.
CoRR, 2019

Open Set Medical Diagnosis.
CoRR, 2019

Robustly Extracting Medical Knowledge from EHRs: A Case Study of Learning a Health Knowledge Graph.
CoRR, 2019

Benefits of Overparameterization in Single-Layer Latent Variable Generative Models.
CoRR, 2019

Few-Shot Learning for Dermatological Disease Diagnosis.
Proceedings of the Machine Learning for Healthcare Conference, 2019

Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models.
Proceedings of the 36th International Conference on Machine Learning, 2019

Overcomplete Independent Component Analysis via SDP.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Block Stability for MAP Inference.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Support and Invertibility in Domain-Invariant Representations.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Prototypical Clustering Networks for Dermatological Disease Diagnosis.
CoRR, 2018

Evaluating Reinforcement Learning Algorithms in Observational Health Settings.
CoRR, 2018

Learning topic models - provably and efficiently.
Commun. ACM, 2018

Max-margin learning with the Bayes factor.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Cell-specific prediction and application of drug-induced gene expression .
Proceedings of the Biocomputing 2018: Proceedings of the Pacific Symposium, 2018

Why Is My Classifier Discriminatory?
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Semi-Amortized Variational Autoencoders.
Proceedings of the 35th International Conference on Machine Learning, 2018

Optimality of Approximate Inference Algorithms on Stable Instances.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Alpha-expansion is Exact on Stable Instances.
CoRR, 2017

Grounded Recurrent Neural Networks.
CoRR, 2017

Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning.
CoRR, 2017

Causal Effect Inference with Deep Latent-Variable Models.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Estimating individual treatment effect: generalization bounds and algorithms.
Proceedings of the 34th International Conference on Machine Learning, 2017

Simultaneous Learning of Trees and Representations for Extreme Classification and Density Estimation.
Proceedings of the 34th International Conference on Machine Learning, 2017

Electronic phenotyping with APHRODITE and the Observational Health Sciences and Informatics (OHDSI) data network.
Proceedings of the Summit on Clinical Research Informatics, 2017

Objective assessment of depressive symptoms with machine learning and wearable sensors data.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

Structured Inference Networks for Nonlinear State Space Models.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Electronic medical record phenotyping using the anchor and learn framework.
J. Am. Medical Informatics Assoc., 2016

Bounding and Minimizing Counterfactual Error.
CoRR, 2016

Multi-task Prediction of Disease Onsets from Longitudinal Lab Tests.
CoRR, 2016

Simultaneous Learning of Trees and Representations for Extreme Classification, with Application to Language Modeling.
CoRR, 2016

Recurrent Neural Networks for Multivariate Time Series with Missing Values.
CoRR, 2016

Multi-task Prediction of Disease Onsets from Longitudinal Laboratory Tests.
Proceedings of the 1st Machine Learning in Health Care, 2016

Identifiable Phenotyping using Constrained Non-Negative Matrix Factorization.
Proceedings of the 1st Machine Learning in Health Care, 2016

Clinical Tagging with Joint Probabilistic Models.
Proceedings of the 1st Machine Learning in Health Care, 2016

Train and Test Tightness of LP Relaxations in Structured Prediction.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Learning Representations for Counterfactual Inference.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Learning Low-Dimensional Representations of Medical Concepts.
Proceedings of the Summit on Clinical Research Informatics, 2016

Comparison of Approaches for Heart Failure Case Identification from EHR Data.
Proceedings of the AMIA 2016, 2016

Data Mining for Medical Informatics (DMMI) - Learning Health.
Proceedings of the AMIA 2016, 2016

Tightness of LP Relaxations for Almost Balanced Models.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

Character-Aware Neural Language Models.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Temporal Convolutional Neural Networks for Diagnosis from Lab Tests.
CoRR, 2015

On the Tightness of LP Relaxations for Structured Prediction.
CoRR, 2015

Deep Kalman Filters.
CoRR, 2015

Anchored Discrete Factor Analysis.
CoRR, 2015

Incorporating Type II Error Probabilities from Independence Tests into Score-Based Learning of Bayesian Network Structure.
CoRR, 2015

Population-Level Prediction of Type 2 Diabetes From Claims Data and Analysis of Risk Factors.
Big Data, 2015

Barrier Frank-Wolfe for Marginal Inference.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

A Fast Variational Approach for Learning Markov Random Field Language Models.
Proceedings of the 32nd International Conference on Machine Learning, 2015

How Hard is Inference for Structured Prediction?
Proceedings of the 32nd International Conference on Machine Learning, 2015

Gaussian Processes for interpreting Multiple Prostate Specific Antigen measurements for Prostate Cancer Prediction.
Proceedings of the AMIA 2015, 2015

Visual Exploration of Temporal Data in Electronic Medical Records.
Proceedings of the AMIA 2015, 2015

2014
Tight Error Bounds for Structured Prediction.
CoRR, 2014

Understanding the Bethe Approximation: When and How can it go Wrong?
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Lifted Tree-Reweighted Variational Inference.
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Unsupervised learning of disease progression models.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Instance Segmentation of Indoor Scenes Using a Coverage Loss.
Proceedings of the Computer Vision - ECCV 2014, 2014

Using Anchors to Estimate Clinical State without Labeled Data.
Proceedings of the AMIA 2014, 2014

2013
Unsupervised Learning of Noisy-Or Bayesian Networks.
Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

SparsityBoost: A New Scoring Function for Learning Bayesian Network Structure.
Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Discovering Hidden Variables in Noisy-Or Networks using Quartet Tests.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

A Practical Algorithm for Topic Modeling with Provable Guarantees.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
Probabilistic models for personalizing web search.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

Efficiently Searching for Frustrated Cycles in MAP Inference.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

2011
Complexity of Inference in Latent Dirichlet Allocation.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Personalizing web search results by reading level.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
Approximate inference in graphical models using linear programming relaxations.
PhD thesis, 2010

Learning Bayesian Network Structure using LP Relaxations.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

More data means less inference: A pseudo-max approach to structured learning.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Learning Efficiently with Approximate Inference via Dual Losses.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

On Dual Decomposition and Linear Programming Relaxations for Natural Language Processing.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Dual Decomposition for Parsing with Non-Projective Head Automata.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

2009
Tree Block Coordinate Descent for MAP in Graphical Models.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Scaling all-pairs overlay routing.
Proceedings of the 2009 ACM Conference on Emerging Networking Experiments and Technology, 2009

2008
Tightening LP Relaxations for MAP using Message Passing.
Proceedings of the UAI 2008, 2008

Clusters and Coarse Partitions in LP Relaxations.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

2007
Probabilistic Modeling of Systematic Errors in Two-Hybrid Experiments.
Proceedings of the Biocomputing 2007, 2007

New Outer Bounds on the Marginal Polytope.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

2005
BLOG: Probabilistic Models with Unknown Objects.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Approximate Inference for Infinite Contingent Bayesian Networks.
Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, 2005


  Loading...