Zachary C. Lipton

Orcid: 0000-0002-3824-4241

Affiliations:
  • Carnegie Mellon University, Machine Learning Department, Pittsburgh, PA, USA
  • University of California, San Diego, CA, USA (PhD 2017)


According to our database1, Zachary C. Lipton authored at least 180 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Resolving the Human-Subjects Status of ML's Crowdworkers.
Commun. ACM, May, 2024

Resolving the Human-subjects Status of Machine Learning's Crowdworkers: What ethical framework should govern the interaction of ML researchers and crowdworkers?
ACM Queue, 2024

Failure Modes of LLMs for Causal Reasoning on Narratives.
CoRR, 2024

Towards characterizing the value of edge embeddings in Graph Neural Networks.
CoRR, 2024

The Fragility of Fairness: Causal Sensitivity Analysis for Fair Machine Learning.
CoRR, 2024

A Unified Causal Framework for Auditing Recommender Systems for Ethical Concerns.
CoRR, 2024

LLM-Select: Feature Selection with Large Language Models.
CoRR, 2024

Understanding Hallucinations in Diffusion Models through Mode Interpolation.
CoRR, 2024

Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions.
CoRR, 2024

Rethinking LLM Memorization through the Lens of Adversarial Compression.
CoRR, 2024

Post-Hoc Reversal: Are We Selecting Models Prematurely?
CoRR, 2024

GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence.
CoRR, 2024

Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing.
CoRR, 2024

Contrastive Multiple Instance Learning for Weakly Supervised Person ReID.
CoRR, 2024

Personalized Language Modeling from Personalized Human Feedback.
CoRR, 2024

Red-Teaming for Generative AI: Silver Bullet or Security Theater?
CoRR, 2024

TOFU: A Task of Fictitious Unlearning for LLMs.
CoRR, 2024

Foundations of Testing for Finite-Sample Causal Discovery.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

T-MARS: Improving Visual Representations by Circumventing Text Feature Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Time-Varying Propensity Score to Bridge the Gap between the Past and Present.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

The Impact of Differential Feature Under-reporting on Algorithmic Fairness.
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Evaluating the Factuality of Zero-shot Summarizers Across Varied Domains.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Goodhart's Law Applies to NLP's Explanation Benchmarks.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Scaling Laws for Data Filtering - Data Curation Cannot be Compute Agnostic.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

The Future of Web Data Mining: Insights from Multimodal and Code-based Extraction Methods.
Proceedings of the 7th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2024

Partially Interpretable Models with Guarantees on Coverage and Accuracy.
Proceedings of the International Conference on Algorithmic Learning Theory, 2024

Timing as an Action: Learning When to Observe and Act.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Auditing Fairness under Unobserved Confounding.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation.
Trans. Mach. Learn. Res., 2023

Perspectives on the State and Future of Deep Learning - 2023.
CoRR, 2023

MoCo-Transfer: Investigating out-of-distribution contrastive learning for limited-data domains.
CoRR, 2023

Reading Between the Mud: A Challenging Motorcycle Racer Number Dataset.
CoRR, 2023

MUDD: A New Re-Identification Dataset with Efficient Annotation for Off-Road Racers in Extreme Conditions.
CoRR, 2023

PromptNER: Prompting For Named Entity Recognition.
CoRR, 2023

Users are the North Star for AI Transparency.
CoRR, 2023

Discovering Optimal Scoring Mechanisms in Causal Strategic Prediction.
CoRR, 2023

Meta-Learning Mini-Batch Risk Functionals.
CoRR, 2023

Risk-limiting financial audits via weighted sampling without replacement.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Deep Equilibrium Based Neural Operators for Steady-State PDEs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Online Label Shift: Optimal Dynamic Regret meets Practical Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reliable deep learning in dynamic environments.
Proceedings of the Medical Imaging 2023: Computer-Aided Diagnosis, San Diego, 2023

Valla: Standardizing and Benchmarking Authorship Attribution and Verification Through Empirical Evaluation and Comparative Analysis.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets.
Proceedings of the International Conference on Machine Learning, 2023

Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective.
Proceedings of the International Conference on Machine Learning, 2023

Can Neural Network Memorization Be Localized?
Proceedings of the International Conference on Machine Learning, 2023

RLSbench: Domain Adaptation Under Relaxed Label Shift.
Proceedings of the International Conference on Machine Learning, 2023

Disentangling the Mechanisms Behind Implicit Regularization in SGD.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DeepDrake ft. BTS-GAN and TayloRVC: An Exploratory Analysis of Musical Deepfakes and Hosting Platforms.
Proceedings of the 2nd Workshop on Human-Centric Music Information Retrieval 2023 co-located with the 24th International Society for Music Information Retrieval Conference (ISMIR 2023), 2023

Model-tuning Via Prompts Makes NLP Models Adversarially Robust.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

USB: A Unified Summarization Benchmark Across Tasks and Domains.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Local Causal Discovery for Estimating Causal Effects.
Proceedings of the Conference on Causal Learning and Reasoning, 2023

Evaluating Model Performance in Medical Datasets Over Time.
Proceedings of the Conference on Health, Inference, and Learning, 2023

A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Domain Adaptation under Missingness Shift.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

From Preference Elicitation to Participatory ML: A Critical Survey & Guidelines for Future Research.
Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023

Downstream Datasets Make Surprisingly Good Pretraining Corpora.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Moral Machine or Tyranny of the Majority?
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Evaluating Explanations: How Much Do Explanations from the Teacher Aid Students?
Trans. Assoc. Comput. Linguistics, 2022

Model Evaluation in Medical Datasets Over Time.
CoRR, 2022

Data drift correction via time-varying importance weight estimator.
CoRR, 2022

On the State of the Art in Authorship Attribution and Authorship Verification.
CoRR, 2022

Resolving the Human Subjects Status of Machine Learning's Crowdworkers.
CoRR, 2022

Can Transformers be Strong Treatment Effect Estimators?
CoRR, 2022

Unsupervised Learning under Latent Label Shift.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Characterizing Datapoints via Second-Split Forgetting.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Domain Adaptation under Open Set Label Shift.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Supervised Learning with General Risk Functionals.
Proceedings of the International Conference on Machine Learning, 2022

Leveraging unlabeled data to predict out-of-distribution performance.
Proceedings of the Tenth International Conference on Learning Representations, 2022

On the Maximum Hessian Eigenvalue and Generalization.
Proceedings of the Proceedings on "I Can't Believe It's Not Better!, 2022

Homophily and Incentive Effects in Use of Algorithms.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Practical Benefits of Feature Feedback Under Distribution Shift.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

Learning Clinical Concepts for Predicting Risk of Progression to Severe COVID-19.
Proceedings of the AMIA 2022, 2022

Off-Policy Risk Assessment for Markov Decision Processes.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Modeling Attrition in Recommender Systems with Departing Bandits.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
When Curation Becomes Creation: Algorithms, microcontent, and the vanishing distinction between platforms and creators.
ACM Queue, 2021

The Impact of Algorithmic Risk Assessments on Human Predictions and its Analysis via Crowdsourcing Studies.
Proc. ACM Hum. Comput. Interact., 2021

Dive into Deep Learning.
CoRR, 2021

On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk.
CoRR, 2021

When curation becomes creation.
Commun. ACM, 2021

Estimating treatment effects with observed confounders and mediators.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Parametric Complexity Bounds for Approximating PDEs with Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Rebounding Bandits for Modeling Satiation Effects.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Off-Policy Risk Assessment in Contextual Bandits.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Efficient Online Estimation of Causal Effects by Deciding What to Observe.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mixture Proportion Estimation and PU Learning: A Modern Approach.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Correcting Exposure Bias for Link Recommendation.
Proceedings of the 38th International Conference on Machine Learning, 2021

On Proximal Policy Optimization's Heavy-tailed Gradients.
Proceedings of the 38th International Conference on Machine Learning, 2021

RATT: Leveraging Unlabeled Data to Guarantee Generalization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Explaining the Efficacy of Counterfactually Augmented Data.
Proceedings of the 9th International Conference on Learning Representations, 2021

Unsupervised Data Augmentation with Naive Augmentation and without Unlabeled Data.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Does Pretraining for Summarization Require Knowledge Transfer?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Siamese Bert for Authorship Verification.
Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

Unpacking the Drop in COVID-19 Case Fatality Rates: A Study of National and Florida Line-Level Data.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021, 2021

Causal Inference with Selectively Deconfounded Data.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

On the Validity of Arrest as a Proxy for Offense: Race and the Likelihood of Arrest for Violent Crimes.
Proceedings of the AIES '21: AAAI/ACM Conference on AI, 2021

Fair Machine Learning Under Partial Compliance.
Proceedings of the AIES '21: AAAI/ACM Conference on AI, 2021

Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Symbolic Music Generation with Transformer-GANs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Tensor Regression Networks.
J. Mach. Learn. Res., 2020

Evaluating Explanations: How much do explanations from the teacher aid students?
CoRR, 2020

Decoding and Diversity in Machine Translation.
CoRR, 2020

Extracting Structured Data from Physician-Patient Conversations By Predicting Noteworthy Utterances.
CoRR, 2020

Predicting Mortality Risk in Viral and Unspecified Pneumonia to Assist Clinicians with COVID-19 ECMO Planning.
CoRR, 2020

Generating SOAP Notes from Doctor-Patient Conversations.
CoRR, 2020

How Transferable are the Representations Learned by Deep Q Agents?
CoRR, 2020

Machine Learning for Healthcare: Beyond i.i.d. Prediction.
Proceedings of the ACM WSDM 2020 Health Search and Data Mining Workshop, 2020

SIGMOD 2020 Tutorial on Fairness and Bias in Peer Review and Other Sociotechnical Intelligent Systems.
Proceedings of the 2020 International Conference on Management of Data, 2020

A Unified View of Label Shift Estimation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Uncertainty-Aware Lookahead Factor Models for Quantitative Investing.
Proceedings of the 37th International Conference on Machine Learning, 2020

Learning The Difference That Makes A Difference With Counterfactually-Augmented Data.
Proceedings of the 8th International Conference on Learning Representations, 2020

Efficient Candidate Screening Under Multiple Tests and Implications for Fairness.
Proceedings of the 1st Symposium on Foundations of Responsible Computing, 2020

On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Weakly- and Semi-supervised Evidence Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Mortality Risk Score for Critically Ill Patients with Viral or Unspecified Pneumonia: Assisting Clinicians with COVID-19 ECMO Planning.
Proceedings of the Artificial Intelligence in Medicine, 2020

Algorithmic Fairness from a Non-ideal Perspective.
Proceedings of the AIES '20: AAAI/ACM Conference on AI, 2020

Learning to Deceive with Attention-Based Explanations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Troubling Trends in Machine Learning Scholarship.
ACM Queue, 2019

Are Perceptually-Aligned Gradients a General Property of Robust Classifiers?
CoRR, 2019

Accelerating Deep Learning by Focusing on the Biggest Losers.
CoRR, 2019

Estimating brain age based on a healthy population with deep learning and structural MRI.
CoRR, 2019

Learning Causal State Representations of Partially Observable Environments.
CoRR, 2019

Temporal-Clustering Invariance in Irregular Healthcare Time Series.
CoRR, 2019

Research for practice: troubling trends in machine-learning scholarship.
Commun. ACM, 2019

Game Design for Eliciting Distinguishable Behavior.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Robust Global Representations by Penalizing Local Predictive Power.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Embryo Staging with Weakly-Supervised Region Selection and Dynamically-Decoded Predictions.
Proceedings of the Machine Learning for Healthcare Conference, 2019

AmazonQA: A Review-Based Question Answering Task.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Domain Adaptation with Asymmetrically-Relaxed Distribution Alignment.
Proceedings of the 36th International Conference on Machine Learning, 2019

What is the Effect of Importance Weighting in Deep Learning?
Proceedings of the 36th International Conference on Machine Learning, 2019

Learning Robust Representations by Projecting Superficial Statistics Out.
Proceedings of the 7th International Conference on Learning Representations, 2019

Active Learning with Partial Feedback.
Proceedings of the 7th International Conference on Learning Representations, 2019

Practical Obstacles to Deploying Active Learning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Entity Projection via Machine Translation for Cross-Lingual NER.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Combating Adversarial Misspellings with Robust Word Recognition.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
The Mythos of Model Interpretability.
ACM Queue, 2018

Weighted Risk Minimization & Deep Learning.
CoRR, 2018

How transferable are the datasets collected by active learners?
CoRR, 2018

Sample-Efficient Deep RL with Generative Adversarial Tree Search.
CoRR, 2018

Correction by Projection: Denoising Images with Generative Adversarial Networks.
CoRR, 2018

Learning Noise-Invariant Representations for Robust Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Does mitigating ML's impact disparity require treatment disparity?
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Detecting and Correcting for Label Shift with Black Box Predictors.
Proceedings of the 35th International Conference on Machine Learning, 2018

Born-Again Neural Networks.
Proceedings of the 35th International Conference on Machine Learning, 2018

Predicting Embryo Morphokinetics in Videos with Late Fusion Nets & Dynamic Decoders.
Proceedings of the 6th International Conference on Learning Representations, 2018

Learning From Noisy Singly-labeled Data.
Proceedings of the 6th International Conference on Learning Representations, 2018

Semantically Decomposing the Latent Spaces of Generative Adversarial Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Stochastic Activation Pruning for Robust Adversarial Defense.
Proceedings of the 6th International Conference on Learning Representations, 2018

Deep Bayesian Active Learning for Natural Language Processing: Results of a Large-Scale Empirical Study.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

How Much Reading Does Reading Comprehension Require? A Critical Investigation of Popular Benchmarks.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learning from Temporally-Structured Human Activities Data.
PhD thesis, 2017

Does mitigating ML's disparate impact require disparate treatment?
CoRR, 2017

Improving Factor-Based Quantitative Investing by Forecasting Company Fundamentals.
CoRR, 2017

Tensor Regression Networks.
CoRR, 2017

Deep Active Learning for Named Entity Recognition.
Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

Predicting Surgery Duration with Neural Heteroscedastic Regression.
Proceedings of the Machine Learning for Health Care Conference, 2017

Estimating Reactions and Recommending Products with Generative Models of Reviews.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Dance Dance Convolution.
Proceedings of the 34th International Conference on Machine Learning, 2017

Precise Recovery of Latent Vectors from Generative Adversarial Networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

Tensor Contraction Layers for Parsimonious Deep Nets.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Learning to Diagnose with LSTM Recurrent Neural Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016

Efficient Exploration for Dialog Policy Learning with Deep BBQ Networks \& Replay Buffer Spiking.
CoRR, 2016

Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear.
CoRR, 2016

Stuck in a What? Adventures in Weight Space.
CoRR, 2016

A User Simulator for Task-Completion Dialogues.
CoRR, 2016

Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series.
Proceedings of the 1st Machine Learning in Health Care, 2016

Context Matters: Refining Object Detection in Video with Recurrent Neural Networks.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
Capturing Meaning in Product Reviews with Character-Level Generative Text Models.
CoRR, 2015

Phenotyping of Clinical Time Series with LSTM Recurrent Neural Networks.
CoRR, 2015

Efficient Elastic Net Regularization for Sparse Linear Models.
CoRR, 2015

A Critical Review of Recurrent Neural Networks for Sequence Learning.
CoRR, 2015

2014
F1-Optimal Thresholding in the Multi-Label Setting.
CoRR, 2014

Differential Privacy and Machine Learning: a Survey and Review.
CoRR, 2014

Optimal Thresholding of Classifiers to Maximize F1 Measure.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014


  Loading...