2025

Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models.

[DOI]

Roberto-Rafael Maura-Rivero

Chirag Nagpal

Roma Patel

Francesco Visin

CoRR, January, 2025

2024

InfAlign: Inference-aware language model alignment.

[DOI]

Ananda Theertha Suresh

Ahmad Beirami

CoRR, 2024

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning.

[DOI]

CoRR, 2024

Robust Preference Optimization through Reward Model Distillation.

[DOI]

CoRR, 2024

A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models.

[DOI]

CoRR, 2024

Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation.

[DOI]

CoRR, 2024

Theoretical guarantees on the best-of-n alignment policy.

[DOI]

Ananda Theertha Suresh

CoRR, 2024

Transforming and Combining Rewards for Aligning Large Language Models.

[DOI]

Alexander Nicholas D'Amour

Sanmi Koyejo

Victor Veitch

Proceedings of the Forty-first International Conference on Machine Learning, 2024

The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa.

[DOI]

Mercy Nyamewaa Asiedu

Proceedings of the 4th ACM Conference on Equity and Access in Algorithms, 2024

2023

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking.

[DOI]

CoRR, 2023

Recovering Sparse and Interpretable Subgroups with Heterogeneous Treatment Effects with Censored Time-to-Event Outcomes.

[DOI]

Chirag Nagpal

Vedant Sanil

Artur Dubrawski

CoRR, 2023

Participatory Systems for Personalized Prediction.

[DOI]

CoRR, 2023

Participatory Personalization in Classification.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data.

[DOI]

Chirag Nagpal

Willa Potosnak

Artur Dubrawski

Proceedings of the Machine Learning for Healthcare Conference, 2022

Counterfactual Phenotyping with Censored Time-to-Events.

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021

Deep Survival Machines: Fully Parametric Survival Regression and Representation Learning for Censored Data With Competing Risks.

[DOI]

Chirag Nagpal

Xinyu Li

Artur Dubrawski

IEEE J. Biomed. Health Informatics, 2021

Deep Parametric Time-to-Event Regression with Time-Varying Covariates.

[DOI]

Chirag Nagpal

Vincent Jeanselme

Artur Dubrawski

Proceedings of AAAI Symposium on Survival Prediction, 2021

Deep Cox Mixtures for Survival Regression.

[DOI]

Proceedings of the Machine Learning for Healthcare Conference, 2021

2020

Latent Bayesian Inference for Robust Earnings Estimates.

[DOI]

CoRR, 2020

Interpretable subgroup discovery in treatment effect estimation with application to opioid prescribing guidelines.

[DOI]

Proceedings of the ACM CHIL '20: ACM Conference on Health, 2020

2019

Nonlinear Semi-Parametric Models for Survival Analysis.

[DOI]

CoRR, 2019

Dynamically Personalized Detection of Hemorrhage.

[DOI]

Proceedings of the Machine Learning for Healthcare Conference, 2019

2017

Preserving Intermediate Objectives: One Simple Trick to Improve Learning for Hierarchical Models.

[DOI]

Abhilasha Ravichander

Louis-Philippe Morency

CoRR, 2017

Semi-Supervised Prediction of Comorbid Rare Conditions Using Medical Claims Data.

[DOI]

Proceedings of the 2017 IEEE International Conference on Data Mining Workshops, 2017

An Entity Resolution Approach to Isolate Instances of Human Trafficking Online.

[DOI]

Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

2014

Twitter User Classification using Ambient Metadata.

[DOI]

Chirag Nagpal

Khushboo Singhal

CoRR, 2014