Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models.
CoRR, January, 2025
InfAlign: Inference-aware language model alignment.
CoRR, 2024
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning.
CoRR, 2024
Robust Preference Optimization through Reward Model Distillation.
CoRR, 2024
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation.
CoRR, 2024
Theoretical guarantees on the best-of-n alignment policy.
CoRR, 2024
Transforming and Combining Rewards for Aligning Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
The Case for Globalizing Fairness: A Mixed Methods Study on Colonialism, AI, and Health in Africa.
Proceedings of the 4th ACM Conference on Equity and Access in Algorithms, 2024
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking.
CoRR, 2023
Recovering Sparse and Interpretable Subgroups with Heterogeneous Treatment Effects with Censored Time-to-Event Outcomes.
CoRR, 2023
Participatory Systems for Personalized Prediction.
CoRR, 2023
Participatory Personalization in Classification.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data.
Proceedings of the Machine Learning for Healthcare Conference, 2022
Counterfactual Phenotyping with Censored Time-to-Events.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
Deep Survival Machines: Fully Parametric Survival Regression and Representation Learning for Censored Data With Competing Risks.
IEEE J. Biomed. Health Informatics, 2021
Deep Parametric Time-to-Event Regression with Time-Varying Covariates.
Proceedings of AAAI Symposium on Survival Prediction, 2021
Deep Cox Mixtures for Survival Regression.
Proceedings of the Machine Learning for Healthcare Conference, 2021
Latent Bayesian Inference for Robust Earnings Estimates.
CoRR, 2020
Interpretable subgroup discovery in treatment effect estimation with application to opioid prescribing guidelines.
Proceedings of the ACM CHIL '20: ACM Conference on Health, 2020
Nonlinear Semi-Parametric Models for Survival Analysis.
CoRR, 2019
Dynamically Personalized Detection of Hemorrhage.
Proceedings of the Machine Learning for Healthcare Conference, 2019
Preserving Intermediate Objectives: One Simple Trick to Improve Learning for Hierarchical Models.
CoRR, 2017
Semi-Supervised Prediction of Comorbid Rare Conditions Using Medical Claims Data.
Proceedings of the 2017 IEEE International Conference on Data Mining Workshops, 2017
An Entity Resolution Approach to Isolate Instances of Human Trafficking Online.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017
Twitter User Classification using Ambient Metadata.
CoRR, 2014