2025
AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, March, 2025

Moving Beyond Medical Exam Questions: A Clinician-Annotated Dataset of Real-World Tasks and Ambiguity in Mental Healthcare.
CoRR, February, 2025

2024
Building Human Values into Recommender Systems: An Interdisciplinary Synthesis.
Trans. Recomm. Syst., September, 2024

Risks from Language Models for Automated Mental Healthcare: Ethics and Structure for Implementation.
CoRR, 2024

Risks from Language Models for Automated Mental Healthcare: Ethics and Structure for Implementation (Extended Abstract).
Proceedings of the Seventh AAAI/ACM Conference on AI, Ethics, and Society (AIES-24) - Full Archival Papers, October 21-23, 2024, San Jose, California, USA, 2024

2022
Building Human Values into Recommender Systems: An Interdisciplinary Synthesis.
CoRR, 2022