CoRR, 2022

Building Human Values into Recommender Systems: An Interdisciplinary Synthesis.

[DOI]

CoRR, 2022

How to talk so your robot will learn: Instructions, descriptions, and pragmatics.

[DOI]

CoRR, 2022

Linguistic communication as (inverse) reward design.

[DOI]

CoRR, 2022

Towards Psychologically-Grounded Dynamic Preference Models.

[DOI]

Mihaela Curmei

Andreas A. Haupt

Benjamin Recht

Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022

How to talk so AI will learn: Instructions, descriptions, and autonomy.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Robust Feature-Level Adversaries are Interpretability Tools.

[DOI]

Stephen Casper

Max Nadeau

Gabriel Kreiman

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Estimating and Penalizing Induced Preference Shifts in Recommender Systems.

[DOI]

Micah D. Carroll

Proceedings of the International Conference on Machine Learning, 2022

A Penalty Default Approach to Preemptive Harm Disclosure and Mitigation for AI Systems.

[DOI]

Rui-Jie Yew

Proceedings of the AIES '22: AAAI/ACM Conference on AI, Ethics, and Society, Oxford, United Kingdom, May 19, 2022

2021

When Curation Becomes Creation: Algorithms, microcontent, and the vanishing distinction between platforms and creators.

[DOI]

Liu Leqi

Zachary C. Lipton

ACM Queue, 2021

What are you optimizing for? Aligning Recommender Systems with Human Values.

[DOI]

CoRR, 2021

When curation becomes creation.

[DOI]

Liu Leqi

Zachary C. Lipton

Commun. ACM, 2021

Estimating and Penalizing Preference Shift in Recommender Systems.

[DOI]

Micah Carroll

Proceedings of the RecSys '21: Fifteenth ACM Conference on Recommender Systems, Amsterdam, The Netherlands, 27 September 2021, 2021

Guided Imitation of Task and Motion Planning.

[DOI]

Michael James McDonald

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020

Multi-Principal Assistance Games: Definition and Collegial Mechanisms.

[DOI]

Arnaud Fickinger

Simon Zhuang

Andrew Critch

CoRR, 2020

Multi-Principal Assistance Games.

[DOI]

Arnaud Fickinger

Simon Zhuang

CoRR, 2020

Consequences of Misaligned AI.

[DOI]

Simon Zhuang

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Silly Rules Improve the Capacity of Agents to Learn Stable Enforcement and Compliance Behaviors.

[DOI]

Raphael Koster

Gillian K. Hadfield

Joel Z. Leibo

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Conservative Agency via Attainable Utility Preservation.

[DOI]

Alexander Matt Turner

Prasad Tadepalli

Proceedings of the AIES '20: AAAI/ACM Conference on AI, 2020

2019

An Extensible Interactive Interface for Agent Design.

[DOI]

Matthew Rahtz

James Fang

CoRR, 2019

Adversarial Training with Voronoi Constraints.

[DOI]

Marc Khoury

CoRR, 2019

Conservative Agency.

[DOI]

Alexander Matt Turner

Prasad Tadepalli

Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the 28th International Joint Conference on Artificial Intelligence, 2019

On the Utility of Model Learning in HRI.

[DOI]

Rohan Choudhury

Gokul Swamy

Proceedings of the 14th ACM/IEEE International Conference on Human-Robot Interaction, 2019

The Assistive Multi-Armed Bandit.

[DOI]

Lawrence Chan

Siddhartha S. Srinivasa

Proceedings of the 14th ACM/IEEE International Conference on Human-Robot Interaction, 2019

Human-AI Learning Performance in Multi-Armed Bandits.

[DOI]

Ravi Pandya

Sandy H. Huang

Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019

Incomplete Contracting and AI Alignment.

[DOI]

Gillian K. Hadfield

Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019

Legible Normativity for AI Alignment: The Value of Silly Rules.

[DOI]

McKane Andrus

Gillian K. Hadfield

Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society, 2019

2018

On the Geometry of Adversarial Examples.

[DOI]

Marc Khoury

CoRR, 2018

Active Inverse Reward Design.

[DOI]

Sören Mindermann

Rohin Shah

Adam Gleave

CoRR, 2018

Simplifying Reward Design through Divide-and-Conquer.

[DOI]

Ellis Ratner

Proceedings of the Robotics: Science and Systems XIV, 2018

An Efficient, Generalized Bellman Update For Cooperative Inverse Reinforcement Learning.

[DOI]

Dhruv Malik

Malayandi Palaniappan

Jaime F. Fisac

Proceedings of the 35th International Conference on Machine Learning, 2018

2017

Inverse Reward Design.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Pragmatic-Pedagogic Value Alignment.

[DOI]

Malayandi Palaniappan

Proceedings of the Robotics Research, The 18th International Symposium, 2017

Should Robots be Obedient?

[DOI]

Smitha Milli

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Expressive Robot Motion Timing.

[DOI]

Allan Zhou

Anusha Nagabandi

Proceedings of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, 2017

The Off-Switch Game.

[DOI]

Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Cooperative Inverse Reinforcement Learning.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Sequential quadratic programming for task plan optimization.

[DOI]

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Guided search for task and motion plans using learned heuristics.

[DOI]

Rohan Chitnis

Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

2015

Multitasking: Optimal Planning for Bandit Superprocesses.

[DOI]

Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Modular task and motion planning in belief space.

[DOI]

Edward Groshev

Rohan Chitnis

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Beyond lowest-warping cost action selection in trajectory transfer.

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2015

2014

Unifying scene registration and trajectory optimization for learning from demonstrations with application to manipulation of deformable objects.

[DOI]

Alex X. Lee

Sandy H. Huang

Eric Tzeng

Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

2013

Optimization in the now: Dynamic peephole optimization for hierarchical planning.

[DOI]