2023
Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States.
CoRR, 2022

2020
Policy Evaluation Networks.
CoRR, 2020

2018
The Barbados 2018 List of Open Issues in Continual Learning.
CoRR, 2018

When Waiting Is Not an Option: Learning Options With a Deliberation Cost.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learnings Options End-to-End for Continuous Action Tasks.
CoRR, 2017

Investigating Recurrence and Eligibility Traces in Deep Q-Networks.
CoRR, 2017

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Option-Critic Architecture.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017