Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States.
CoRR, 2022
Policy Evaluation Networks.
CoRR, 2020
The Barbados 2018 List of Open Issues in Continual Learning.
CoRR, 2018
When Waiting Is Not an Option: Learning Options With a Deliberation Cost.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Learnings Options End-to-End for Continuous Action Tasks.
CoRR, 2017
Investigating Recurrence and Eligibility Traces in Deep Q-Networks.
CoRR, 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
The Option-Critic Architecture.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017