Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Iterative Option Discovery for Planning, by Planning.
CoRR, 2023
The Benefits of Model-Based Generalization in Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023
Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
Hindsight Network Credit Assignment.
CoRR, 2020
Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning.
CoRR, 2020
Variance Reduced Advantage Estimation with δ Hindsight Credit Assignment.
CoRR, 2019
MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments.
CoRR, 2019
Metatrace Actor-Critic: Online Step-Size Tuning by Meta-gradient Descent for Reinforcement Learning Control.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling.
CoRR, 2018
Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control.
CoRR, 2018
Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods.
CoRR, 2018
Comparing Direct and Indirect Temporal-Difference Methods for Estimating the Variance of the Return.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018
NeuroHex: A Deep Q-learning Hex Agent.
Proceedings of the 5th Workshop on Computer Games, 2016
Proceedings of the 9th International Conference on Computers and Games, 2016