2024

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning.

[DOI]

Aditya A. Ramesh

Kenny John Young

Louis Kirsch

Jürgen Schmidhuber

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Iterative Option Discovery for Planning, by Planning.

[DOI]

Kenny Young

Richard S. Sutton

CoRR, 2023

The Benefits of Model-Based Generalization in Reinforcement Learning.

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

2022

Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions.

[DOI]

Tian Tian

Kenny Young

Richard S. Sutton

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units.

[DOI]

Kenny Young

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2020

Hindsight Network Credit Assignment.

[DOI]

Kenny Young

CoRR, 2020

Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning.

[DOI]

Kenny Young

Richard S. Sutton

CoRR, 2020

2019

Variance Reduced Advantage Estimation with δ Hindsight Credit Assignment.

[DOI]

Kenny Young

CoRR, 2019

MinAtar: An Atari-inspired Testbed for More Efficient Reinforcement Learning Experiments.

[DOI]

Kenny Young

Tian Tian

CoRR, 2019

Metatrace Actor-Critic: Online Step-Size Tuning by Meta-gradient Descent for Reinforcement Learning Control.

[DOI]

Kenny Young

Baoxiang Wang

Matthew E. Taylor

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2018

Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling.

[DOI]

Kenny J. Young

Richard S. Sutton

Shuo Yang

CoRR, 2018

Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control.

[DOI]

Kenny Young

Baoxiang Wang

Matthew E. Taylor

CoRR, 2018

Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods.

[DOI]

CoRR, 2018

Comparing Direct and Indirect Temporal-Difference Methods for Estimating the Variance of the Return.

[DOI]

Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

2016

NeuroHex: A Deep Q-learning Hex Agent.

[DOI]

Kenny Young

Gautham Vasan

Ryan Hayward

Proceedings of the Computer Games - 5th Workshop on Computer Games, 2016

A Reverse Hex Solver.

[DOI]

Kenny Young

Ryan B. Hayward

Proceedings of the Computers and Games - 9th International Conference, 2016