Hado van Hasselt

Affiliations:
  • Google DeepMind, London, UK
  • Utrecht University, The Netherlands (PhD 2011)


According to our database1, Hado van Hasselt authored at least 73 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Disentangling the Causes of Plasticity Loss in Neural Networks.
CoRR, 2024

2023
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning.
CoRR, 2023

A Definition of Continual Reinforcement Learning.
CoRR, 2023

On the Convergence of Bounded Agents.
CoRR, 2023

Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration.
CoRR, 2023

Optimistic Meta-Gradients.
CoRR, 2023

Optimistic Meta-Gradients.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Definition of Continual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Human-level Atari 200x faster.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Exploration via Epistemic Value Estimation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Selective Credit Assignment.
CoRR, 2022

Learning by Directional Gradient Descent.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Bootstrapped Meta-Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Chaining Value Functions for Off-Policy Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Introducing Symmetries to Black Box Meta Reinforcement Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Learning Expected Emphatic Traces for Deep RL.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Podracer architectures for scalable Reinforcement Learning.
CoRR, 2021

Synthetic Returns for Long-Term Credit Assignment.
CoRR, 2021

Discovery of Options via Meta-Learned Subgoals.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-Consistent Models and Values.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Emphatic Algorithms for Deep Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Muesli: Combining Improvements in Policy Optimization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Expected Eligibility Traces.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Self-Tuning Deep Reinforcement Learning.
CoRR, 2020

A Self-Tuning Actor-Critic Algorithm.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Meta-Gradient Reinforcement Learning with an Objective Discovered Online.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Discovering Reinforcement Learning Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Forethought and Hindsight in Credit Assignment.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

What Can Learned Intrinsic Rewards Capture?
Proceedings of the 37th International Conference on Machine Learning, 2020

Behaviour Suite for Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Conditional Importance Sampling for Off-Policy Learning.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
General non-linear Bellman equations.
CoRR, 2019

On Inductive Biases in Deep Reinforcement Learning.
CoRR, 2019

Meta-learning of Sequential Strategies.
CoRR, 2019

Discovery of Useful Questions as Auxiliary Tasks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

When to use parametric models in reinforcement learning?
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Hindsight Credit Assignment.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Universal Successor Features Approximators.
Proceedings of the 7th International Conference on Learning Representations, 2019

Multi-Task Deep Reinforcement Learning with PopArt.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Deep Reinforcement Learning and the Deadly Triad.
CoRR, 2018

The Barbados 2018 List of Open Issues in Continual Learning.
CoRR, 2018

Observe and Look Further: Achieving Consistent Performance on Atari.
CoRR, 2018

Unicorn: Continual Learning with a Universal, Off-policy Agent.
CoRR, 2018

Meta-Gradient Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems.
Proceedings of the 35th International Conference on Machine Learning, 2018

Distributed Prioritized Experience Replay.
Proceedings of the 6th International Conference on Learning Representations, 2018

Rainbow: Combining Improvements in Deep Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
StarCraft II: A New Challenge for Reinforcement Learning.
CoRR, 2017

Natural Value Approximators: Learning when to Trust Past Estimates.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Successor Features for Transfer in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Predictron: End-To-End Learning and Planning.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Learning functions across many orders of magnitudes.
CoRR, 2016

Learning values across many orders of magnitude.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Dueling Network Architectures for Deep Reinforcement Learning.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Deep Reinforcement Learning with Double Q-Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Learning to Predict Independent of Span.
CoRR, 2015

2014
Off-policy TD( l) with a true online equivalence.
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Weighted importance sampling for off-policy learning with linear function approximation.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

A new Q(lambda) with interim forward view and Monte Carlo equivalence.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average
CoRR, 2013

Stacking under uncertainty: We know how to predict, but how should we act?
Proceedings of the IEEE Symposium on Computational Intelligence In Production And Logistics Systems, 2013

2012
Reinforcement Learning in Continuous State and Action Spaces.
Proceedings of the Reinforcement Learning, 2012

2011
Insights in reinforcement rearning : formal analysis and empirical evaluation of temporal-difference learning algorithms.
PhD thesis, 2011

Exploiting Best-Match Equations for Efficient Reinforcement Learning.
J. Mach. Learn. Res., 2011

Reinforcement learning algorithms for solving classification problems.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010
Double Q-learning.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

2009
Using continuous action spaces to solve discrete problems.
Proceedings of the International Joint Conference on Neural Networks, 2009

Adaptive Serious Games Using Agent Organizations.
Proceedings of the Agents for Games and Simulations, 2009

The QV family compared to other reinforcement learning algorithms.
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

A theoretical and empirical analysis of Expected Sarsa.
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

2008
Ensemble Algorithms in Reinforcement Learning.
IEEE Trans. Syst. Man Cybern. Part B, 2008

On-line adapting games using agent organizations.
Proceedings of the 2008 IEEE Symposium on Computational Intelligence and Games, 2008


  Loading...