We stand with Ukraine

We stand with Ukraine

Hado van Hasselt

Affiliations:

Google DeepMind, London, UK
Utrecht University, The Netherlands (PhD 2011)

According to our database¹, Hado van Hasselt authored at least 73 papers between 2008 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on hadovanhasselt.com
on scholar.google.com

On csauthors.net:

Bibliography

2024

Disentangling the Causes of Plasticity Loss in Neural Networks.

[BibT_eX]

[DOI]

,

,

Khimya Khetarpal

,

Hado van Hasselt

,

,

,

CoRR, 2024

2023

A Survey of Temporal Credit Assignment in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Eduardo Pignatelli

,

,

,

,

Hado van Hasselt

,

CoRR, 2023

A Definition of Continual Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Benjamin Van Roy

,

,

Hado van Hasselt

,

CoRR, 2023

On the Convergence of Bounded Agents.

[BibT_eX]

[DOI]

,

,

Hado van Hasselt

,

Benjamin Van Roy

,

,

CoRR, 2023

Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration.

[BibT_eX]

[DOI]

,

Nan Rosemary Ke

,

Hado van Hasselt

CoRR, 2023

Optimistic Meta-Gradients.

[BibT_eX]

[DOI]

Sebastian Flennerhag

,

,

Brendan O'Donoghue

,

Hado van Hasselt

,

András György

,

CoRR, 2023

Optimistic Meta-Gradients.

[BibT_eX]

[DOI]

Sebastian Flennerhag

,

,

Brendan O'Donoghue

,

Hado Philip van Hasselt

,

András György

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Definition of Continual Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Benjamin Van Roy

,

,

Hado Philip van Hasselt

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Human-level Atari 200x faster.

[BibT_eX]

[DOI]

Steven Kapturowski

,

,

,

Nemanja Rakicevic

,

Hado van Hasselt

,

Charles Blundell

,

Adrià Puigdomènech Badia

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Exploration via Epistemic Value Estimation.

[BibT_eX]

[DOI]

,

John Shawe-Taylor

,

Hado van Hasselt

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Selective Credit Assignment.

[BibT_eX]

[DOI]

,

,

,

Hado van Hasselt

CoRR, 2022

Learning by Directional Gradient Descent.

[BibT_eX]

[DOI]

,

,

,

,

Hado van Hasselt

Proceedings of the Tenth International Conference on Learning Representations, 2022

Bootstrapped Meta-Learning.

[BibT_eX]

[DOI]

Sebastian Flennerhag

,

Yannick Schroecker

,

,

Hado van Hasselt

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Chaining Value Functions for Off-Policy Learning.

[BibT_eX]

[DOI]

,

John Shawe-Taylor

,

Hado van Hasselt

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Introducing Symmetries to Black Box Meta Reinforcement Learning.

[BibT_eX]

[DOI]

,

Sebastian Flennerhag

,

Hado van Hasselt

,

Abram L. Friesen

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Learning Expected Emphatic Traces for Deep RL.

[BibT_eX]

[DOI]

,

Shangtong Zhang

,

,

,

Hado van Hasselt

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Podracer architectures for scalable Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Hado van Hasselt

CoRR, 2021

Synthetic Returns for Long-Term Credit Assignment.

[BibT_eX]

[DOI]

,

,

,

,

Theophane Weber

,

Matt M. Botvinick

,

Hado van Hasselt

,

H. Francis Song

CoRR, 2021

Discovery of Options via Meta-Learned Subgoals.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Hado van Hasselt

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-Consistent Models and Values.

[BibT_eX]

[DOI]

Gregory Farquhar

,

,

,

,

,

Hado Philip van Hasselt

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Emphatic Algorithms for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Charles Blundell

,

Hado van Hasselt

Proceedings of the 38th International Conference on Machine Learning, 2021

Muesli: Combining Improvements in Policy Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Theophane Weber

,

,

Hado van Hasselt

Proceedings of the 38th International Conference on Machine Learning, 2021

Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity.

[BibT_eX]

[DOI]

,

Wojciech Marian Czarnecki

,

,

Dhruva Tirumala

,

,

,

Hado van Hasselt

,

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Expected Eligibility Traces.

[BibT_eX]

[DOI]

Hado van Hasselt

,

Sephora Madjiheurem

,

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Self-Tuning Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Hado van Hasselt

,

,

CoRR, 2020

A Self-Tuning Actor-Critic Algorithm.

[BibT_eX]

[DOI]

,

,

,

,

,

Hado van Hasselt

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Meta-Gradient Reinforcement Learning with an Objective Discovered Online.

[BibT_eX]

[DOI]

,

Hado Philip van Hasselt

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Discovering Reinforcement Learning Algorithms.

[BibT_eX]

[DOI]

,

,

Wojciech M. Czarnecki

,

,

Hado van Hasselt

,

,

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Forethought and Hindsight in Credit Assignment.

[BibT_eX]

[DOI]

,

,

Hado van Hasselt

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

What Can Learned Intrinsic Rewards Capture?

[BibT_eX]

[DOI]

,

,

,

,

,

Hado van Hasselt

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

Behaviour Suite for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Katrina McKinney

,

,

Csaba Szepesvári

,

,

Benjamin Van Roy

,

Richard S. Sutton

,

,

Hado van Hasselt

Proceedings of the 8th International Conference on Learning Representations, 2020

Conditional Importance Sampling for Off-Policy Learning.

[BibT_eX]

[DOI]

,

Anna Harutyunyan

,

Hado van Hasselt

,

,

,

,

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019

General non-linear Bellman equations.

[BibT_eX]

[DOI]

Hado van Hasselt

,

,

,

,

,

CoRR, 2019

On Inductive Biases in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

Hado van Hasselt

,

,

CoRR, 2019

Meta-learning of Sequential Strategies.

[BibT_eX]

[DOI]

CoRR, 2019

Discovery of Useful Questions as Auxiliary Tasks.

[BibT_eX]

[DOI]

,

,

,

Janarthanan Rajendran

,

Richard L. Lewis

,

,

Hado van Hasselt

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

When to use parametric models in reinforcement learning?

[BibT_eX]

[DOI]

Hado van Hasselt

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Hindsight Credit Assignment.

[BibT_eX]

[DOI]

Anna Harutyunyan

,

,

,

Mohammad Gheshlaghi Azar

,

,

,

Hado van Hasselt

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Universal Successor Features Approximators.

[BibT_eX]

[DOI]

,

,

,

Daniel J. Mankowitz

,

Hado van Hasselt

,

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Multi-Task Deep Reinforcement Learning with PopArt.

[BibT_eX]

[DOI]

,

,

,

Wojciech Czarnecki

,

,

Hado van Hasselt

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Deep Reinforcement Learning and the Deadly Triad.

[BibT_eX]

[DOI]

Hado van Hasselt

,

,

,

,

Nicolas Sonnerat

,

CoRR, 2018

The Barbados 2018 List of Open Issues in Continual Learning.

[BibT_eX]

[DOI]

,

Hado van Hasselt

,

,

,

,

Pierre-Luc Bacon

,

,

,

Marc G. Bellemare

,

CoRR, 2018

Observe and Look Further: Achieving Consistent Performance on Atari.

[BibT_eX]

[DOI]

,

,

,

Mohammad Gheshlaghi Azar

,

,

,

Gabriel Barth-Maron

,

Hado van Hasselt

,

,

,

,

,

Olivier Pietquin

CoRR, 2018

Unicorn: Continual Learning with a Universal, Off-policy Agent.

[BibT_eX]

[DOI]

Daniel J. Mankowitz

,

Augustin Zídek

,

,

,

,

,

,

Hado van Hasselt

,

,

CoRR, 2018

Meta-Gradient Reinforcement Learning.

[BibT_eX]

[DOI]

,

Hado van Hasselt

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Coordinate with Coordination Graphs in Repeated Single-Stage Multi-Agent Decision Problems.

[BibT_eX]

[DOI]

Eugenio Bargiacchi

,

Timothy Verstraeten

,

Diederik M. Roijers

,

,

Hado van Hasselt

Proceedings of the 35th International Conference on Machine Learning, 2018

Distributed Prioritized Experience Replay.

[BibT_eX]

[DOI]

,

,

,

Gabriel Barth-Maron

,

,

Hado van Hasselt

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Rainbow: Combining Improvements in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

Hado van Hasselt

,

,

Georg Ostrovski

,

,

,

,

Mohammad Gheshlaghi Azar

,

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

StarCraft II: A New Challenge for Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2017

Natural Value Approximators: Learning when to Trust Past Estimates.

[BibT_eX]

[DOI]

,

,

Hado van Hasselt

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Successor Features for Transfer in Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Jonathan J. Hunt

,

,

,

Hado van Hasselt

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Predictron: End-To-End Learning and Planning.

[BibT_eX]

[DOI]

,

Hado van Hasselt

,

,

,

,

,

Gabriel Dulac-Arnold

,

David P. Reichert

,

Neil C. Rabinowitz

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Learning functions across many orders of magnitudes.

[BibT_eX]

[DOI]

Hado van Hasselt

,

,

,

CoRR, 2016

Learning values across many orders of magnitude.

[BibT_eX]

[DOI]

Hado van Hasselt

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Dueling Network Architectures for Deep Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

Hado van Hasselt

,

,

Nando de Freitas

Proceedings of the 33nd International Conference on Machine Learning, 2016

Deep Reinforcement Learning with Double Q-Learning.

[BibT_eX]

[DOI]

Hado van Hasselt

,

,

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Learning to Predict Independent of Span.

[BibT_eX]

[DOI]

Hado van Hasselt

,

Richard S. Sutton

CoRR, 2015

2014

Off-policy TD( l) with a true online equivalence.

[BibT_eX]

[DOI]

Hado van Hasselt

,

Ashique Rupam Mahmood

,

Richard S. Sutton

Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Weighted importance sampling for off-policy learning with linear function approximation.

[BibT_eX]

[DOI]

Ashique Rupam Mahmood

,

Hado van Hasselt

,

Richard S. Sutton

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

A new Q(lambda) with interim forward view and Monte Carlo equivalence.

[BibT_eX]

[DOI]

Richard S. Sutton

,

Ashique Rupam Mahmood

,

,

Hado van Hasselt

Proceedings of the 31th International Conference on Machine Learning, 2014

2013

Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average

[BibT_eX]

[DOI]

Hado van Hasselt

CoRR, 2013

Stacking under uncertainty: We know how to predict, but how should we act?

[BibT_eX]

[DOI]

Hado van Hasselt

,

Proceedings of the IEEE Symposium on Computational Intelligence In Production And Logistics Systems, 2013

2012

Reinforcement Learning in Continuous State and Action Spaces.

[BibT_eX]

[DOI]

Hado van Hasselt

Proceedings of the Reinforcement Learning, 2012

2011

Insights in reinforcement rearning : formal analysis and empirical evaluation of temporal-difference learning algorithms.

[BibT_eX]

[DOI]

Hado Philip van Hasselt

PhD thesis, 2011

Exploiting Best-Match Equations for Efficient Reinforcement Learning.

[BibT_eX]

[DOI]

Harm van Seijen

,

Shimon Whiteson

,

Hado van Hasselt

,

Marco A. Wiering

J. Mach. Learn. Res., 2011

Reinforcement learning algorithms for solving classification problems.

[BibT_eX]

[DOI]

Marco A. Wiering

,

Hado van Hasselt

,

Auke-Dirk Pietersma

,

Lambert Schomaker

Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010

Double Q-learning.

[BibT_eX]

[DOI]

Hado van Hasselt

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

2009

Using continuous action spaces to solve discrete problems.

[BibT_eX]

[DOI]

Hado van Hasselt

,

Marco A. Wiering

Proceedings of the International Joint Conference on Neural Networks, 2009

Adaptive Serious Games Using Agent Organizations.

[BibT_eX]

[DOI]

,

Hado van Hasselt

,

,

Virginia Dignum

Proceedings of the Agents for Games and Simulations, 2009

The QV family compared to other reinforcement learning algorithms.

[BibT_eX]

[DOI]

Marco A. Wiering

,

Hado van Hasselt

Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

A theoretical and empirical analysis of Expected Sarsa.

[BibT_eX]

[DOI]

Harm van Seijen

,

Hado van Hasselt

,

Shimon Whiteson

,

Marco A. Wiering

Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

2008

Ensemble Algorithms in Reinforcement Learning.

[BibT_eX]

[DOI]

Marco A. Wiering

,

Hado van Hasselt

IEEE Trans. Syst. Man Cybern. Part B, 2008

On-line adapting games using agent organizations.

[BibT_eX]

[DOI]

,

Hado van Hasselt

,

Virginia Dignum

,

Proceedings of the 2008 IEEE Symposium on Computational Intelligence and Games, 2008

Loading...