Tom Schaul

Orcid: 0000-0002-2961-8782

Affiliations:
  • Google DeepMind, London, UK
  • New York University, Courant Institute of Mathematical Sciences, NY, USA
  • IDSIA, Manno-Lugano, Switzerland
  • TU Munich, Germany (PhD 2011)


According to our database1, Tom Schaul authored at least 84 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Open-Endedness is Essential for Artificial Superhuman Intelligence.
CoRR, 2024

Position: Open-Endedness is Essential for Artificial Superhuman Intelligence.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Vision-Language Models as a Source of Rewards.
CoRR, 2023

Scaling Goal-based Exploration via Pruning Proto-goals.
CoRR, 2023

Scaling Goal-based Exploration via Pruning Proto-goals.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Discovering Evolution Strategies via Meta-Black-Box Optimization.
Proceedings of the Companion Proceedings of the Conference on Genetic and Evolutionary Computation, 2023

Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization.
Proceedings of the Genetic and Evolutionary Computation Conference, 2023

2022
AI for the Social Good (Dagstuhl Seminar 22091).
Dagstuhl Reports, 2022

The Phenomenon of Policy Churn.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Model-Value Inconsistency as a Signal for Epistemic Uncertainty.
Proceedings of the International Conference on Machine Learning, 2022

When should agents explore?
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Return-based Scaling: Yet Another Normalisation Trick for Deep RL.
CoRR, 2021

2020
Policy Evaluation Networks.
CoRR, 2020

Conditional Importance Sampling for Off-Policy Learning.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Grandmaster level in StarCraft II using multi-agent reinforcement learning.
Nat., 2019

Artificial and Computational Intelligence in Games: Revolutions in Computational Game AI (Dagstuhl Seminar 19511).
Dagstuhl Reports, 2019

AI for the Social Good (Dagstuhl Seminar 19082).
Dagstuhl Reports, 2019

Adapting Behaviour for Learning Progress.
CoRR, 2019

Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods.
CoRR, 2019

Ray Interference: a Source of Plateaus in Deep Reinforcement Learning.
CoRR, 2019

Universal Successor Features Approximators.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
The Barbados 2018 List of Open Issues in Continual Learning.
CoRR, 2018

Unicorn: Continual Learning with a Universal, Off-policy Agent.
CoRR, 2018

Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement.
Proceedings of the 35th International Conference on Machine Learning, 2018

Meta-learning by the baldwin effect.
Proceedings of the Genetic and Evolutionary Computation Conference Companion, 2018

Deep Q-learning From Demonstrations.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Rainbow: Combining Improvements in Deep Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Building Machines that Learn and Think for Themselves: Commentary on Lake et al., Behavioral and Brain Sciences, 2017.
CoRR, 2017

StarCraft II: A New Challenge for Reinforcement Learning.
CoRR, 2017

Learning from Demonstrations for Real World Reinforcement Learning.
CoRR, 2017

Natural Value Approximators: Learning when to Trust Past Estimates.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Successor Features for Transfer in Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

FeUdal Networks for Hierarchical Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

The Predictron: End-To-End Learning and Planning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Reinforcement Learning with Unsupervised Auxiliary Tasks.
Proceedings of the 5th International Conference on Learning Representations, 2017

2016
The 2014 General Video Game Playing Competition.
IEEE Trans. Comput. Intell. AI Games, 2016

Prioritized Experience Replay.
Proceedings of the 4th International Conference on Learning Representations, 2016

Successor Features for Transfer in Reinforcement Learning.
CoRR, 2016

Unifying Count-Based Exploration and Intrinsic Motivation.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Learning to learn by gradient descent by gradient descent.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Dueling Network Architectures for Deep Reinforcement Learning.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Analyzing the robustness of general video game playing agents.
Proceedings of the IEEE Conference on Computational Intelligence and Games, 2016

General Video Game AI: Competition, Challenges and Opportunities.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Universal Value Function Approximators.
Proceedings of the 32nd International Conference on Machine Learning, 2015

2014
An Extensible Description Language for Video Games.
IEEE Trans. Comput. Intell. AI Games, 2014

Natural evolution strategies.
J. Mach. Learn. Res., 2014

Unit Tests for Stochastic Optimization.
Proceedings of the 2nd International Conference on Learning Representations, 2014

2013
Adaptive learning rates and parallelization for stochastic, sparse, non-smooth gradients
Proceedings of the 1st International Conference on Learning Representations, 2013

Better Generalization with Forecasts.
Proceedings of the IJCAI 2013, 2013

No more pesky learning rates.
Proceedings of the 30th International Conference on Machine Learning, 2013

A linear time natural evolution strategy for non-separable functions.
Proceedings of the Genetic and Evolutionary Computation Conference, 2013

General Video Game Playing.
Proceedings of the Artificial and Computational Intelligence in Games, 2013

Towards a Video Game Description Language.
Proceedings of the Artificial and Computational Intelligence in Games, 2013

A video game description language for model-based or interactive learning.
Proceedings of the 2013 IEEE Conference on Computational Inteligence in Games (CIG), 2013

2012
The organization of behavior into temporal and spatial neighborhoods.
Proceedings of the 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics, 2012

Comparing natural evolution strategies to BIPOP-CMA-ES on noiseless and noisy black-box optimization testbeds.
Proceedings of the Genetic and Evolutionary Computation Conference, 2012

Benchmarking natural evolution strategies with adaptation sampling on the noiseless and noisy black-box optimization testbeds.
Proceedings of the Genetic and Evolutionary Computation Conference, 2012

Investigating the impact of adaptation sampling in natural evolution strategies on black-box optimization testbeds.
Proceedings of the Genetic and Evolutionary Computation Conference, 2012

Benchmarking exponential natural evolution strategies on the noiseless and noisy black-box optimization testbeds.
Proceedings of the Genetic and Evolutionary Computation Conference, 2012

Benchmarking separable natural evolution strategies on the noiseless and noisy black-box optimization testbeds.
Proceedings of the Genetic and Evolutionary Computation Conference, 2012

Natural evolution strategies converge on sphere functions.
Proceedings of the Genetic and Evolutionary Computation Conference, 2012

2011
Studies in Continuous Black-box Optimization.
PhD thesis, 2011

Measuring Intelligence through Games
CoRR, 2011

Natural Evolution Strategies
CoRR, 2011

Q-Error as a Selection Mechanism in Modular Reinforcement-Learning Systems.
Proceedings of the IJCAI 2011, 2011

The two-dimensional organization of behavior.
Proceedings of the 1st International Conference on Development and Learning and on Epigenetic Robotics, 2011

High dimensions and heavy tails for natural evolution strategies.
Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference, 2011

Curiosity-driven optimization.
Proceedings of the IEEE Congress on Evolutionary Computation, 2011

Coherence Progress: A Measure of Interestingness Based on Fixed Compressors.
Proceedings of the Artificial General Intelligence - 4th International Conference, 2011

2010
Metalearning.
Scholarpedia, 2010

Exploring parameter space in reinforcement learning.
Paladyn J. Behav. Robotics, 2010

PyBrain.
J. Mach. Learn. Res., 2010

A Natural Evolution Strategy for Multi-objective Optimization.
Proceedings of the Parallel Problem Solving from Nature, 2010

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients.
Proceedings of the Artificial Neural Networks, 2010

Exponential natural evolution strategies.
Proceedings of the Genetic and Evolutionary Computation Conference, 2010

2009
Ontogenetic and Phylogenetic Reinforcement Learning.
Künstliche Intell., 2009

Stochastic search using the natural gradient.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Scalable Neural Networks for Board Games.
Proceedings of the Artificial Neural Networks, 2009

Efficient natural evolution strategies.
Proceedings of the Genetic and Evolutionary Computation Conference, 2009

2008
Fitness Expectation Maximization.
Proceedings of the Parallel Problem Solving from Nature, 2008

Countering Poisonous Inputs with Memetic Neuroevolution.
Proceedings of the Parallel Problem Solving from Nature, 2008

Episodic Reinforcement Learning by Logistic Reward-Weighted Regression.
Proceedings of the Artificial Neural Networks, 2008

A scalable neural network architecture for board games.
Proceedings of the 2008 IEEE Symposium on Computational Intelligence and Games, 2008

Natural Evolution Strategies.
Proceedings of the IEEE Congress on Evolutionary Computation, 2008


  Loading...