2024

Data Release for: Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning.

[DOI]

Dataset, April, 2024

Learning agile soccer skills for a bipedal robot with deep reinforcement learning.

[DOI]

Sci. Robotics, 2024

Replay across Experiments: A Natural Extension of Off-Policy RL.

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generative Adversarial Equilibrium Solvers.

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning.

[DOI]

Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

2022

Figure Data for the paper "From Motor Control to Team Play in Simulated Humanoid Football".

[DOI]

Dataset, August, 2022

From motor control to team play in simulated humanoid football.

[DOI]

Sci. Robotics, 2022

Developing, evaluating and scaling learning agents in multi-agent environments.

[DOI]

AI Commun., 2022

2020

A Generalized Training Approach for Multiagent Learning.

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Biases for Emergent Communication in Multi-agent Reinforcement Learning.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Reinforcement Learning Agents acquire Flocking and Symbiotic Behaviour in Simulated Ecosystems.

[DOI]

Proceedings of the 2019 Conference on Artificial Life, 2019

Emergent Coordination Through Competition.

[DOI]

Siqi Liu

Guy Lever

Josh Merel

Saran Tunyasuvunakool

Nicolas Heess

Thore Graepel

Proceedings of the 7th International Conference on Learning Representations, 2019

The Body is Not a Given: Joint Agent Policy Learning and Morphology Evolution.

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning.

[DOI]

Max Jaderberg

Wojciech M. Czarnecki

Iain Dunning

Luke Marris

Guy Lever

Antonio García Castañeda

CoRR, 2018

Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward.

[DOI]

Peter Sunehag

Guy Lever

Audrunas Gruslys

Wojciech Marian Czarnecki

Vinícius Flores Zambaldi

Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

2017

Value-Decomposition Networks For Cooperative Multi-Agent Learning.

[DOI]

Peter Sunehag

Guy Lever

Audrunas Gruslys

Wojciech Marian Czarnecki

Vinícius Flores Zambaldi

CoRR, 2017

Nesterov's accelerated gradient and momentum as approximations to regularised update descent.

[DOI]

Aleksandar Botev

Guy Lever

David Barber

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

2016

Approximate Newton Methods for Policy Search in Markov Decision Processes.

[DOI]

Thomas Furmston

Guy Lever

David Barber

J. Mach. Learn. Res., 2016

Compressed Conditional Mean Embeddings for Model-Based Reinforcement Learning.

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

A Gauss-Newton Method for Markov Decision Processes.

[DOI]

Thomas Furmston

Guy Lever

CoRR, 2015

Modelling Policies in MDPs in Reproducing Kernel Hilbert Space.

[DOI]

Guy Lever

Ronnie Stafford

Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014

Deterministic Policy Gradient Algorithms.

[DOI]

Proceedings of the 31th International Conference on Machine Learning, 2014

2013

Tighter PAC-Bayes bounds through distribution-dependent priors.

[DOI]

Guy Lever

François Laviolette

John Shawe-Taylor

Theor. Comput. Sci., 2013

2012

Data dependent kernels in nearly-linear time.

[DOI]

Guy Lever

Tom Diethe

John Shawe-Taylor

Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Conditional mean embeddings as regressors - supplementary

[DOI]

CoRR, 2012

Conditional mean embeddings as regressors.

[DOI]

Proceedings of the 29th International Conference on Machine Learning, 2012

Modelling transition dynamics in MDPs with RKHS embeddings.

[DOI]

Proceedings of the 29th International Conference on Machine Learning, 2012

2011

Exploiting structure defined by data in machine learning : some new analyses.

[DOI]

Guy Lever

PhD thesis, 2011

2010

Relating Function Class Complexity and Cluster Structure in the Function Domain with Applications to Transduction.

[DOI]

Guy Lever

Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Distribution-Dependent PAC-Bayes Priors.

[DOI]

Guy Lever

François Laviolette

John Shawe-Taylor

Proceedings of the Algorithmic Learning Theory, 21st International Conference, 2010

2009

Predicting the Labelling of a Graph via Minimum $p$-Seminorm Interpolation.

[DOI]

Mark Herbster

Guy Lever

Proceedings of the COLT 2009, 2009

2008

Online Prediction on Large Diameter Graphs.

[DOI]

Mark Herbster

Guy Lever

Massimiliano Pontil

Proceedings of the Advances in Neural Information Processing Systems 21, 2008