Guy Lever

Orcid: 0000-0001-9551-1839

According to our database1, Guy Lever authored at least 31 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Learning agile soccer skills for a bipedal robot with deep reinforcement learning.
Sci. Robotics, 2024

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning.
CoRR, 2024

Replay across Experiments: A Natural Extension of Off-Policy RL.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generative Adversarial Equilibrium Solvers.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2022

From motor control to team play in simulated humanoid football.
Sci. Robotics, 2022

Developing, evaluating and scaling learning agents in multi-agent environments.
AI Commun., 2022

2020
A Generalized Training Approach for Multiagent Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Biases for Emergent Communication in Multi-agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Reinforcement Learning Agents acquire Flocking and Symbiotic Behaviour in Simulated Ecosystems.
Proceedings of the 2019 Conference on Artificial Life, 2019

Emergent Coordination Through Competition.
Proceedings of the 7th International Conference on Learning Representations, 2019

The Body is Not a Given: Joint Agent Policy Learning and Morphology Evolution.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning.
CoRR, 2018

Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

2017
Value-Decomposition Networks For Cooperative Multi-Agent Learning.
CoRR, 2017

Nesterov's accelerated gradient and momentum as approximations to regularised update descent.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

2016
Approximate Newton Methods for Policy Search in Markov Decision Processes.
J. Mach. Learn. Res., 2016

Compressed Conditional Mean Embeddings for Model-Based Reinforcement Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
A Gauss-Newton Method for Markov Decision Processes.
CoRR, 2015

Modelling Policies in MDPs in Reproducing Kernel Hilbert Space.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
Deterministic Policy Gradient Algorithms.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
Tighter PAC-Bayes bounds through distribution-dependent priors.
Theor. Comput. Sci., 2013

2012
Data dependent kernels in nearly-linear time.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Conditional mean embeddings as regressors - supplementary
CoRR, 2012

Conditional mean embeddings as regressors.
Proceedings of the 29th International Conference on Machine Learning, 2012

Modelling transition dynamics in MDPs with RKHS embeddings.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Exploiting structure defined by data in machine learning : some new analyses.
PhD thesis, 2011

2010
Relating Function Class Complexity and Cluster Structure in the Function Domain with Applications to Transduction.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Distribution-Dependent PAC-Bayes Priors.
Proceedings of the Algorithmic Learning Theory, 21st International Conference, 2010

2009
Predicting the Labelling of a Graph via Minimum $p$-Seminorm Interpolation.
Proceedings of the COLT 2009, 2009

2008
Online Prediction on Large Diameter Graphs.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008


  Loading...