Abbas Abdolmaleki

Bilal Piot

Bobak Shahriari

CoRR, 2024

Game On: Towards Language Models as RL Experimenters.

[BibT_eX]

[DOI]

Jingwei Zhang

Thomas Lampe

CoRR, 2024

Real-world fluid directed rigid body control via deep reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Offline Actor-Critic Reinforcement Learning Scales to Large Models.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Policy composition in reinforcement learning via multi-objective policy optimization.

[BibT_eX]

[DOI]

CoRR, 2023

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, 2023

Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains.

[BibT_eX]

[DOI]

Jingwei Zhang

CoRR, 2023

2022

Figure Data for the paper "From Motor Control to Team Play in Simulated Humanoid Football".

[BibT_eX]

[DOI]

Dataset, August, 2022

From motor control to team play in simulated humanoid football.

[BibT_eX]

[DOI]

Sci. Robotics, 2022

Magnetic control of tokamak plasmas through deep reinforcement learning.

[BibT_eX]

[DOI]

Nat., 2022

Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach.

[BibT_eX]

[DOI]

Nicolas Heess

Matt Hoffman

CoRR, 2022

Offline Distillation for Robot Lifelong Learning with Imbalanced Experience.

[BibT_eX]

[DOI]

CoRR, 2022

How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation.

[BibT_eX]

[DOI]

Alex X. Lee

Coline Devin

Yuxiang Zhou

Thomas Lampe

Alessandro Davide Ialongo

Konstantinos Bousmalis

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Evaluating Model-Based Planning and Planner Amortization for Continuous Control.

[BibT_eX]

[DOI]

Yuval Tassa

Proceedings of the Tenth International Conference on Learning Representations, 2022

Forgetting and Imbalance in Robot Lifelong Learning with Off-policy Data.

[BibT_eX]

[DOI]

Proceedings of the Conference on Lifelong Learning Agents, 2022

2021

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning.

[BibT_eX]

[DOI]

Shruti Mishra

Dhruva TB

Arunkumar Byravan

Konstantinos Bousmalis

CoRR, 2021

Rethinking Exploration for Sample-Efficient Policy Learning.

[BibT_eX]

[DOI]

William F. Whitney

Michael Bloesch

CoRR, 2021

Data-efficient Hindsight Off-policy Option Learning.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

A Constrained Multi-Objective Reinforcement Learning Framework.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020

"What, not how": Solving an under-actuated insertion task from scratch.

[BibT_eX]

[DOI]

CoRR, 2020

Local Search for Policy Iteration in Continuous Control.

[BibT_eX]

[DOI]

CoRR, 2020

Acme: A Research Framework for Distributed Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2020

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Noah Y. Siegel

CoRR, 2020

Compositional Transfer in Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Markus Wulfmeier

Roland Hafner

Proceedings of the Robotics: Science and Systems XVI, 2020

A distributional view on multi-objective policy optimization.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control.

[BibT_eX]

[DOI]

H. Francis Song

Proceedings of the 8th International Conference on Learning Representations, 2020

Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning.

[BibT_eX]

[DOI]

Noah Y. Siegel

Proceedings of the 8th International Conference on Learning Representations, 2020

Robust Reinforcement Learning for Continuous Control with Model Misspecification.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Contextual Direct Policy Search - With Regularized Covariance Matrix Estimation.

[BibT_eX]

[DOI]

J. Intell. Robotic Syst., 2019

Quinoa: a Q-function You Infer Normalized Over Actions.

[BibT_eX]

[DOI]

Jonas Degrave

Nicolas Heess

CoRR, 2019

Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer.

[BibT_eX]

[DOI]

CoRR, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models.

[BibT_eX]

[DOI]

Arunkumar Byravan

CoRR, 2019

Augmenting learning using symmetry in a biologically-inspired domain.

[BibT_eX]

[DOI]

CoRR, 2019

Regularized Hierarchical Policies for Compositional Transfer in Robotics.

[BibT_eX]

[DOI]

Markus Wulfmeier

Roland Hafner

CoRR, 2019

Robust Reinforcement Learning for Continuous Control with Model Misspecification.

[BibT_eX]

[DOI]

Timothy A. Mann

Todd Hester

CoRR, 2019

Value constrained model-free continuous control.

[BibT_eX]

[DOI]

CoRR, 2019

Simultaneously Learning Vision and Feature-Based Control Policies for Real-World Ball-In-A-Cup.

[BibT_eX]

[DOI]

Devin Schwab

Murilo Fernandes Martins

Proceedings of the Robotics: Science and Systems XV, 2019

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics.

[BibT_eX]

[DOI]

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Tranferable Latent Dynamics Models.

[BibT_eX]

[DOI]

Arunkumar Byravan

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018

Information theoretic stochastic search

[BibT_eX]

[DOI]

PhD thesis, 2018

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2018

Relative Entropy Regularized Policy Iteration.

[BibT_eX]

[DOI]

CoRR, 2018

DeepMind Control Suite.

[BibT_eX]

[DOI]

CoRR, 2018

Eager and Memory-Based Non-Parametric Stochastic Search Methods for Learning Control.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Guide Actor-Critic for Continuous Control.

[BibT_eX]

[DOI]

Voot Tangkaratt

Masashi Sugiyama

Proceedings of the 6th International Conference on Learning Representations, 2018

Maximum a Posteriori Policy Optimisation.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Contextual Covariance Matrix Adaptation Evolutionary Strategies.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Deriving and improving CMA-ES with information geometric trust regions.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2017

Stochastic Search In Changing Situations.

[BibT_eX]

[DOI]

Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Contextual Policy Search for Linear and Nonlinear Generalization of a Humanoid Walking Controller.

[BibT_eX]

[DOI]

J. Intell. Robotic Syst., 2016

Learning a Humanoid Kick with Controlled Distance.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2016: Robot World Cup XX [Leipzig, Germany, June 30, 2016

Non-parametric contextual stochastic search.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Model-Free Trajectory Optimization for Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 33nd International Conference on Machine Learning, 2016

Contextual Relative Entropy Policy Search with Covariance Matrix Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Autonomous Robot Systems and Competitions, 2016

Contextual Stochastic Search.

[BibT_eX]

[DOI]

Proceedings of the Genetic and Evolutionary Computation Conference, 2016

2015

Model-Based Relative Entropy Stochastic Search.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Contextual Policy Search for Generalizing a Parameterized Biped Walking Controller.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Autonomous Robot Systems and Competitions, 2015

Regularized covariance estimation for weighted maximum likelihood policy search methods.

[BibT_eX]

[DOI]

Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

2014

Omnidirectional Walking with a Compliant Inverted Pendulum Model.

[BibT_eX]

[DOI]

Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2014, 2014

2013

Omnidirectional Walking and Active Balance for Soccer Humanoid Robot.

[BibT_eX]

[DOI]

Proceedings of the Progress in Artificial Intelligence, 2013

2012

A Model for Context Aware Mobile Payment.

[BibT_eX]

[DOI]

Leila Abedi

Mohammad Ali Nematbakhsh