Abbas Abdolmaleki

Orcid: 0000-0001-6692-5856

According to our database1, Abbas Abdolmaleki authored at least 67 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation.
Trans. Mach. Learn. Res., 2024

Preference Optimization as Probabilistic Inference.
CoRR, 2024

Game On: Towards Language Models as RL Experimenters.
CoRR, 2024

Real-world fluid directed rigid body control via deep reinforcement learning.
Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Offline Actor-Critic Reinforcement Learning Scales to Large Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration.
Trans. Mach. Learn. Res., 2023

Policy composition in reinforcement learning via multi-objective policy optimization.
CoRR, 2023

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation.
CoRR, 2023

Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains.
CoRR, 2023

2022

From motor control to team play in simulated humanoid football.
Sci. Robotics, 2022

Magnetic control of tokamak plasmas through deep reinforcement learning.
Nat., 2022

Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach.
CoRR, 2022

Offline Distillation for Robot Lifelong Learning with Imbalanced Experience.
CoRR, 2022

How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Evaluating Model-Based Planning and Planner Amortization for Continuous Control.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Forgetting and Imbalance in Robot Lifelong Learning with Off-policy Data.
Proceedings of the Conference on Lifelong Learning Agents, 2022

2021
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning.
CoRR, 2021

Rethinking Exploration for Sample-Efficient Policy Learning.
CoRR, 2021

Data-efficient Hindsight Off-policy Option Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021


A Constrained Multi-Objective Reinforcement Learning Framework.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020
"What, not how": Solving an under-actuated insertion task from scratch.
CoRR, 2020

Local Search for Policy Iteration in Continuous Control.
CoRR, 2020

Acme: A Research Framework for Distributed Reinforcement Learning.
CoRR, 2020

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning.
CoRR, 2020

Compositional Transfer in Hierarchical Reinforcement Learning.
Proceedings of the Robotics: Science and Systems XVI, 2020

A distributional view on multi-objective policy optimization.
Proceedings of the 37th International Conference on Machine Learning, 2020

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control.
Proceedings of the 8th International Conference on Learning Representations, 2020

Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Robust Reinforcement Learning for Continuous Control with Model Misspecification.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Contextual Direct Policy Search - With Regularized Covariance Matrix Estimation.
J. Intell. Robotic Syst., 2019

Quinoa: a Q-function You Infer Normalized Over Actions.
CoRR, 2019

Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer.
CoRR, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models.
CoRR, 2019

Augmenting learning using symmetry in a biologically-inspired domain.
CoRR, 2019

Regularized Hierarchical Policies for Compositional Transfer in Robotics.
CoRR, 2019

Robust Reinforcement Learning for Continuous Control with Model Misspecification.
CoRR, 2019

Value constrained model-free continuous control.
CoRR, 2019

Simultaneously Learning Vision and Feature-Based Control Policies for Real-World Ball-In-A-Cup.
Proceedings of the Robotics: Science and Systems XV, 2019

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Tranferable Latent Dynamics Models.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018
Information theoretic stochastic search
PhD thesis, 2018

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement.
J. Mach. Learn. Res., 2018

Relative Entropy Regularized Policy Iteration.
CoRR, 2018

DeepMind Control Suite.
CoRR, 2018

Eager and Memory-Based Non-Parametric Stochastic Search Methods for Learning Control.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Guide Actor-Critic for Continuous Control.
Proceedings of the 6th International Conference on Learning Representations, 2018

Maximum a Posteriori Policy Optimisation.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Contextual Covariance Matrix Adaptation Evolutionary Strategies.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Deriving and improving CMA-ES with information geometric trust regions.
Proceedings of the Genetic and Evolutionary Computation Conference, 2017

Stochastic Search In Changing Situations.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Contextual Policy Search for Linear and Nonlinear Generalization of a Humanoid Walking Controller.
J. Intell. Robotic Syst., 2016

Learning a Humanoid Kick with Controlled Distance.
Proceedings of the RoboCup 2016: Robot World Cup XX [Leipzig, Germany, June 30, 2016

Non-parametric contextual stochastic search.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Model-Free Trajectory Optimization for Reinforcement Learning.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Contextual Relative Entropy Policy Search with Covariance Matrix Adaptation.
Proceedings of the 2016 International Conference on Autonomous Robot Systems and Competitions, 2016

Contextual Stochastic Search.
Proceedings of the Genetic and Evolutionary Computation Conference, 2016

2015
Model-Based Relative Entropy Stochastic Search.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Contextual Policy Search for Generalizing a Parameterized Biped Walking Controller.
Proceedings of the 2015 IEEE International Conference on Autonomous Robot Systems and Competitions, 2015

Regularized covariance estimation for weighted maximum likelihood policy search methods.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015

2014
Omnidirectional Walking with a Compliant Inverted Pendulum Model.
Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2014, 2014

2013
Omnidirectional Walking and Active Balance for Soccer Humanoid Robot.
Proceedings of the Progress in Artificial Intelligence, 2013

2012
A Model for Context Aware Mobile Payment.
J. Theor. Appl. Electron. Commer. Res., 2012

A Distributed Cooperative Reinforcement Learning Method for Decision Making in Fire Brigade Teams.
Proceedings of the RoboCup 2012: Robot Soccer World Cup XVI [papers from the 16th Annual RoboCup International Symposium, 2012

2011
A Reinforcement Learning Based Method for Optimizing the Process of Decision Making in Fire Brigade Agents.
Proceedings of the Progress in Artificial Intelligence, 2011


  Loading...