Martin A. Riedmiller

Orcid: 0000-0002-8465-5690

  • DeepMind, London, UK
  • Albert Ludwigs University Freiburg, Machine Learning Lab, Germany (former)

According to our database, Martin A. Riedmiller authored at least 167 papers between 1993 and 2024.

RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation.
Trans. Mach. Learn. Res., 2024

Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning.
CoRR, 2024

Real-world fluid directed rigid body control via deep reinforcement learning.
Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Offline Actor-Critic Reinforcement Learning Scales to Large Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Replay across Experiments: A Natural Extension of Off-Policy RL.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration.
Trans. Mach. Learn. Res., 2023

Faster sorting algorithms discovered using deep reinforcement learning.
Nat., 2023

Less is more - the Dispatcher/Executor principle for multi-task Reinforcement Learning.
CoRR, 2023

Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning.
CoRR, 2023

Policy composition in reinforcement learning via multi-objective policy optimization.
CoRR, 2023

Towards practical reinforcement learning for tokamak magnetic control.
CoRR, 2023

Towards A Unified Agent with Foundation Models.
CoRR, 2023

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation.
CoRR, 2023

A Generalist Dynamics Model for Control.
CoRR, 2023

Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains.
CoRR, 2023

Solving Continuous Control via Q-learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Magnetic control of tokamak plasmas through deep reinforcement learning.
Nat., 2022

Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach.
CoRR, 2022

The Challenges of Exploration for Offline Reinforcement Learning.
CoRR, 2022

Evaluating Model-Based Planning and Planner Amortization for Continuous Control.
Proceedings of the Tenth International Conference on Learning Representations, 2022

MO2: Model-Based Offline Options.
Proceedings of the Conference on Lifelong Learning Agents, 2022

Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration.
CoRR, 2021

On Multi-objective Policy Optimization as a Tool for Reinforcement Learning.
CoRR, 2021

Rethinking Exploration for Sample-Efficient Policy Learning.
CoRR, 2021

Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Representation Matters: Improving Perception and Exploration for Robotics.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Data-efficient Hindsight Off-policy Option Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Collect & Infer - a fresh look at data-efficient Reinforcement Learning.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

A Constrained Multi-Objective Reinforcement Learning Framework.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

Towards Real Robot Learning in the Wild: A Case Study in Bipedal Locomotion.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK, 2021

"What, not how": Solving an under-actuated insertion task from scratch.
CoRR, 2020

Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification.
CoRR, 2020

Local Search for Policy Iteration in Continuous Control.
CoRR, 2020

Simple Sensor Intentions for Exploration.
CoRR, 2020

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning.
CoRR, 2020

Compositional Transfer in Hierarchical Reinforcement Learning.
Proceedings of the Robotics: Science and Systems XVI, 2020

A distributional view on multi-objective policy optimization.
Proceedings of the 37th International Conference on Machine Learning, 2020

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control.
Proceedings of the 8th International Conference on Learning Representations, 2020

Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Robust Reinforcement Learning for Continuous Control with Model Misspecification.
Proceedings of the 8th International Conference on Learning Representations, 2020

Towards General and Autonomous Learning of Core Skills: A Case Study in Locomotion.
Proceedings of the 4th Conference on Robot Learning, 2020

Adaptive long-term control of biological neural networks with Deep Reinforcement Learning.
Neurocomputing, 2019

Quinoa: a Q-function You Infer Normalized Over Actions.
CoRR, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models.
CoRR, 2019

Regularized Hierarchical Policies for Compositional Transfer in Robotics.
CoRR, 2019

Robust Reinforcement Learning for Continuous Control with Model Misspecification.
CoRR, 2019

Self-supervised Learning of Image Embedding for Continuous Control.
CoRR, 2019

Simultaneously Learning Vision and Feature-Based Control Policies for Real-World Ball-In-A-Cup.
Proceedings of the Robotics: Science and Systems XV, 2019

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Relative Entropy Regularized Policy Iteration.
CoRR, 2018

DeepMind Control Suite.
CoRR, 2018

Graph Networks as Learnable Physics Engines for Inference and Control.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning by Playing - Solving Sparse Reward Tasks from Scratch.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning an Embedding Space for Transferable Robot Skills.
Proceedings of the 6th International Conference on Learning Representations, 2018

Maximum a Posteriori Policy Optimisation.
Proceedings of the 6th International Conference on Learning Representations, 2018

Controlling biological neural networks with deep reinforcement learning.
Proceedings of the 26th European Symposium on Artificial Neural Networks, 2018

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards.
CoRR, 2017

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation.
CoRR, 2017

PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations.
CoRR, 2017

Emergence of Locomotion Behaviours in Rich Environments.
CoRR, 2017

Autonomous Optimization of Targeted Stimulation of Neuronal Networks.
PLoS Comput. Biol., 2016

Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Learning and Transfer of Modulated Locomotor Controllers.
CoRR, 2016

Human-level control through deep reinforcement learning.
Nat., 2015

Autonomous Learning of State Representations for Control: An Emerging Field Aims to Autonomously Learn State Representations for Reinforcement Learning Agents from Their Real-World Sensor Observations.
Künstliche Intell., 2015

Striving for Simplicity: The All Convolutional Net.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Multimodal deep learning for robust RGB-D object recognition.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Improving Deep Neural Networks with Probabilistic Maxout Units.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Discriminative Unsupervised Feature Learning with Convolutional Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

A brain-computer interface for high-level remote control of an autonomous, reinforcement-learning-based robotic system for reaching and grasping.
Proceedings of the 19th International Conference on Intelligent User Interfaces, 2014

Approximate model-assisted Neural Fitted Q-Iteration.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Deterministic Policy Gradient Algorithms.
Proceedings of the 31st International Conference on Machine Learning, 2014

Approximate real-time optimal control based on sparse Gaussian process models.
Proceedings of the 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2014

Playing Atari with Deep Reinforcement Learning.
CoRR, 2013

Improvement of a Web Browser Game Through the Knowledge Extracted from Player Behavior.
Proceedings of the Knowledge, Information and Creativity Support Systems: Recent Trends, Advances and Solutions - Selected Papers from KICSS'2013, 2013

Acquiring visual servoing reaching and grasping skills using neural reinforcement learning.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Learning machines that perceive, act and communicate.
Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems, 2013

Optimization of Gaussian process hyperparameters using Rprop.
Proceedings of the 21st European Symposium on Artificial Neural Networks, 2013

Electricity Demand Forecasting using Gaussian Processes.
Proceedings of the Trading Agent Design and Analysis, 2013

10 Steps and Some Tricks to Set up Neural Reinforcement Controllers.
Proceedings of the Neural Networks: Tricks of the Trade - Second Edition, 2012

Unsupervised Learning of Local Features for Music Classification.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Taming the reservoir: Feedforward training for recurrent neural networks.
Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

Autonomous reinforcement learning on raw visual input data in a real world application.
Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

A learned feature descriptor for object recognition in RGB-D data.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Learning Temporal Coherent Features through Life-Time Sparsity.
Proceedings of the Neural Information Processing - 19th International Conference, 2012

Learn to Swing Up and Balance a Real Pole Based on Raw Visual Input Data.
Proceedings of the Neural Information Processing - 19th International Conference, 2012

Batch Reinforcement Learning.
Proceedings of the Reinforcement Learning, 2012

Reinforcement learning in feedback control - Challenges and benchmarks from technical process control.
Mach. Learn., 2011

Enhancing the episodic natural actor-critic algorithm by a regularisation term to stabilize learning of control structures.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

Improved neural fitted Q iteration applied to a novel computer gaming and learning benchmark.
Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

Cognitive concepts in autonomous soccer playing robots.
Cogn. Syst. Res., 2010

On Progress in RoboCup: The Simulation League Showcase.
Proceedings of the RoboCup 2010: Robot Soccer World Cup XIV [papers from the 14th annual RoboCup International Symposium], 2010

Deep auto-encoder neural networks in reinforcement learning.
Proceedings of the International Joint Conference on Neural Networks, 2010

Deep learning of visual control policies.
Proceedings of the 18th European Symposium on Artificial Neural Networks, 2010

Efficient Identification of State in Reinforcement Learning.
Künstliche Intell., 2009

Computational object recognition: a biologically motivated approach.
Biol. Cybern., 2009

Reinforcement learning for robot soccer.
Auton. Robots, 2009

The Neuro Slot Car Racer: Reinforcement Learning in a Real World Setting.
Proceedings of the International Conference on Machine Learning and Applications, 2009

09371 Abstracts Collection - Algorithmic Methods for Distributed Cooperative Systems.
Proceedings of the Algorithmic Methods for Distributed Cooperative Systems, 06.09., 2009

Incremental GRLVQ: Learning relevant features for 3D object recognition.
Neurocomputing, 2008

A Case Study on Improving Defense Behavior in Soccer Simulation 2D: The NeuroHassle Approach.
Proceedings of the RoboCup 2008: Robot Soccer World Cup XII [papers from the 12th annual RoboCup International Symposium], 2008

Joint Equilibrium Policy Search for Multi-Agent Scheduling Problems.
Proceedings of the Multiagent System Technologies, 6th German Conference, 2008

Learning to dribble on a real robot by success and failure.
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets.
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

Increasing Precision of Credible Case-Based Inference.
Proceedings of the Advances in Case-Based Reasoning, 9th European Conference, 2008

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Making a Robot Learn to Play Soccer Using Reward and Punishment.
Proceedings of the KI 2007: Advances in Artificial Intelligence, 2007

Neural Reinforcement Learning Controllers for a Real Robot Application.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs.
Proceedings of the Case-Based Reasoning Research and Development, 2007

Learning to Drive a Real Car in 20 Minutes.
Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies 2007, 2007

Reinforcement learning in a nutshell.
Proceedings of the 15th European Symposium on Artificial Neural Networks, 2007

Real-time 3D Ball Recognition using Perspective and Catadioptric Cameras.
Proceedings of the 3rd European Conference on Mobile Robots, 2007

Safe Q-Learning on Complete History Spaces.
Proceedings of the Machine Learning: ECML 2007, 2007

Scaling Adaptive Agent-Based Reactive Job-Shop Scheduling to Large-Scale Problems.
Proceedings of the 2007 IEEE Symposium on Computational Intelligence in Scheduling, 2007

On Experiences in a Complex and Competitive Gaming Domain: Reinforcement Learning Meets RoboCup.
Proceedings of the 2007 IEEE Symposium on Computational Intelligence and Games, 2007

Learning a Partial Behavior for a Competitive Robotic Soccer Agent.
Künstliche Intell., 2006

Die Brainstormers: Entwurfsprinzipien lernfähiger autonomer Roboter.
Inform. Spektrum, 2006

Appearance-Based Robot Discrimination Using Eigenimages.
Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Multi-agent Case-Based Reasoning for Cooperative Reinforcement Learners.
Proceedings of the Advances in Case-Based Reasoning, 8th European Conference, 2006

Reducing policy degradation in neuro-dynamic programming.
Proceedings of the 14th European Symposium on Artificial Neural Networks, 2006

06251 Abstracts Collection - Multi-Robot Systems: Perception, Behaviors, Learning, and Action.
Proceedings of the Multi-Robot Systems: Perception, Behaviors, Learning, and Action, 19.06., 2006

Effective Methods for Reinforcement Learning in Large Multi-Agent Domains.
it Inf. Technol., 2005

Learning policies for abstract state spaces.
Proceedings of the IEEE International Conference on Systems, 2005

Comparing different methods to speed up reinforcement learning in a complex domain.
Proceedings of the IEEE International Conference on Systems, 2005

Neural reinforcement learning to swing-up and balance a real pole.
Proceedings of the IEEE International Conference on Systems, 2005

Calculating the Perfect Match: An Efficient and Accurate Approach for Robot Self-localization.
Proceedings of the RoboCup 2005: Robot Soccer World Cup IX, 2005

Reinforcement Learning Using a Grid Based Function Approximator.
Proceedings of the Biomimetic Neural Learning for Intelligent Robots, 2005

Modeling Moving Objects in a Dynamically Changing Robot Application.
Proceedings of the KI 2005: Advances in Artificial Intelligence, 2005

CBR for State Value Function Approximation in Reinforcement Learning.
Proceedings of the Case-Based Reasoning, 2005

Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method.
Proceedings of the Machine Learning: ECML 2005, 2005

Fynesse: An architecture for integrating prior knowledge in autonomously learning agents.
Soft Comput., 2004

Invited talks.
Künstliche Intell., 2004

RoboCup-2003: New Scientific and Technical Advances.
AI Mag., 2004

Evolution of Computer Vision Subsystems in Robot Navigation and Image Classification Tasks.
Proceedings of the RoboCup 2004: Robot Soccer World Cup VIII, 2004

Machine Learning for Autonomous Robots.
Proceedings of the KI 2004: Advances in Artificial Intelligence, 2004

Reinforcement Learning for Stochastic Cooperative Multi-Agent Systems.
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Reinforcement learning on explicitly specified time scales.
Neural Comput. Appl., 2003

Overview of RoboCup 2003 Competition and Conferences.
Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

RoboCup: Yesterday, Today, and Tomorrow Workshop of the Executive Committee in Blaubeuren, October 2003.
Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

Reinforcement learning on an omnidirectional mobile robot.
Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

The Smaller the Better: Comparison of Two Approaches for Sales Rate Prediction.
Proceedings of the Advances in Intelligent Data Analysis V, 2003

Learning to Control at Multiple Time Scales.
Proceedings of the Artificial Neural Networks and Neural Information Processing, 2003

Speeding-up Reinforcement Learning with Multi-step Actions.
Proceedings of the Artificial Neural Networks, 2002

Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer.
Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001

Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer.
Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Karlsruhe Brainstormers 2000 Team Description.
Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Learning Situation Dependent Success Rates of Actions in a RoboCup Scenario.
Proceedings of the PRICAI 2000, Topics in Artificial Intelligence, 6th Pacific Rim International Conference on Artificial Intelligence, Melbourne, Australia, August 28, 2000

An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Reinforcement Learning for Cooperating and Communicating Reactive Agents in Electrical Power Grids.
Proceedings of the Balancing Reactivity and Social Deliberation in Multi-Agent Systems, 2000

Concepts and Facilities of a Neural Reinforcement Learning Control Architecture for Technical Process Control.
Neural Comput. Appl., 1999

Karlsruhe Brainstormers - Design Principles.
Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

A Neural Reinforcement Learning Approach to Learn Local Dispatching Policies in Production Scheduling.
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

Distributed Value Functions.
Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999

A Neural Approach for the Control of Piezoelectric Micromanipulation Robots.
J. Intell. Robotic Syst., 1998

Selbständig lernende neuronale Steuerungen.
PhD thesis, 1997

A new method for the analysis of neural reference model control.
Proceedings of International Conference on Neural Networks (ICNN'97), 1997

Application of a self-learning controller with continuous control signals based on the DOE-approach.
Proceedings of the 5th European Symposium on Artificial Neural Networks, 1997

Fast Network Pruning and Feature Extraction by using the Unit-OBS Algorithm.
Proceedings of the Advances in Neural Information Processing Systems 9, 1996

Application of sequential reinforcement learning to control dynamic systems.
Proceedings of International Conference on Neural Networks (ICNN'96), 1996

Self-learning neural control of a mobile robot.
Proceedings of International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, November 27, 1995

Massively Parallel Training of Multi Layer Perceptrons With Irregular Topologies.
Proceedings of the Artificial Neural Nets and Genetic Algorithms, 1995

A direct adaptive method for faster backpropagation learning: the RPROP algorithm.
Proceedings of International Conference on Neural Networks (ICNN'93), San Francisco, CA, USA, March 28, 1993
