Martin A. Riedmiller

Joschka Boedecker

Klaus Obermayer

Künstliche Intell., 2015

Striving for Simplicity: The All Convolutional Net.

Alexey Dosovitskiy

Thomas Brox

Proceedings of the 3rd International Conference on Learning Representations, 2015

Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images.

Manuel Watter

Joschka Boedecker

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Multimodal deep learning for robust RGB-D object recognition.

Andreas Eitel

Luciano Spinello

Wolfram Burgard

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

2014

Improving Deep Neural Networks with Probabilistic Maxout Units.

Proceedings of the 2nd International Conference on Learning Representations, 2014

Discriminative Unsupervised Feature Learning with Convolutional Neural Networks.

Alexey Dosovitskiy

Lukas Dominique Josef Fiederer

Thomas Brox

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

A brain-computer interface for high-level remote control of an autonomous, reinforcement-learning-based robotic system for reaching and grasping.

Thomas Lampe

Proceedings of the 19th International Conference on Intelligent User Interfaces, 2014

Approximate model-assisted Neural Fitted Q-Iteration.

Thomas Lampe

Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Deterministic Policy Gradient Algorithms.

Proceedings of the 31st International Conference on Machine Learning, 2014

Approximate real-time optimal control based on sparse Gaussian process models.

Joschka Boedecker

Jan Wülfing

Proceedings of the 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2014

2013

Playing Atari with Deep Reinforcement Learning.

CoRR, 2013

Improvement of a Web Browser Game Through the Knowledge Extracted from Player Behavior.

Proceedings of the Knowledge, Information and Creativity Support Systems: Recent Trends, Advances and Solutions - Selected Papers from KICSS'2013, 2013

Acquiring visual servoing reaching and grasping skills using neural reinforcement learning.

Thomas Lampe

Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Learning machines that perceive, act and communicate.

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems, 2013

Optimization of Gaussian process hyperparameters using Rprop.

Manuel Blum

Proceedings of the 21st European Symposium on Artificial Neural Networks, 2013

Electricity Demand Forecasting using Gaussian Processes.

Manuel Blum

Proceedings of the Trading Agent Design and Analysis, 2013

2012

10 Steps and Some Tricks to Set up Neural Reinforcement Controllers.

Proceedings of the Neural Networks: Tricks of the Trade - Second Edition, 2012

Unsupervised Learning of Local Features for Music Classification.

Jan Wülfing

Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Taming the reservoir: Feedforward training for recurrent neural networks.

Oliver Obst

Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

Autonomous reinforcement learning on raw visual input data in a real world application.

Arne Voigtländer

Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

A learned feature descriptor for object recognition in RGB-D data.

Manuel Blum

Jan Wülfing

Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Learning Temporal Coherent Features through Life-Time Sparsity.

Proceedings of the Neural Information Processing - 19th International Conference, 2012

Learn to Swing Up and Balance a Real Pole Based on Raw Visual Input Data.

Jan Mattner

Proceedings of the Neural Information Processing - 19th International Conference, 2012

Batch Reinforcement Learning.

Proceedings of the Reinforcement Learning, 2012

2011

Reinforcement learning in feedback control - Challenges and benchmarks from technical process control.

Roland Hafner

Mach. Learn., 2011

Enhancing the episodic natural actor-critic algorithm by a regularisation term to stabilize learning of control structures.

Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

Improved neural fitted Q iteration applied to a novel computer gaming and learning benchmark.

Christian Lutz

Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning, 2011

2010

Cognitive concepts in autonomous soccer playing robots.

Cogn. Syst. Res., 2010

On Progress in RoboCup: The Simulation League Showcase.

Proceedings of the RoboCup 2010: Robot Soccer World Cup XIV [papers from the 14th annual RoboCup International Symposium], 2010

Deep auto-encoder neural networks in reinforcement learning.

Proceedings of the International Joint Conference on Neural Networks, 2010

Deep learning of visual control policies.

Proceedings of the 18th European Symposium on Artificial Neural Networks, 2010

2009

Efficient Identification of State in Reinforcement Learning.

Stephan Timmer

Künstliche Intell., 2009

Computational object recognition: a biologically motivated approach.

Tim C. Kietzmann

Biol. Cybern., 2009

Reinforcement learning for robot soccer.

Auton. Robots, 2009

The Neuro Slot Car Racer: Reinforcement Learning in a Real World Setting.

Tim C. Kietzmann

Proceedings of the International Conference on Machine Learning and Applications, 2009

09371 Abstracts Collection - Algorithmic Methods for Distributed Cooperative Systems.

Proceedings of the Algorithmic Methods for Distributed Cooperative Systems, 06.09., 2009

2008

Incremental GRLVQ: Learning relevant features for 3D object recognition.

Tim C. Kietzmann

Neurocomputing, 2008

A Case Study on Improving Defense Behavior in Soccer Simulation 2D: The NeuroHassle Approach.

Florian Trost

Proceedings of the RoboCup 2008: Robot Soccer World Cup XII [papers from the 12th annual RoboCup International Symposium], 2008

Joint Equilibrium Policy Search for Multi-Agent Scheduling Problems.

Proceedings of the Multiagent System Technologies, 6th German Conference, 2008

Learning to dribble on a real robot by success and failure.

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action Sets.

Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

Increasing Precision of Credible Case-Based Inference.

Proceedings of the Advances in Case-Based Reasoning, 9th European Conference, 2008

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies.

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

2007

Making a Robot Learn to Play Soccer Using Reward and Punishment.

Proceedings of the KI 2007: Advances in Artificial Intelligence, 2007

Neural Reinforcement Learning Controllers for a Real Robot Application.

Roland Hafner

Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

An Analysis of Case-Based Value Function Approximation by Approximating State Transition Graphs.

Proceedings of the Case-Based Reasoning Research and Development, 2007

Learning to Drive a Real Car in 20 Minutes.

Michael Montemerlo

Hendrik Dahlkamp

Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies 2007, 2007

Reinforcement learning in a nutshell.

Verena Heidrich-Meisner

Christian Igel

Proceedings of the 15th European Symposium on Artificial Neural Networks, 2007

Real-time 3D Ball Recognition using Perspective and Catadioptric Cameras.

Proceedings of the 3rd European Conference on Mobile Robots, 2007

Safe Q-Learning on Complete History Spaces.

Stephan Timmer

Proceedings of the Machine Learning: ECML 2007, 2007

Scaling Adaptive Agent-Based Reactive Job-Shop Scheduling to Large-Scale Problems.

Proceedings of the 2007 IEEE Symposium on Computational Intelligence in Scheduling, 2007

On Experiences in a Complex and Competitive Gaming Domain: Reinforcement Learning Meets RoboCup.

Proceedings of the 2007 IEEE Symposium on Computational Intelligence and Games, 2007

2006

Learning a Partial Behavior for a Competitive Robotic Soccer Agent.

Künstliche Intell., 2006

Die Brainstormers: Entwurfsprinzipien lernfähiger autonomer Roboter.

Inform. Spektrum, 2006

Appearance-Based Robot Discrimination Using Eigenimages.

Proceedings of the RoboCup 2006: Robot Soccer World Cup X, 2006

Multi-agent Case-Based Reasoning for Cooperative Reinforcement Learners.

Proceedings of the Advances in Case-Based Reasoning, 8th European Conference, 2006

Reducing policy degradation in neuro-dynamic programming.

Proceedings of the 14th European Symposium on Artificial Neural Networks, 2006

06251 Abstracts Collection - Multi-Robot Systems: Perception, Behaviors, Learning, and Action.

Proceedings of the Multi-Robot Systems: Perception, Behaviors, Learning, and Action, 19.06., 2006

2005

Effective Methods for Reinforcement Learning in Large Multi-Agent Domains.

Daniel Withopf

it Inf. Technol., 2005

Learning policies for abstract state spaces.

Stephan Timmer

Proceedings of the IEEE International Conference on Systems, 2005

Comparing different methods to speed up reinforcement learning in a complex domain.

Daniel Withopf

Proceedings of the IEEE International Conference on Systems, 2005

Neural reinforcement learning to swing-up and balance a real pole.

Proceedings of the IEEE International Conference on Systems, 2005

Calculating the Perfect Match: An Efficient and Accurate Approach for Robot Self-localization.

Proceedings of the RoboCup 2005: Robot Soccer World Cup IX, 2005

Reinforcement Learning Using a Grid Based Function Approximator.

Alexander Sung

Artur Merke

Proceedings of the Biomimetic Neural Learning for Intelligent Robots, 2005

Modeling Moving Objects in a Dynamically Changing Robot Application.

Proceedings of the KI 2005: Advances in Artificial Intelligence, 2005

CBR for State Value Function Approximation in Reinforcement Learning.

Proceedings of the Case-Based Reasoning, 2005

Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method.

Proceedings of the Machine Learning: ECML 2005, 2005

2004

Fynesse: An architecture for integrating prior knowledge in autonomously learning agents.

Martin Spott

Soft Comput., 2004

Invited talks.

Künstliche Intell., 2004

RoboCup-2003: New Scientific and Technical Advances.

AI Mag., 2004

Evolution of Computer Vision Subsystems in Robot Navigation and Image Classification Tasks.

Proceedings of the RoboCup 2004: Robot Soccer World Cup VIII, 2004

Machine Learning for Autonomous Robots.

Proceedings of the KI 2004: Advances in Artificial Intelligence, 2004

Reinforcement Learning for Stochastic Cooperative Multi-Agent Systems.

Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

2003

Reinforcement learning on explicitly specified time scales.

Neural Comput. Appl., 2003

Overview of RoboCup 2003 Competition and Conferences.

Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

RoboCup: Yesterday, Today, and Tomorrow Workshop of the Executive Committee in Blaubeuren, October 2003.

Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

Reinforcement learning on an omnidirectional mobile robot.

Roland Hafner

Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

The Smaller the Better: Comparison of Two Approaches for Sales Rate Prediction.

Proceedings of the Advances in Intelligent Data Analysis V, 2003

Learning to Control at Multiple Time Scales.

Proceedings of the Artificial Neural Networks and Neural Information Processing, 2003

2002

Speeding-up Reinforcement Learning with Multi-step Actions.

Proceedings of the Artificial Neural Networks, 2002

2001

Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer.

Artur Merke

Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001

2000

Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer.

Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Karlsruhe Brainstormers 2000 Team Description.

Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Learning Situation Dependent Success Rates of Actions in a RoboCup Scenario.

Sebastian Buck

Proceedings of the PRICAI 2000, Topics in Artificial Intelligence, 6th Pacific Rim International Conference on Artificial Intelligence, Melbourne, Australia, August 28, 2000

An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems.

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Reinforcement Learning for Cooperating and Communicating Reactive Agents in Electrical Power Grids.

Andrew W. Moore

Jeff G. Schneider

Proceedings of the Balancing Reactivity and Social Deliberation in Multi-Agent Systems, 2000

1999

Concepts and Facilities of a Neural Reinforcement Learning Control Architecture for Technical Process Control.

Neural Comput. Appl., 1999

Karlsruhe Brainstormers - Design Principles.

Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

A Neural Reinforcement Learning Approach to Learn Local Dispatching Policies in Production Scheduling.

Simone C. Riedmiller

Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

Distributed Value Functions.

Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999

1998

A Neural Approach for the Control of Piezoelectric Micromanipulation Robots.

Karoly Santa

Michael Mews

J. Intell. Robotic Syst., 1998

1997

Selbständig lernende neuronale Steuerungen.

PhD thesis, 1997

A new method for the analysis of neural reference model control.

Michael Wigbers

Proceedings of International Conference on Neural Networks (ICNN'97), 1997

Application of a self-learning controller with continuous control signals based on the DOE-approach.

Proceedings of the 5th European Symposium on Artificial Neural Networks, 1997

1996

Fast Network Pruning and Feature Extraction by using the Unit-OBS Algorithm.

Achim Stahlberger

Proceedings of the Advances in Neural Information Processing Systems 9, 1996

Application of sequential reinforcement learning to control dynamic systems.

Proceedings of International Conference on Neural Networks (ICNN'96), 1996

1995

Self-learning neural control of a mobile robot.

Barbara Janusz

Proceedings of International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, November 27, 1995

Massively Parallel Training of Multi Layer Perceptrons With Irregular Topologies.

D. Koll

Heinrich Braun

Proceedings of the Artificial Neural Nets and Genetic Algorithms, 1995

1993

A direct adaptive method for faster backpropagation learning: the RPROP algorithm.
