David Silver

Proceedings of the Experimental Robotics, 2012

Active learning from demonstration for robust autonomous navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Compositional Planning Using Optimal Option Models.

[BibT_eX]

[DOI]

Kamil Ciosek

Proceedings of the 29th International Conference on Machine Learning, 2012

Gradient Temporal Difference Networks.

[BibT_eX]

[DOI]

Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

Actor-Critic Reinforcement Learning with Energy-Based Policies.

[BibT_eX]

[DOI]

Nicolas Heess

Yee Whye Teh

Proceedings of the Tenth European Workshop on Reinforcement Learning, 2012

2011

A Monte-Carlo AIXI Approximation.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2011

Monte-Carlo tree search and rapid action value estimation in computer Go.

[BibT_eX]

[DOI]

Sylvain Gelly

Artif. Intell., 2011

Monte Carlo Localization and registration to prior data for outdoor navigation.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Non-Linear Monte-Carlo Search in Civilization II.

[BibT_eX]

[DOI]

S. R. K. Branavan

Regina Barzilay

Proceedings of the IJCAI 2011, 2011

2010

Learning Preference Models for Autonomous Mobile Robots in Complex Domains.

[BibT_eX]

[DOI]

PhD thesis, 2010

Learning for Autonomous Navigation.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Mag., 2010

Learning from Demonstration for Autonomous Navigation in Complex Unstructured Terrain.

[BibT_eX]

[DOI]

Int. J. Robotics Res., 2010

Monte-Carlo Planning in Large POMDPs.

[BibT_eX]

[DOI]

Joel Veness

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Reinforcement Learning via AIXI Approximation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009

A Monte Carlo AIXI Approximation

[BibT_eX]

[DOI]

CoRR, 2009

Learning to search: Functional gradient techniques for imitation learning.

[BibT_eX]

[DOI]

Nathan D. Ratliff

Auton. Robots, 2009

Bootstrapping from Game Tree Search.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Perceptual Interpretation for Autonomous Navigation through Dynamic Imitation Learning.

[BibT_eX]

[DOI]

Proceedings of the Robotics Research - The 14th International Symposium, 2009

Fast gradient-descent methods for temporal-difference learning with linear function approximation.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Monte-Carlo simulation balancing.

[BibT_eX]

[DOI]

Gerald Tesauro

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Applied Imitation Learning for Autonomous Navigation in Complex Natural Terrain.

[BibT_eX]

[DOI]

Proceedings of the Field and Service Robotics, Results of the 7th International Conference, 2009

2008

History, Hype, and Hope: An Afterward.

[BibT_eX]

[DOI]

First Monday, 2008

High Performance Outdoor Navigation from Overhead Data using Imitation Learning.

[BibT_eX]

[DOI]

James A. Bagnell

Proceedings of the Robotics: Science and Systems IV, 2008

Sample-based learning and search with permanent and transient memories.

[BibT_eX]

[DOI]

Richard S. Sutton

Martin Müller

Proceedings of the Machine Learning, 2008

Achieving Master Level Play in 9 x 9 Computer Go.

[BibT_eX]

[DOI]

Sylvain Gelly

Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007

Reinforcement Learning of Local Shape in the Game of Go.

[BibT_eX]

[DOI]

Richard S. Sutton

Martin Müller

Proceedings of the IJCAI 2007, 2007

On the role of tracking in stationary environments.

[BibT_eX]

[DOI]

Richard S. Sutton

Anna Koop

Proceedings of the Machine Learning, 2007

Combining online and offline knowledge in UCT.

[BibT_eX]

[DOI]

Sylvain Gelly

Proceedings of the Machine Learning, 2007

2006

Topological exploration of subterranean environments.

[BibT_eX]

[DOI]

J. Field Robotics, 2006

Recent developments in subterranean robotics.

[BibT_eX]

[DOI]

J. Field Robotics, 2006

Experimental Analysis of Overhead Data Processing To Support Long Range Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

2005

The hierarchical atlas.

[BibT_eX]

[DOI]

IEEE Trans. Robotics, 2005

Towards Topological Exploration of Abandoned Mines.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

Topological Global Localization for Subterranean Voids.

[BibT_eX]

[DOI]

Joseph Carsten

Scott Thayer

Proceedings of the Field and Service Robotics, Results of the 5th International Conference, 2005

Cooperative Pathfinding.

[BibT_eX]

Proceedings of the First Artificial Intelligence and Interactive Digital Entertainment Conference, 2005

2004

Internet/Cyberculture/ Digital Culture/New Media/ Fill-in-the-Blank Studies.

[BibT_eX]

[DOI]

New Media Soc., 2004

Scan matching for flooded subterranean voids.

[BibT_eX]

[DOI]

David M. Bradley

Scott Thayer

Proceedings of the 2004 IEEE Conference on Robotics, Automation and Mechatronics, 2004

A regional point descriptor for global topological localization in flooded subterranean environments.

[BibT_eX]

[DOI]

David M. Bradley

Scott Thayer

Proceedings of the 2004 IEEE Conference on Robotics, Automation and Mechatronics, 2004

Feature extraction for topological mine maps.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Arc Carving: Obtaining Accurate, Low Latency Maps from Ultrasonic Range Sensors.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

2003

Hierarchical simultaneous localization and mapping.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

2000

Book Review: Life Online: Researching Real Experience in Virtual Space.

[BibT_eX]

[DOI]