Sridhar Mahadevan

Proceedings of the IEEE 29th Symposium on Mass Storage Systems and Technologies, 2013

Manifold Alignment Preserving Global Geometry.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2013, 2013

Basis Adaptation for Sparse Nonlinear Reinforcement Learning.

[BibT_eX]

[DOI]

Stephen Giguere

Nicholas Jacek

Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

Multiscale Manifold Learning.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012

Sparse Q-learning with Mirror Descent.

[BibT_eX]

[DOI]

Bo Liu

Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Regularized Off-Policy TD-Learning.

[BibT_eX]

[DOI]

Bo Liu

Ji Liu

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Manifold Warping: Manifold Alignment over Time.

[BibT_eX]

[DOI]

Hoa Trong Vu

Clifton Carey

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011

A GPU-Based Approximate SVD Algorithm.

[BibT_eX]

[DOI]

Blake Foster

Rui Wang

Proceedings of the Parallel Processing and Applied Mathematics, 2011

Jointly Learning Data-Dependent Label and Locality-Preserving Projections.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2011, 2011

Heterogeneous Domain Adaptation Using Manifold Alignment.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2011, 2011

2010

Basis Construction from Power Series Expansions of Value Functions.

[BibT_eX]

[DOI]

Bo Liu

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Basis function construction for hierarchical reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

Compressing POMDPs Using Locality Preserving Non-Negative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Representation Discovery in Sequential Decision Making.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009

Hybrid least-squares algorithms for approximate policy evaluation.

[BibT_eX]

[DOI]

Marek Petrik

Mach. Learn., 2009

Learning Representation and Control in Markov Decision Processes: New Frontiers.

[BibT_eX]

[DOI]

Found. Trends Mach. Learn., 2009

Multiscale Analysis of Document Corpora Based on Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2009, 2009

Manifold Alignment without Correspondence.

[BibT_eX]

[DOI]

Proceedings of the IJCAI 2009, 2009

Transfer Learning and Representation Discovery in Intelligent Tutoring Systems.

[BibT_eX]

[DOI]

Kimberly Ferguson

Beverly Park Woolf

Proceedings of the Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling, 2009

A General Framework for Manifold Alignment.

[BibT_eX]

[DOI]

Proceedings of the Manifold Learning and Its Applications, 2009

2008

Representation Discovery using Harmonic Analysis

[BibT_eX]

[DOI]

Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01546-5, 2008

Manifold alignment using Procrustes analysis.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2008

Fast Spectral Learning using Lanczos Eigenspace Projections.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007

Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes.

[BibT_eX]

[DOI]

Mauro Maggioni

J. Mach. Learn. Res., 2007

Hierarchical Average Reward Reinforcement Learning.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2007

Learning state-action basis functions for hierarchical MDPs.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2007

Adaptive mesh compression in 3D computer graphics using multiscale manifold learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2007

Constructing basis functions from directed graphs for value function approximation.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2007

Learning to Plan Using Harmonic Analysis of Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, 2007

Repairing Disengagement With Non-Invasive Interventions.

[BibT_eX]

[DOI]

Proceedings of the Artificial Intelligence in Education, 2007

Representation Discovery in Planning using Harmonic Analysis.

[BibT_eX]

[DOI]

Proceedings of the Computational Approaches to Representation Change during Learning and Development, 2007

Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006

Hierarchical multi-agent reinforcement learning.

[BibT_eX]

[DOI]

Rajbala Makar

Auton. Agents Multi Agent Syst., 2006

Estimating Student Proficiency Using an Item Response Theory Model.

[BibT_eX]

[DOI]

Beverly Park Woolf

Proceedings of the Intelligent Tutoring Systems, 8th International Conference, 2006

Improving Intelligent Tutoring Systems: Using Expectation Maximization to Learn Student Skill Levels.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Tutoring Systems, 8th International Conference, 2006

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes.

[BibT_eX]

[DOI]

Mauro Maggioni

Proceedings of the Machine Learning, 2006

Learning Representation and Control in Continuous Markov Decision Processes.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2006

2005

Representation Policy Iteration.

[BibT_eX]

[DOI]

Proceedings of the UAI '05, 2005

Switching kalman filters for prediction and tracking in an adaptive meteorological sensing network.

[BibT_eX]

[DOI]

Victoria Manfredi

James F. Kurose

Proceedings of the Second Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks, 2005

Value Function Approximation with Diffusion Wavelets and Laplacian Eigenfunctions.

[BibT_eX]

[DOI]

Mauro Maggioni

Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Coarticulation: an approach for generating concurrent plans in Markov decision processes.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2005

Proto-value functions: developmental reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2005

Samuel Meets Amarel: Automating Value Function Approximation Using Global State Space Analysis.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2005

A Variational Learning Algorithm for the Abstract Hidden Markov Model.

[BibT_eX]

[DOI]

Proceedings of the Proceedings, 2005

2004

Coarticulation in Markov Decision Processes.

[BibT_eX]

[DOI]

Robert Platt Jr.

Roderic A. Grupen

Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Learning hierarchical models of activity.

[BibT_eX]

[DOI]

Victoria Manfredi

Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Learning to Communicate and Act Using Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004

Probabilistic Plan Recognition in Multiagent Systems.

[BibT_eX]

[DOI]

Suchi Saria

Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS 2004), 2004

2003

Recent Advances in Hierarchical Reinforcement Learning.

[BibT_eX]

[DOI]

Andrew G. Barto

Discret. Event Dyn. Syst., 2003

Hierarchical Policy Gradient Algorithms.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning, 2003

2002

Spatiotemporal Abstraction of Stochastic Sequential Processes.

[BibT_eX]

[DOI]

Proceedings of the Abstraction, 2002

Learning to Take Concurrent Actions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Learning the hierarchical structure of spatial environments using multiresolution statistical models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002

Approximate Planning with Hierarchical Partially Observable Markov Decision Process Models for Robot Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2002 IEEE International Conference on Robotics and Automation, 2002

Hierarchically Optimal Average Reward Reinforcement Learning.

[BibT_eX]

Proceedings of the Machine Learning, 2002

A multiagent reinforcement learning algorithm by dynamically merging markov decision processes.

[BibT_eX]

[DOI]

Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002

2001

Decision-Theoretic Planning with Concurrent Temporally Extended Actions.

[BibT_eX]

[DOI]

Proceedings of the UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001

Learning Hierarchical Partially Observable Markov Decision Process Models for Robot Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2001 IEEE International Conference on Robotics and Automation, 2001

Continuous-Time Hierarchical Reinforcement Learning.

[BibT_eX]

Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

A reinforcement learning model of selective visual attention.

[BibT_eX]

[DOI]

Silviu Minut

Natalia Hernandez-Gardiol

Proceedings of the Fifth International Conference on Autonomous Agents, 2001

2000

Hierarchical Memory-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Face Recognition Using Foveal Vision.

[BibT_eX]

[DOI]

Proceedings of the Biologically Motivated Computer Vision, 2000

1999

Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes.

[BibT_eX]

Gang Wang

Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999

1998

Rapid Concept Learning for Mobile Robots.

[BibT_eX]

[DOI]

Nikfar Khaleeli

Mach. Learn., 1998

Optimizing Production Manufacturing Using Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Florida Artificial Intelligence Research Society Conference, 1998

1996

Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results.

[BibT_eX]

[DOI]

Mach. Learn., 1996

The National Science Foundation Workshop on Reinforcement Learning.

[BibT_eX]

[DOI]

Leslie Pack Kaelbling

AI Mag., 1996

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning.

[BibT_eX]

Proceedings of the Machine Learning, 1996

An Average-Reward Reinforcement Learning Algorithm for Computing Bias-Optimal Policies.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, 1996

1994

Quantifying Prior Determination Knowledge Using the PAC Learning Model.

[BibT_eX]

[DOI]

Prasad Tadepalli

Mach. Learn., 1994

To Discount or Not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning.

[BibT_eX]

[DOI]