Sridhar Mahadevan
Orcid: 0000-0001-6507-9109Affiliations:
- Adobe Research, USA
- University of Massachusetts Amherst, USA
According to our database1,
Sridhar Mahadevan
authored at least 121 papers
between 1985 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on cs.umass.edu
On csauthors.net:
Bibliography
2024
2023
Randomized and Deterministic Attention Sparsification Algorithms for Over-parameterized Feature Dimension.
CoRR, 2023
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Unifying Causal Inference and Reinforcement Learning using Higher-Order Category Theory.
CoRR, 2022
On The Universality of Diagrams for Causal Inference and The Causal Reproducing Property.
CoRR, 2022
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022
2021
2020
Proceedings of the 37th International Conference on Machine Learning, 2020
2019
Proceedings of the Adjunct Publication of the 27th Conference on User Modeling, 2019
2018
Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity.
J. Artif. Intell. Res., 2018
CoRR, 2018
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Proceedings of the 5th International Conference on Learning Representations, 2017
2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
2015
CoRR, 2015
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Proceedings of the Computational Sustainability, 2015
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
IEEE Trans. Parallel Distributed Syst., 2014
Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces.
CoRR, 2014
Proceedings of the 2014 AAAI Fall Symposia, Arlington, Virginia, USA, November 13-15, 2014, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the IEEE 29th Symposium on Mass Storage Systems and Technologies, 2013
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013
2012
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012
2011
Proceedings of the Parallel Processing and Applied Mathematics, 2011
Proceedings of the IJCAI 2011, 2011
Proceedings of the IJCAI 2011, 2011
2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010
2009
Mach. Learn., 2009
Found. Trends Mach. Learn., 2009
Proceedings of the IJCAI 2009, 2009
Proceedings of the Artificial Intelligence in Education: Building Learning Systems that Care: From Knowledge Representation to Affective Modelling, 2009
Proceedings of the Manifold Learning and Its Applications, 2009
2008
Synthesis Lectures on Artificial Intelligence and Machine Learning, Morgan & Claypool Publishers, ISBN: 978-3-031-01546-5, 2008
Proceedings of the Machine Learning, 2008
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008
2007
Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes.
J. Mach. Learn. Res., 2007
Proceedings of the Machine Learning, 2007
Adaptive mesh compression in 3D computer graphics using multiscale manifold learning.
Proceedings of the Machine Learning, 2007
Proceedings of the Machine Learning, 2007
Proceedings of the Seventeenth International Conference on Automated Planning and Scheduling, 2007
Proceedings of the Artificial Intelligence in Education, 2007
Proceedings of the Computational Approaches to Representation Change during Learning and Development, 2007
Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
2006
Auton. Agents Multi Agent Syst., 2006
Proceedings of the Intelligent Tutoring Systems, 8th International Conference, 2006
Improving Intelligent Tutoring Systems: Using Expectation Maximization to Learn Student Skill Levels.
Proceedings of the Intelligent Tutoring Systems, 8th International Conference, 2006
Fast direct policy evaluation using multiscale analysis of Markov diffusion processes.
Proceedings of the Machine Learning, 2006
Proceedings of the Proceedings, 2006
2005
Switching kalman filters for prediction and tracking in an adaptive meteorological sensing network.
Proceedings of the Second Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks, 2005
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005
Coarticulation: an approach for generating concurrent plans in Markov decision processes.
Proceedings of the Machine Learning, 2005
Proceedings of the Machine Learning, 2005
Samuel Meets Amarel: Automating Value Function Approximation Using Global State Space Analysis.
Proceedings of the Proceedings, 2005
Proceedings of the Proceedings, 2005
2004
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004
Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS 2004), 2004
2003
Discret. Event Dyn. Syst., 2003
2002
Proceedings of the Abstraction, 2002
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002
Learning the hierarchical structure of spatial environments using multiresolution statistical models.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002
Approximate Planning with Hierarchical Partially Observable Markov Decision Process Models for Robot Navigation.
Proceedings of the 2002 IEEE International Conference on Robotics and Automation, 2002
Hierarchically Optimal Average Reward Reinforcement Learning.
Proceedings of the Machine Learning, 2002
A multiagent reinforcement learning algorithm by dynamically merging markov decision processes.
Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002
2001
Proceedings of the UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001
Learning Hierarchical Partially Observable Markov Decision Process Models for Robot Navigation.
Proceedings of the 2001 IEEE International Conference on Robotics and Automation, 2001
Continuous-Time Hierarchical Reinforcement Learning.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001
Proceedings of the Fifth International Conference on Autonomous Agents, 2001
2000
Proceedings of the Advances in Neural Information Processing Systems 13, 2000
Proceedings of the Biologically Motivated Computer Vision, 2000
1999
Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes.
Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999
1998
Proceedings of the Eleventh International Florida Artificial Intelligence Research Society Conference, 1998
1996
Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results.
Mach. Learn., 1996
Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning.
Proceedings of the Machine Learning, 1996
An Average-Reward Reinforcement Learning Algorithm for Computing Bias-Optimal Policies.
Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, 1996
1994
Mach. Learn., 1994
To Discount or Not to Discount in Reinforcement Learning: A Case Study Comparing R Learning and Q Learning.
Proceedings of the Machine Learning, 1994
1993
1992
Artif. Intell., 1992
Enhancing Transfer in Reinforcement Learning by Building Stochastic Models of Robot Actions.
Proceedings of the Ninth International Workshop on Machine Learning (ML 1992), 1992
1991
Scaling Reinforcement Learning to Robotics by Exploiting the Subsumption Architecture.
Proceedings of the Eighth International Workshop (ML91), 1991
1989
Using Determinations in EBL: A Solution to the incomplete Theory Problem.
Proceedings of the Sixth International Workshop on Machine Learning (ML 1989), 1989
1988
On the Tractability of Learning from Incomplete Theories.
Proceedings of the Machine Learning, 1988
1985
Proceedings of the 9th International Joint Conference on Artificial Intelligence. Los Angeles, 1985
Verification-based Learning: A Generalized Strategy for Inferring Problem-Reduction Methods.
Proceedings of the 9th International Joint Conference on Artificial Intelligence. Los Angeles, 1985