Richard S. Sutton
Orcid: 0000-0002-3679-3415Affiliations:
- DeepMind Alberta, Edmonton, AB, Canada
- University of Alberta, Department of Computing Science, Edmonton, AB, Canada
- University of Massachusetts Amherst, MA, USA (PhD 1984)
According to our database1,
Richard S. Sutton
authored at least 163 papers
between 1983 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
-
on orcid.org
-
on id.loc.gov
-
on d-nb.info
-
on dl.acm.org
On csauthors.net:
Bibliography
2024
CoRR, 2024
On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes.
CoRR, 2024
CoRR, 2024
Reward Centering.
RLJ, 2024
SwiftTD: A Fast and Robust Algorithm for Temporal Difference Learning.
RLJ, 2024
An Idiosyncrasy of Time-discretization in Reinforcement Learning.
RLJ, 2024
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint).
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Artif. Intell., November, 2023
Communicative capital: a key resource for human-machine shared agency and collaborative capacity.
Neural Comput. Appl., August, 2023
From eye-blinks to state construction: Diagnostic benchmarks for online representation learning.
Adapt. Behav., February, 2023
J. Mach. Learn. Res., 2023
A Note on Stability in Asynchronous Stochastic Approximation without Communication Delays.
CoRR, 2023
CoRR, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Conference on Lifelong Learning Agents, 2023
Proceedings of the Conference on Lifelong Learning Agents, 2023
2022
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly-Communicating MDPs.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
2021
IEEE Trans. Syst. Man Cybern. Syst., 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment.
CoRR, 2021
CoRR, 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task.
CoRR, 2021
Policy iterations for reinforcement learning problems in continuous time and space - Fundamental theory and methods.
Autom., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
2020
Special Issue "On Defining Artificial Intelligence" - Commentaries and Author's Response.
J. Artif. Gen. Intell., 2020
Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning.
CoRR, 2020
Document-editing Assistants and Model-based Reinforcement Learning as a Path to Conversational AI.
CoRR, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
CoRR, 2019
Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning.
CoRR, 2019
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target.
CoRR, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Extending Sliding-Step Importance Weighting from Supervised Learning to Reinforcement Learning.
Proceedings of the Artificial Intelligence. IJCAI 2019 International Workshops, 2019
Prediction in Intelligence: An Empirical Comparison of Off-policy Algorithms on Robots.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
2018
J. Mach. Learn. Res., 2018
Frontiers Robotics AI, 2018
Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling.
CoRR, 2018
Two geometric input transformation methods for fast online reinforcement learning with neural nets.
CoRR, 2018
CoRR, 2018
CoRR, 2018
Comparing Direct and Indirect Temporal-Difference Methods for Estimating the Variance of the Return.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Integral Policy Iterations for Reinforcement Learning Problems in Continuous Time and Space.
CoRR, 2017
Crossprop: Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017
2016
J. Mach. Learn. Res., 2016
Face valuing: Training user interfaces with facial expressions and reinforcement learning.
CoRR, 2016
Learning representations through stochastic gradient descent in cross-validation error.
CoRR, 2016
CoRR, 2016
2015
Off-policy learning based on weighted importance sampling with linear computational complexity.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
2014
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Weighted importance sampling for off-policy learning with linear function approximation.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
2013
IEEE Robotics Autom. Mag., 2013
Temporal-Difference Learning to Assist Human Decision Making during the Control of an Artificial Limb.
CoRR, 2013
Proceedings of the Tenth Symposium on Abstraction, Reformulation, and Approximation, 2013
Real-time prediction learning for the simultaneous actuation of multiple prosthetic joints.
Proceedings of the IEEE 13th International Conference on Rehabilitation Robotics, 2013
Proceedings of the 30th International Conference on Machine Learning, 2013
Proceedings of the Learning Rich Representations from Low-Level Sensors, 2013
2012
Acquiring a broad range of empirical knowledge in real time by temporal-difference learning.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the American Control Conference, 2012
Proceedings of the Robots Learning Interactively from Human Teachers, 2012
2011
Proceedings of the Inductive Logic Programming - 21st International Conference, 2011
Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction.
Proceedings of the 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2011), 2011
2010
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010
2009
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009
Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009
Fast gradient-descent methods for temporal-difference learning with linear function approximation.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009
2008
Stimulus Representation and the Timing of Reward-Prediction Errors in Models of the Dopamine System.
Neural Comput., 2008
Proceedings of the UAI 2008, 2008
A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the Machine Learning, 2008
Proceedings of the Fourth Artificial Intelligence and Interactive Digital Entertainment Conference, 2008
2007
Proceedings of the Advances in Neural Information Processing Systems 20, 2007
Proceedings of the IJCAI 2007, 2007
Proceedings of the Machine Learning, 2007
2006
Proceedings of the Advances in Neural Information Processing Systems 19, 2006
Proceedings of the Proceedings, 2006
2005
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005
Using Predictive Representations to Improve Generalization in Reinforcement Learning.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005
Proceedings of the Machine Learning, 2005
2004
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004
2001
Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
Scaling Reinforcement Learning toward RoboCup Soccer.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001
Off-Policy Temporal Difference Learning with Function Approximation.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001
2000
Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000
Eligibility Traces for Off-Policy Policy Evaluation.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000
1999
Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning.
Artif. Intell., 1999
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999
Proceedings of the Computational Learning Theory, 4th European Conference, 1999
1998
Proceedings of the Simulated Evolution and Learning, 1998
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998
Intra-Option Learning about Temporally Abstract Actions.
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998
Proceedings of the Machine Learning: ECML-98, 1998
Adaptive computation and machine learning, MIT Press, ISBN: 978-0-262-19398-6, 1998
1997
Experiments with Reinforcement Learning in Problems with Continuous State and Action Spaces.
Adapt. Behav., 1997
Proceedings of the Advances in Neural Information Processing Systems 10, 1997
Exponentiated Gradient Methods for Reinforcement Learning.
Proceedings of the Fourteenth International Conference on Machine Learning (ICML 1997), 1997
Proceedings of the Artificial Neural Networks, 1997
1996
1995
Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding.
Proceedings of the Advances in Neural Information Processing Systems 8, 1995
Proceedings of the Machine Learning, 1995
1993
Proceedings of the Machine Learning, 1993
1992
Proceedings of the 10th National Conference on Artificial Intelligence, 1992
1991
SIGART Bull., 1991
Proceedings of the Advances in Neural Information Processing Systems 4, 1991
Proceedings of the Eighth International Workshop (ML91), 1991
Proceedings of the Eighth International Workshop (ML91), 1991
1990
Proceedings of the Advances in Neural Information Processing Systems 3, 1990
Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming.
Proceedings of the Machine Learning, 1990
1989
Proceedings of the Advances in Neural Information Processing Systems 2, 1989
1988
1985
Proceedings of the 9th International Joint Conference on Artificial Intelligence. Los Angeles, 1985
1983
IEEE Trans. Syst. Man Cybern., 1983