Satinder Singh
Orcid: 0000-0002-5169-9486Affiliations:
- DeepMind, London, UK
- University of Michigan, Department of Electrical Engineering and Computer Science, Ann Arbor, MI, USA
- Syntek Capital
- AT&T Labs, Florham Park, NJ, USA
- University of Colorado Boulder, Department of Computer Science, CO, USA
- Massachusetts Institute of Technology (MIT), Brain and Cognitive Science Department, Cambridge, MA, USA
According to our database1,
Satinder Singh
authored at least 250 papers
between 1991 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Attention learning models using local Zernike moments-based normalized images and convolutional neural networks for skin lesion classification.
Biomed. Signal Process. Control., 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
2023
Risk-aware analysis for interpretations of probabilistic achievement and maintenance commitments.
Artif. Intell., April, 2023
Trans. Mach. Learn. Res., 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Companion Proceedings of the Conference on Genetic and Evolutionary Computation, 2023
2022
Figure Data for the paper "Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning".
Dataset, October, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Conference on Lifelong Learning Agents, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in First-person Simulated 3D Environments.
CoRR, 2020
Semantics and algorithms for trustworthy commitment achievement under model uncertainty.
Auton. Agents Multi Agent Syst., 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020
Querying to Find a Safe Policy under Uncertain Safety Constraints in Markov Decision Processes.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Modeling Probabilistic Commitments for Maintenance Is Inherently Harder than for Achievement.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Proceedings of the Adversarial and Uncertain Reasoning for Adaptive Cyber Defense, 2019
CoRR, 2019
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Computational Strategies for the Trustworthy Pursuit and the Safe Modeling of Probabilistic Maintenance Commitments.
Proceedings of the Workshop on Artificial Intelligence Safety 2019 co-located with the 28th International Joint Conference on Artificial Intelligence, 2019
Deep Reinforcement Learning for Multi-driver Vehicle Dispatching and Repositioning Problem.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Multistage Attack Graph Security Games: Heuristic Strategies, with Empirical Game-Theoretic Analysis.
Secur. Commun. Networks, 2018
Named Entities troubling your Neural Methods? Build NE-Table: A neural approach for handling Named Entities.
CoRR, 2018
The Advantage of Doubling: A Deep Reinforcement Learning Approach to Studying the Double Team in the NBA.
CoRR, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Minimax-Regret Querying on Side Effects for Safe Optimality in Factored Markov Decision Processes.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 20th International Trust Workshop co-located with AAMAS/IJCAI/ECAI/ICML 2018, 2018
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018
Proceedings of the Algorithmic Learning Theory, 2018
2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 5th International Conference on Learning Representations, 2017
Proceedings of the Decision and Game Theory for Security - 8th International Conference, 2017
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017
Multi-Stage Attack Graph Security Games: Heuristic Strategies, with Empirical Game-Theoretic Analysis.
Proceedings of the 2017 Workshop on Moving Target Defense, 2017
Proceedings of the Twenty-Seventh International Conference on Automated Planning and Scheduling, 2017
Approximately-Optimal Queries for Planning in Reward-Uncertain Markov Decision Processes.
Proceedings of the Twenty-Seventh International Conference on Automated Planning and Scheduling, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Multi-task seizure detection: addressing intra-patient variation in seizure morphologies.
Mach. Learn., 2016
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016
Proceedings of the 3rd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 18th International Workshop on Trust in Agent Societies co-located with the 15th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2016), 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Proceedings of the 32nd International Conference on Machine Learning, 2015
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
Computational Rationality: Linking Mechanism and Behavior Through Bounded Utility Maximization.
Top. Cogn. Sci., 2014
Top. Cogn. Sci., 2014
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014
Ecologically valid long-term mood monitoring of individuals with bipolar disorder using speech.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014
Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014
Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics, 2014
Proceedings of the Twenty-Fourth International Conference on Automated Planning and Scheduling, 2014
Computationally Rational Saccadic Control: An Explanation of Spillover Effects Based on Sampling from Noisy Perception and Memory.
Proceedings of the Fifth Workshop on Cognitive Modeling and Computational Linguistics, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
2013
The Adaptive Nature of Eye Movements in Linguistic Tasks: How Payoff and Architecture Shape Speed-Accuracy Trade-Offs.
Top. Cogn. Sci., 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the Human-Computer Interaction. Human-Centred Design Approaches, Methods, Tools, and Environments, 2013
2012
Proceedings of the 13th ACM Conference on Electronic Commerce, 2012
Proceedings of the 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics, 2012
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012
Proceedings of the Game Theory for Security, 2012
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012
2011
Learning to Make Predictions In Partially Observable Environments Without a Generative Model.
J. Artif. Intell. Res., 2011
Proceedings of the PASSAT/SocialCom 2011, Privacy, 2011
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011
2010
IEEE Trans. Auton. Ment. Dev., 2010
Proceedings of the UAI 2010, 2010
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010
Proceedings of the 2010 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2010
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010
2009
Proceedings of the IJCAI 2009, 2009
Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009
Proceedings of the 8th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009
2008
Proceedings of the UAI 2008, 2008
Proceedings of the Advances in Neural Information Processing Systems 21, 2008
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008
Predictive Linear-Gaussian Models of Dynamical Systems with Vector-Valued Actions and Observations.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008
Efficiently learning linear-linear exponential family predictive representations of state.
Proceedings of the Machine Learning, 2008
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008
2007
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007
Proceedings of the Advances in Neural Information Processing Systems 20, 2007
Proceedings of the IJCAI 2007, 2007
On discovery and learning of models with predictive representations of state for agents with continuous actions and observations.
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
2006
Auton. Agents Multi Agent Syst., 2006
Proceedings of the UAI '06, 2006
Proceedings of the Machine Learning, 2006
Proceedings of the Machine Learning, 2006
Proceedings of the Machine Learning, 2006
Mixtures of Predictive Linear Gaussian Models for Nonlinear, Stochastic Dynamical Systems.
Proceedings of the Proceedings, 2006
Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains.
Proceedings of the Proceedings, 2006
2005
Proceedings of the UAI '05, 2005
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005
Proceedings of the Machine Learning, 2005
Proceedings of the Proceedings, 2005
2004
Proceedings of the UAI '04, 2004
Proceedings of the Proceedings 5th ACM Conference on Electronic Commerce (EC-2004), 2004
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004
Proceedings of the 2004 International Conference on Machine Learning and Applications, 2004
Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning.
Proceedings of the Machine Learning, 2004
Learning and discovery of predictive state representations in dynamical systems with reset.
Proceedings of the Machine Learning, 2004
Proceedings of the Computers and Games, 4th International Conference, 2004
Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS 2004), 2004
2003
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003
2002
Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System.
J. Artif. Intell. Res., 2002
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002
2001
Proceedings of the Electronic Commerce, Second International Workshop, 2001
Proceedings of the UAI '01: Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, 2001
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
Proceedings of the Fifth International Conference on Autonomous Agents, 2001
2000
Mach. Learn., 2000
Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000
Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000
Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000
Eligibility Traces for Off-Policy Policy Evaluation.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000
A Boosting Approach to Topic Spotting on Subdialogues.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000
Bias-Variance Error Bounds for Temporal Difference Updates.
Proceedings of the Thirteenth Annual Conference on Computational Learning Theory (COLT 2000), June 28, 2000
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000
1999
Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning.
Artif. Intell., 1999
Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999
Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999
1998
Mach. Learn., 1998
Experimental Results on Learning Stochastic Memoryless Policies for Partially Observable Markov Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998
Optimizing Admission Control while Ensuring Quality of Service in Multimedia Networks via Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 11, [NIPS Conference, Denver, Colorado, USA, November 30, 1998
Intra-Option Learning about Temporally Abstract Actions.
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998
Using Eligibility Traces to Find the Best Memoryless Policy in Partially Observable Markov Decision Processes.
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998
Near-Optimal Reinforcement Learning in Polynominal Time.
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998
Proceedings of the Machine Learning: ECML-98, 1998
1997
Proceedings of the Advances in Neural Information Processing Systems 10, 1997
1996
Proceedings of the Advances in Neural Information Processing Systems 9, 1996
Proceedings of the Advances in Neural Information Processing Systems 9, 1996
Proceedings of the Ninth Annual Conference on Computational Learning Theory, 1996
1995
Proceedings of the Advances in Neural Information Processing Systems 8, 1995
Proceedings of the Eigth Annual Conference on Computational Learning Theory, 1995
1994
Neural Comput., 1994
Mach. Learn., 1994
Proceedings of the Advances in Neural Information Processing Systems 7, 1994
Proceedings of the Advances in Neural Information Processing Systems 7, 1994
Learning Without State-Estimation in Partially Observable Markovian Decision Processes.
Proceedings of the Machine Learning, 1994
Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA, USA, July 31, 1994
1993
Proceedings of the Advances in Neural Information Processing Systems 6, 1993
1992
Mach. Learn., 1992
Scaling Reinforcement Learning Algorithms by Learning Variable Temporal Resolution Models.
Proceedings of the Ninth International Workshop on Machine Learning (ML 1992), 1992
Proceedings of the 10th National Conference on Artificial Intelligence, 1992
1991
Proceedings of the Advances in Neural Information Processing Systems 4, 1991
A Cortico-Cerebellar Model that Learns to Generate Distributed Motor Commands to Control a Kinematic Arm.
Proceedings of the Advances in Neural Information Processing Systems 4, 1991
Proceedings of the Eighth International Workshop (ML91), 1991