Gerald Tesauro
Affiliations:- IBM Thomas J. Watson Research Center, Yorktown Heights, NY, USA
According to our database1,
Gerald Tesauro
authored at least 99 papers
between 1987 and 2023.
Collaborative distances:
Collaborative distances:
Awards
ACM Fellow
ACM Fellow 2018, "For contributions to reinforcement learning, neural networks, and intelligent autonomous agents".
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2023
CoRR, 2023
2022
Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 43rd Annual Meeting of the Cognitive Science Society, 2021
Proceedings of the 2021 IEEE Conference on Games (CoG), 2021
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
RL Generalization in a Theory of Mind Game Through a Sleep Metaphor (Student Abstract).
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games.
CoRR, 2020
CoRR, 2020
Finding Macro-Actions with Disentangled Effects for Efficient Planning with the Goal-Count Heuristic.
CoRR, 2020
Decentralized TD Tracking with Linear Function Approximation and its Finite-Time Analysis.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
CoRR, 2019
Learning to Learn without Forgetting by Maximizing Transfer and Minimizing Interference.
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Neural Networks, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
CoRR, 2017
Proceedings of the 5th International Conference on Learning Representations, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2013
2012
IBM J. Res. Dev., 2012
Proceedings of the Winter Simulation Conference, 2012
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012
2010
2009
Proceedings of the 26th Annual International Conference on Machine Learning, 2009
2008
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008
2007
Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, 2007
IEEE Internet Comput., 2007
Clust. Comput., 2007
Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007
Proceedings of the Integrated Network Management, 2007
Coordinating Multiple Autonomic Managers to Achieve Specified Power-Performance Tradeoffs.
Proceedings of the Fourth International Conference on Autonomic Computing (ICAC'07), 2007
2006
Proceedings of the 3rd International Conference on Autonomic Computing, 2006
Proceedings of the Machine Learning: ECML 2006, 2006
2005
Proceedings of the Second International Conference on Autonomic Computing (ICAC 2005), 2005
Proceedings of the Proceedings, 2005
Proceedings of the Proceedings, 2005
2004
Proceedings of the 1st International Conference on Autonomic Computing (ICAC 2004), 2004
Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2004), 2004
2003
Proceedings of the UAI '03, 2003
A strategic decision model for multi-attribute bilateral negotiation with alternating.
Proceedings of the Proceedings 4th ACM Conference on Electronic Commerce (EC-2003), 2003
Proceedings of the Proceedings 4th ACM Conference on Electronic Commerce (EC-2003), 2003
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003
2002
Auton. Agents Multi Agent Syst., 2002
Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002
2001
Proceedings of the Proceedings 3rd ACM Conference on Electronic Commerce (EC-2001), 2001
Proceedings of the Sequence Learning - Paradigms, Algorithms, and Applications, 2001
Agent-Human Interactions in the Continuous Double Auction.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001
2000
Multi-agent Q-learning and Regression Trees for Automated Pricing Decisions.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000
Pseudo-convergent Q-Learning by Competitive Pricebots.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000
1999
Proceedings of the First ACM Conference on Electronic Commerce (EC-99), 1999
1998
Mach. Learn., 1998
Proceedings of the First International Conference on Information and Computation Economies, 1998
1996
Proceedings of the Advances in Neural Information Processing Systems 9, 1996
1995
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995
1994
Neural Comput., 1994
1992
Proceedings of the Ninth International Workshop on Machine Learning (ML 1992), 1992
1991
1990
Proceedings of the Advances in Neural Information Processing Systems 3, 1990
Proceedings of the IJCNN 1990, 1990
1989
Proceedings of the Advances in Neural Information Processing Systems 2, 1989
Proceedings of the Advances in Neural Information Processing Systems 2, 1989
1988
Proceedings of the Advances in Neural Information Processing Systems 1, 1988
Proceedings of the Advances in Neural Information Processing Systems 1, 1988
Connectionist Learning of Expert Backgammon Evaluations.
Proceedings of the Machine Learning, 1988
1987
Complex Syst., 1987
Proceedings of the Neural Information Processing Systems, Denver, Colorado, USA, 1987, 1987