Michael H. Bowling

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

No-Regret Learning in Extensive-Form Games with Imperfect Recall.

Proceedings of the 29th International Conference on Machine Learning, 2012

On Local Regret.

Proceedings of the 29th International Conference on Machine Learning, 2012

Context Tree Switching.

Proceedings of the 2012 Data Compression Conference, Snowbird, UT, USA, April 10-12, 2012

Keynotes [abstracts of three keynote presentations].

Jeff Orkin

Gillian Smith

Proceedings of the 2012 IEEE Conference on Computational Intelligence and Games, 2012

Efficient Nash equilibrium approximation through Monte Carlo counterfactual regret minimization.

Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Finding Optimal Abstract Strategies in Extensive-Form Games.

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Generalized Sampling and Variance in Counterfactual Regret Minimization.

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Investigating Contingency Awareness Using Atari 2600 Games.

Marc G. Bellemare

Joel Veness

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011

The lemonade stand game competition: solving unsolvable games.

Michael Wunder

SIGecom Exch., 2011

Variance Reduction in Monte-Carlo Tree Search.

Joel Veness

Marc Lanctot

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011

Accelerating Best Response Calculation in Large Extensive Games.

Proceedings of the IJCAI 2011, 2011

Euclidean Heuristic Optimization.

D. Chris Rayner

Nathan R. Sturtevant

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010

Efficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis.

Daniel J. Lizotte

Susan A. Murphy

Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

2009

Data Biased Robust Counter Strategies.

Michael Johanson

Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

A Practical Use of Imperfect Recall.

Proceedings of the Eighth Symposium on Abstraction, Reformulation, and Approximation, 2009

Strategy Grafting in Extensive Games.

Kevin Waugh

Nolan Bard

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009

Monte Carlo Sampling for Regret Minimization in Extensive Games.

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009

Learning a Value Analysis Tool for Agent Evaluation.

Martha White

Proceedings of the IJCAI 2009, 2009

Probabilistic State Translation in Extensive Games with Large Action Sets.

David Schnizlein

Duane Szafron

Proceedings of the IJCAI 2009, 2009

Abstraction pathologies in extensive games.

Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

A demonstration of the Polaris poker system.

John Alexander Hawkin

Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

2008

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping.

Proceedings of the UAI 2008, 2008

Multidisciplinary students and instructors: a second-year games course.

Proceedings of the 39th SIGCSE Technical Symposium on Computer Science Education, 2008

Scalable Action Respecting Embedding.

Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008

Apprenticeship learning using linear programming.

Umar Syed

Robert E. Schapire

Proceedings of the Machine Learning, 2008

Strategy evaluation in extensive games with importance sampling.

Proceedings of the Machine Learning, 2008

Autonomous geocaching: navigation and goal finding in outdoor domains.

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Sigma point policy iteration.

Alborz Geramifard

David Wingate

Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games.

Proceedings of the Fourth Artificial Intelligence and Interactive Digital Entertainment Conference, 2008

2007

Regret Minimization in Games with Incomplete Information.

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Stable Dual Dynamic Programming.

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Computing Robust Counter-Strategies.

Michael Johanson

Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Automatic Gait Optimization with Gaussian Process Regression.

Proceedings of the IJCAI 2007, 2007

A New Algorithm for Generating Equilibria in Massive Zero-Sum Games.

Neil Burch

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Particle Filtering for Dynamic Agent Modelling in Simplified Poker.

Nolan Bard

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006

Machine learning and games.

Mach. Learn., 2006

iLSTD: Eligibility Traces and Convergence Analysis.

Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Learning predictive state representations using non-blind policies.

Proceedings of the Machine Learning, 2006

Robust game play against unknown opponents.

Nathan R. Sturtevant

Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

Optimal Unbiased Estimators for Evaluating Agent Performance.

Proceedings of the Proceedings, 2006

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning.

Proceedings of the Proceedings, 2006

Prob-Maxn: Playing N-Player Games with Opponent Models.

Nathan R. Sturtevant

Proceedings of the Proceedings, 2006

Boosting Expert Ensembles for Rapid Concept Recall.

Achim Rettinger

Proceedings of the Proceedings, 2006

Bayesian Calibration for Monte Carlo Localization.

Armita Kaboli

Petr Musílek

Proceedings of the Proceedings, 2006

Incremental Least-Squares Temporal Difference Learning.

Alborz Geramifard

Richard S. Sutton

Proceedings of the Proceedings, 2006

Subjective Mapping.

Dana F. Wilkinson

Ali Ghodsi

Proceedings of the Proceedings, 2006

2005

Bayes' Bluff: Opponent Modelling in Poker.

Proceedings of the UAI '05, 2005

Online Discovery and Learning of Predictive State Representations.

Peter McCracken

Proceedings of the Advances in Neural Information Processing Systems 18, 2005

Subjective Localization with Action Respecting Embedding.

Proceedings of the Robotics Research: Results of the 12th International Symposium, 2005

Learning Subjective Representations for Planning.

Dana F. Wilkinson

Ali Ghodsi

Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Bayesian sparse sampling for on-line reward optimization.

Proceedings of the Machine Learning, 2005

Action respecting embedding.

Ali Ghodsi

Dana F. Wilkinson

Proceedings of the Machine Learning, 2005

Coordination and Adaptation in Impromptu Teams.

Peter McCracken

Proceedings of the Proceedings, 2005

2004

Existence of Multiagent Equilibria with Limited Agents.

J. Artif. Intell. Res., 2004

Convergence and No-Regret in Multiagent Learning.

Proceedings of the Advances in Neural Information Processing Systems 17, 2004

Game-Tree Search with Adaptation in Stochastic Imperfect-Information Games.

Proceedings of the Computers and Games, 4th International Conference, 2004

Plays as Effective Multiagent Plans Enabling Opponent-Adaptive Play Selection.

Brett Browning

Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS 2004), 2004

Safe Strategies for Agent Modelling in Games.

Peter McCracken

Proceedings of the Artificial Multiagent Learning, 2004

2003

Plays as Team Plans for Coordination and Adaptation.

Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

Simultaneous Adversarial Multi-Robot Learning.

Proceedings of the IJCAI-03, 2003

A Formalization of Equilibria for Multiagent Planning.

Rune Møller Jensen

Proceedings of the IJCAI-03, 2003

Multi-robot team response to a multi-robot opponent team.

Proceedings of the 2003 IEEE International Conference on Robotics and Automation, 2003

2002

Multiagent learning using a variable learning rate.

Artif. Intell., 2002

Approximation Techniques in Multiagent Learning.

Proceedings of the Abstraction, 2002

Improbability Filtering for Rejecting False Positives.

Brett Browning

Proceedings of the 2002 IEEE International Conference on Robotics and Automation, 2002

Towards robust teams with many agents.

Gal A. Kaminka

Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002

2001

CM-Dragons'01 - Vision-Based Motion Tracking and Heteregenous Robots.

Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001

Rational and Convergent Learning in Stochastic Games.

Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Convergence of Gradient Dynamics with a Variable Learning Rate.

Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

2000

CMUNITED-98: RoboCup-98 Small-Robot World Champion Team.

AI Mag., 2000

Convergence Problems of General-Sum Multiagent Reinforcement Learning.

Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

1999

CMUnited-99: Small-Size Robot Team.

Sorin Achim

Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

Motion Control in Dynamic Multi-Robot Environments.

Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

Bounding the Suboptimality of Reusing Subproblems.

Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

CMUnited-98: A Team of Robotic Soccer Agents.

Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998

The CMUnited-98 champion small-robot team.

Peter Stone

Adv. Robotics, 1998

The CMUnited-98 Small-Robot Team.
