Michael H. Bowling

Affiliations:
  • Department of Computing Science, University of Alberta


According to our database1, Michael H. Bowling authored at least 168 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Monitored Markov Decision Processes.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Learning Not to Regret.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Temporal Abstraction in Reinforcement Learning with the Successor Representation.
J. Mach. Learn. Res., 2023

Assessing the Interpretability of Programmatic Policies with Large Language Models.
CoRR, 2023

Proper Laplacian Representation Learning.
CoRR, 2023

TacticAI: an AI assistant for football tactics.
CoRR, 2023

Rethinking Formal Models of Partially Observable Multiagent Decision Making (Extended Abstract).
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Settling the Reward Hypothesis.
Proceedings of the International Conference on Machine Learning, 2023

Targeted Search Control in AlphaZero for Effective Policy Improvement.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Policy invariant explicit shaping: an efficient alternative to reward shaping.
Neural Comput. Appl., 2022

Over-communicate no more: Situated RL agents learn concise communication protocols.
CoRR, 2022

The Alberta Plan for AI Research.
CoRR, 2022

Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration.
CoRR, 2022

Should Models Be Accurate?
CoRR, 2022

Rethinking formal models of partially observable multiagent decision making.
Artif. Intell., 2022

Approximate Exploitability: Learning a Best Response.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Learning Curricula for Humans: An Empirical Study with Puzzles from The Witness.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021
Teaching People by Justifying Tree Search Decisions: An Empirical Study in Curling.
J. Artif. Intell. Res., 2021

Player of Games.
CoRR, 2021

The Partially Observable History Process.
CoRR, 2021

Learning to Be Cautious.
CoRR, 2021

Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games.
Proceedings of the 38th International Conference on Machine Learning, 2021

Toward a Competitive Agent Framework for Magic: The Gathering.
Proceedings of the Thirty-Fourth International Florida Artificial Intelligence Research Society Conference, 2021

Sound Algorithms in Imperfect Information Games.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Solving Common-Payoff Games with Approximate Policy Iteration.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Hindsight and Sequential Rationality of Correlated Play.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Useful Policy Invariant Shaping from Arbitrary Advice.
CoRR, 2020

The Advantage Regret-Matching Actor-Critic.
CoRR, 2020

Sound Search in Imperfect Information Games.
CoRR, 2020

Hallucinating Value: A Pitfall of Dyna-style Planning with Imperfect Environment Models.
CoRR, 2020

Sample-Efficient Model-based Actor-Critic for an Interactive Dialogue Task.
CoRR, 2020

Approximate exploitability: Learning a best response in large games.
CoRR, 2020

The Hanabi challenge: A new frontier for AI research.
Artif. Intell., 2020

Marginal Utility for Planning in Continuous or Large Discrete Action Spaces.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Low-Variance and Zero-Variance Baselines for Extensive-Form Games.
Proceedings of the 37th International Conference on Machine Learning, 2020

Alternative Function Approximation Parameterizations for Solving Games: An Analysis of ƒ-Regression Counterfactual Regret Minimization.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Count-Based Exploration with the Successor Representation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Ease-of-Teaching and Language Structure from Emergent Communication.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games Using Baselines.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Solving Large Extensive-Form Games with Strategy Constraints.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents.
J. Artif. Intell. Res., 2018

Generalization and Regularization in DQN.
CoRR, 2018

The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces.
CoRR, 2018

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents (Extended Abstract).
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker.
CoRR, 2017

Heads-up limit hold'em poker is solved.
Commun. ACM, 2017

A Laplacian Framework for Option Discovery in Reinforcement Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Eqilibrium Approximation Quality of Current No-Limit Poker Bots.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning Purposeful Behaviour in the Absence of Rewards.
CoRR, 2016

The Forget-me-not Process.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Monte Carlo Tree Search in Continuous Action Spaces with Execution Uncertainty.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Action Selection for Hammer Shots in Curling.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

State of the Art Control of Atari Games Using Shallow Reinforcement Learning.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Counterfactual Regret Minimization in Sequential Security Games.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Solving Heads-Up Limit Texas Hold'em.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract).
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Online Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Variance Reduction via Antithetic Markov Chains.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Optimal Estimation of Multivariate ARMA Models.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Solving Games with Functional Regret Estimation.
Proceedings of the Computer Poker and Imperfect Information, 2015

Pairwise Relative Offset Features for Atari 2600 Games.
Proceedings of the Learning for General Competency in Video Games, 2015

Improving Exploration in UCT Using Local Manifolds.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Domain-Independent Optimistic Initialization for Reinforcement Learning.
Proceedings of the Learning for General Competency in Video Games, 2015

Policy Tree: Adaptive Representation for Policy Gradient.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Decision-Theoretic Clustering of Strategies.
Proceedings of the Computer Poker and Imperfect Information, 2015

2014
Do pokers players know how good they are? Accuracy of poker skill estimation in online and offline players.
Comput. Hum. Behav., 2014

Asymmetric abstractions for adversarial settings.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Using Response Functions to Measure Strategy Strength.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Solving Imperfect Information Games Using Decomposition.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Alignment based kernel learning with a continuous set of base kernels.
Mach. Learn., 2013

The Arcade Learning Environment: An Evaluation Platform for General Agents.
J. Artif. Intell. Res., 2013

CFR-D: Solving Imperfect Information Games Using Decomposition
CoRR, 2013

Subset Selection of Search Heuristics.
Proceedings of the IJCAI 2013, 2013

Bayesian Learning of Recursively Factored Environments.
Proceedings of the 30th International Conference on Machine Learning, 2013

A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning.
Proceedings of the 30th International Conference on Machine Learning, 2013

Partition Tree Weighting.
Proceedings of the 2013 Data Compression Conference, 2013

Evaluating state-space abstractions in extensive-form games.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Baseline: practical control variates for agent evaluation in zero-sum domains.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Online implicit agent modelling.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Rating players in games with real-valued outcomes.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013

Automating Collusion Detection in Sequential Games.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Linear fitted-Q iteration with multiple reward functions.
J. Mach. Learn. Res., 2012

No-Regret Learning in Extensive-Form Games with Imperfect Recall
CoRR, 2012

A Randomized Strategy for Learning to Combine Many Features
CoRR, 2012

Tractable Objectives for Robust Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Sketch-Based Linear Value Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

No-Regret Learning in Extensive-Form Games with Imperfect Recall.
Proceedings of the 29th International Conference on Machine Learning, 2012

On Local Regret.
Proceedings of the 29th International Conference on Machine Learning, 2012

Context Tree Switching.
Proceedings of the 2012 Data Compression Conference, Snowbird, UT, USA, April 10-12, 2012, 2012

Keynotes [abstracts of three keynote presentations].
Proceedings of the 2012 IEEE Conference on Computational Intelligence and Games, 2012

Efficient Nash equilibrium approximation through Monte Carlo counterfactual regret minimization.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

Finding Optimal Abstract Strategies in Extensive-Form Games.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Generalized Sampling and Variance in Counterfactual Regret Minimization.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Investigating Contingency Awareness Using Atari 2600 Games.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
The lemonade stand game competition: solving unsolvable games.
SIGecom Exch., 2011

Variance Reduction in Monte-Carlo Tree Search.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Accelerating Best Response Calculation in Large Extensive Games.
Proceedings of the IJCAI 2011, 2011

Euclidean Heuristic Optimization.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Efficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

2009
Data Biased Robust Counter Strategies.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

A Practical Use of Imperfect Recall.
Proceedings of the Eighth Symposium on Abstraction, Reformulation, and Approximation, 2009

Strategy Grafting in Extensive Games.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Monte Carlo Sampling for Regret Minimization in Extensive Games.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Learning a Value Analysis Tool for Agent Evaluation.
Proceedings of the IJCAI 2009, 2009

Probabilistic State Translation in Extensive Games with Large Action Sets.
Proceedings of the IJCAI 2009, 2009

Abstraction pathologies in extensive games.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

A demonstration of the Polaris poker system.
Proceedings of the 8th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2009), 2009

2008
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping.
Proceedings of the UAI 2008, 2008

Multidisciplinary students and instructors: a second-year games course.
Proceedings of the 39th SIGCSE Technical Symposium on Computer Science Education, 2008

Scalable Action Respecting Embedding.
Proceedings of the International Symposium on Artificial Intelligence and Mathematics, 2008

Apprenticeship learning using linear programming.
Proceedings of the Machine Learning, 2008

Strategy evaluation in extensive games with importance sampling.
Proceedings of the Machine Learning, 2008

Autonomous geocaching: navigation and goal finding in outdoor domains.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Sigma point policy iteration.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games.
Proceedings of the Fourth Artificial Intelligence and Interactive Digital Entertainment Conference, 2008

2007
Regret Minimization in Games with Incomplete Information.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Stable Dual Dynamic Programming.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Computing Robust Counter-Strategies.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Automatic Gait Optimization with Gaussian Process Regression.
Proceedings of the IJCAI 2007, 2007

A New Algorithm for Generating Equilibria in Massive Zero-Sum Games.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

Particle Filtering for Dynamic Agent Modelling in Simplified Poker.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Machine learning and games.
Mach. Learn., 2006

iLSTD: Eligibility Traces and Convergence Analysis.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Learning predictive state representations using non-blind policies.
Proceedings of the Machine Learning, 2006

Robust game play against unknown opponents.
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006

Optimal Unbiased Estimators for Evaluating Agent Performance.
Proceedings of the Proceedings, 2006

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning.
Proceedings of the Proceedings, 2006

Prob-Maxn: Playing N-Player Games with Opponent Models.
Proceedings of the Proceedings, 2006

Boosting Expert Ensembles for Rapid Concept Recall.
Proceedings of the Proceedings, 2006

Bayesian Calibration for Monte Carlo Localization.
Proceedings of the Proceedings, 2006

Incremental Least-Squares Temporal Difference Learning.
Proceedings of the Proceedings, 2006

Subjective Mapping.
Proceedings of the Proceedings, 2006

2005
Bayes? Bluff: Opponent Modelling in Poker.
Proceedings of the UAI '05, 2005

Online Discovery and Learning of Predictive State Representations.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Subjective Localization with Action Respecting Embedding.
Proceedings of the Robotics Research: Results of the 12th International Symposium, 2005

Learning Subjective Representations for Planning.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Bayesian sparse sampling for on-line reward optimization.
Proceedings of the Machine Learning, 2005

Action respecting embedding.
Proceedings of the Machine Learning, 2005

Coordination and Adaptation in Impromptu Teams.
Proceedings of the Proceedings, 2005

2004
Existence of Multiagent Equilibria with Limited Agents.
J. Artif. Intell. Res., 2004

Convergence and No-Regret in Multiagent Learning.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Game-Tree Search with Adaptation in Stochastic Imperfect-Information Games.
Proceedings of the Computers and Games, 4th International Conference, 2004

Plays as Effective Multiagent Plans Enabling Opponent-Adaptive Play Selection.
Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS 2004), 2004

Safe Strategies for Agent Modelling in Games.
Proceedings of the Artificial Multiagent Learning, 2004

2003
Plays as Team Plans for Coordination and Adaptation.
Proceedings of the RoboCup 2003: Robot Soccer World Cup VII, 2003

Simultaneous Adversarial Multi-Robot Learning.
Proceedings of the IJCAI-03, 2003

A Formalization of Equilibria for Multiagent Planning.
Proceedings of the IJCAI-03, 2003

Multi-robot team response to a multi-robot opponent team.
Proceedings of the 2003 IEEE International Conference on Robotics and Automation, 2003

2002
Multiagent learning using a variable learning rate.
Artif. Intell., 2002

Approximation Techniques in Multiagent Learning.
Proceedings of the Abstraction, 2002

Improbability Filtering for Rejecting False Positives.
Proceedings of the 2002 IEEE International Conference on Robotics and Automation, 2002

Towards robust teams with many agents.
Proceedings of the First International Joint Conference on Autonomous Agents & Multiagent Systems, 2002

2001
CM-Dragons'01 - Vision-Based Motion Tracking and Heteregenous Robots.
Proceedings of the RoboCup 2001: Robot Soccer World Cup V, 2001

Rational and Convergent Learning in Stochastic Games.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Convergence of Gradient Dynamics with a Variable Learning Rate.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

2000
CMUNITED-98: RoboCup-98 Small-Robot World Champion Team.
AI Mag., 2000

Convergence Problems of General-Sum Multiagent Reinforcement Learning.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

1999
CMUnited-99: Small-Size Robot Team.
Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

Motion Control in Dynamic Multi-Robot Environments.
Proceedings of the RoboCup-99: Robot Soccer World Cup III, 1999

Bounding the Suboptimality of Reusing Subproblem.
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

CMUnited-98: A Team of Robotic Soccer Agents.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998
The CMUnited-98 champion small-robot team.
Adv. Robotics, 1998

The CMUnited-98 Small-Robot Team.
Proceedings of the RoboCup-98: Robot Soccer World Cup II, 1998


  Loading...