2024
The Update-Equivalence Framework for Decision-Time Planning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Abstracting Imperfect Information Away from Two-Player Zero-Sum Games.
Proceedings of the International Conference on Machine Learning, 2023
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
Human-AI Coordination via Human-Regularized Search and Learning.
CoRR, 2022
Modeling Strong and Human-Like Gameplay with KL-Regularized Search.
Proceedings of the International Conference on Machine Learning, 2022
A Fine-Tuning Approach to Belief State Modeling.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Equilibrium Finding in Normal-Form Games via Greedy Regret Minimization.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Modeling Strong and Human-Like Gameplay with KL-Regularized Search.
CoRR, 2021
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings.
CoRR, 2021
Scalable Online Planning via Reinforcement Learning Fine-Tuning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
No-Press Diplomacy from Scratch.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Human-Level Performance in No-Press Diplomacy via Equilibrium Search.
Proceedings of the 9th International Conference on Learning Representations, 2021
Safe Search for Stackelberg Equilibria in Extensive-Form Games.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Unlocking the Potential of Deep Counterfactual Value Networks.
CoRR, 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning.
CoRR, 2020
Combining Deep Reinforcement Learning and Search for Imperfect-Information Games.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Improving Policies via Search in Cooperative Partially Observable Games.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Stable-Predictive Optimistic Counterfactual Regret Minimization.
Proceedings of the 36th International Conference on Machine Learning, 2019
Deep Counterfactual Regret Minimization.
Proceedings of the 36th International Conference on Machine Learning, 2019
Solving Imperfect-Information Games via Discounted Regret Minimization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Reports of the Workshops of the 32nd AAAI Conference on Artificial Intelligence.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
AI Mag., 2018
Depth-Limited Solving for Imperfect-Information Games.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
2017
Safe and Nested Subgame Solving for Imperfect-Information Games.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Libratus: The Superhuman AI for No-Limit Poker.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017
Reduced Space and Faster Convergence in Imperfect-Information Games via Pruning.
Proceedings of the 34th International Conference on Machine Learning, 2017
Safe and Nested Endgame Solving for Imperfect-Information Games.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017
Reduced Space and Faster Convergence in Imperfect-Information Games via Regret-Based Pruning.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017
Dynamic Thresholding and Pruning for Regret Minimization.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Baby Tartanian8: Winning Agent from the 2016 Annual Computer Poker Competition.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Strategy-Based Warm Starting for Regret Minimization in Games.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Regret-Based Pruning in Extensive-Form Games.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Simultaneous Abstraction and Equilibrium Finding in Games.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
Hierarchical Abstraction, Distributed Equilibrium Computation, and Post-Processing, with Application to a Champion No-Limit Texas Hold'em Agent.
Proceedings of the Computer Poker and Imperfect Information, 2015
Tartanian7: A Champion Two-Player No-Limit Texas Hold'em Poker-Playing Program.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
Regret Transfer and Parameter Optimization.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014