2024

The Update-Equivalence Framework for Decision-Time Planning.

[DOI]

Samuel Sokota

Gabriele Farina

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Abstracting Imperfect Information Away from Two-Player Zero-Sum Games.

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games.

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning.

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Human-AI Coordination via Human-Regularized Search and Learning.

[DOI]

CoRR, 2022

Modeling Strong and Human-Like Gameplay with KL-Regularized Search.

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

A Fine-Tuning Approach to Belief State Modeling.

[DOI]

Jakob Nicolaus Foerster

Noam Brown

Proceedings of the Tenth International Conference on Learning Representations, 2022

Equilibrium Finding in Normal-Form Games via Greedy Regret Minimization.

[DOI]

Hugh Zhang

Adam Lerer

Noam Brown

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Modeling Strong and Human-Like Gameplay with KL-Regularized Search.

[DOI]

CoRR, 2021

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings.

[DOI]

CoRR, 2021

Off-Belief Learning.

[DOI]

CoRR, 2021

Scalable Online Planning via Reinforcement Learning Fine-Tuning.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

No-Press Diplomacy from Scratch.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Off-Belief Learning.

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Human-Level Performance in No-Press Diplomacy via Equilibrium Search.

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Safe Search for Stackelberg Equilibria in Extensive-Form Games.

[DOI]

Chun Kai Ling

Noam Brown

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Unlocking the Potential of Deep Counterfactual Value Networks.

[DOI]

CoRR, 2020

DREAM: Deep Regret minimization with Advantage baselines and Model-free learning.

[DOI]

Eric Steinberger

Adam Lerer

Noam Brown

CoRR, 2020

Combining Deep Reinforcement Learning and Search for Imperfect-Information Games.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Improving Policies via Search in Cooperative Partially Observable Games.

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Stable-Predictive Optimistic Counterfactual Regret Minimization.

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Deep Counterfactual Regret Minimization.

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Solving Imperfect-Information Games via Discounted Regret Minimization.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Reports of the Workshops of the 32nd AAAI Conference on Artificial Intelligence.

[DOI]

AI Mag., 2018

Depth-Limited Solving for Imperfect-Information Games.

[DOI]

Noam Brown

Tuomas Sandholm

Brandon Amos

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017

Safe and Nested Subgame Solving for Imperfect-Information Games.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Libratus: The Superhuman AI for No-Limit Poker.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Reduced Space and Faster Convergence in Imperfect-Information Games via Pruning.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the 34th International Conference on Machine Learning, 2017

Safe and Nested Endgame Solving for Imperfect-Information Games.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

Reduced Space and Faster Convergence in Imperfect-Information Games via Regret-Based Pruning.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

Dynamic Thresholding and Pruning for Regret Minimization.

[DOI]

Noam Brown

Christian Kroer

Tuomas Sandholm

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Baby Tartanian8: Winning Agent from the 2016 Annual Computer Poker Competition.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Strategy-Based Warm Starting for Regret Minimization in Games.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Regret-Based Pruning in Extensive-Form Games.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Simultaneous Abstraction and Equilibrium Finding in Games.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Hierarchical Abstraction, Distributed Equilibrium Computation, and Post-Processing, with Application to a Champion No-Limit Texas Hold'em Agent.

[DOI]

Noam Brown

Sam Ganzfried

Tuomas Sandholm

Proceedings of the Computer Poker and Imperfect Information, 2015

Tartanian7: A Champion Two-Player No-Limit Texas Hold'em Poker-Playing Program.

[DOI]

Noam Brown

Sam Ganzfried

Tuomas Sandholm

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Regret Transfer and Parameter Optimization.

[DOI]

Noam Brown

Tuomas Sandholm

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014