Stephen McAleer

Orcid: 0000-0003-0118-6874

According to our database1, Stephen McAleer authored at least 62 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
ASP: Learn a Universal Neural Solver!
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games.
CoRR, 2024

Tree Search for Language Model Agents.
CoRR, 2024

AgentKit: Flow Engineering with Graphs, not Coding.
CoRR, 2024

Faster Game Solving via Hyperparameter Schedules.
CoRR, 2024

Steering No-Regret Learners to a Desired Equilibrium.
Proceedings of the 25th ACM Conference on Economics and Computation, 2024

Policy Space Response Oracles: A Survey.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Scalable Mechanism Design for Multi-Agent Path Finding.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Confronting Reward Model Overoptimization with Constrained RLHF.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Illusory Attacks: Information-theoretic detectability matters in adversarial attacks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Llemma: An Open Language Model for Mathematics.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Grasper: A Generalist Pursuer for Pursuit-Evasion Problems.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Automated Design of Affine Maximizer Mechanisms in Dynamic Settings.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games.
Trans. Mach. Learn. Res., 2023

AI Alignment: A Comprehensive Survey.
CoRR, 2023

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations.
CoRR, 2023

Steering No-Regret Learners to Optimal Equilibria.
CoRR, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems.
CoRR, 2023

Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning.
CoRR, 2023

Algorithms and Complexity for Computing Nash Equilibria in Adversarial Team Games.
Proceedings of the 24th ACM Conference on Economics and Computation, 2023

Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Policy Space Diversity for Non-Transitive Games.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Language Models can Solve Computer Tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Regret-Minimizing Double Oracle for Extensive-Form Games.
Proceedings of the International Conference on Machine Learning, 2023

A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems.
Proceedings of the International Conference on Machine Learning, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems.
Proceedings of the International Conference on Machine Learning, 2023

ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Sequential Decision Making in Single-Agent and Multi-Agent Domains
PhD thesis, 2022

Online Double Oracle.
Trans. Mach. Learn. Res., 2022

Game Theoretic Rating in N-player general-sum games with Equilibria.
CoRR, 2022

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments.
CoRR, 2022

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games.
CoRR, 2022

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning.
CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
CoRR, 2022

Learning Risk-Averse Equilibria in Multi-Agent Systems.
CoRR, 2022

Anytime PSRO for Two-Player Zero-Sum Games.
CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks.
Proceedings of the International Conference on Machine Learning, 2022

Proving Theorems using Incremental Learning and Hindsight Experience Replay.
Proceedings of the International Conference on Machine Learning, 2022

Independent Natural Policy Gradient always converges in Markov Potential Games.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Target Entropy Annealing for Discrete Soft Actor-Critic.
CoRR, 2021

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates.
CoRR, 2021

Improving Social Welfare While Preserving Autonomy via a Pareto Mediator.
CoRR, 2021

Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games.
CoRR, 2021

XDO: A Double Oracle Algorithm for Extensive-Form Games.
CoRR, 2021

A* Search Without Expansions: Learning Heuristic Functions with Deep Q-Networks.
CoRR, 2021

XDO: A Double Oracle Algorithm for Extensive-Form Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Auto-Curricula in Two-Player Zero-Sum Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Deep machine learning-assisted multiphoton microscopy to reduce light exposure and expedite imaging.
CoRR, 2020

Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Highly Accurate Machine Fault Diagnosis Using Deep Transfer Learning.
IEEE Trans. Ind. Informatics, 2019

Solving the Rubik's cube with deep reinforcement learning and search.
Nat. Mach. Intell., 2019

ColosseumRL: A Framework for Multiagent Reinforcement Learning in N-Player Games.
CoRR, 2019

Curiosity-Driven Multi-Criteria Hindsight Experience Replay.
CoRR, 2019

Solving the Rubik's Cube with Approximate Policy Iteration.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Solving the Rubik's Cube Without Human Knowledge.
CoRR, 2018


  Loading...