Yaodong Yang

2023

Trans. Mach. Learn. Res., 2023

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.

J. Mach. Learn. Res., 2023

TorchOpt: An Efficient Library for Differentiable Optimization.

J. Mach. Learn. Res., 2023

MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library.

J. Mach. Learn. Res., 2023

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models.

CoRR, 2023

AI Alignment: A Comprehensive Survey.

CoRR, 2023

Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark.

CoRR, 2023

Masked Pretraining for Multi-Agent Decision Making.

CoRR, 2023

MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization.

CoRR, 2023

Measuring Value Understanding in Language Models through Discriminator-Critique Gap.

CoRR, 2023

Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models.

CoRR, 2023

Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators.

CoRR, 2023

ProAgent: Building Proactive Cooperative AI with Large Language Models.

CoRR, 2023

Safe DreamerV3: Safe Reinforcement Learning with World Models.

CoRR, 2023

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset.

CoRR, 2023

Maximum Entropy Heterogeneous-Agent Mirror Learning.

CoRR, 2023

Deep Reinforcement Learning with Multitask Episodic Memory Based on Task-Conditioned Hypernetwork.

CoRR, 2023

Heterogeneous Value Evaluation for Large Language Models.

CoRR, 2023

Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game.

CoRR, 2023

OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research.

CoRR, 2023

Heterogeneous-Agent Reinforcement Learning.

CoRR, 2023

STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning.

CoRR, 2023

A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors.

CoRR, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems.

CoRR, 2023

MSRL: Distributed Reinforcement Learning with Dataflow Fragments.

Proceedings of the 2023 USENIX Annual Technical Conference, 2023

Multi-Agent First Order Constrained Optimization in Policy Space.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Policy Space Diversity for Non-Transitive Games.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hierarchical Multi-Agent Skill Discovery.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GenDexGrasp: Generalizable Dexterous Grasping.

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

RLAfford: End-to-End Affordance Learning for Robotic Manipulation.

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models.

Proceedings of the International Conference on Machine Learning, 2023

Regret-Minimizing Double Oracle for Extensive-Form Games.

Xiaohang Tang

Le Cong Dinh

Proceedings of the International Conference on Machine Learning, 2023

A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems.

Oliver Slumbers

David Henry Mguni

Stefano B. Blumberg

Proceedings of the International Conference on Machine Learning, 2023

MANSA: Learning Fast and Slow in Multi-Agent Systems.

Feifei Tong

Proceedings of the International Conference on Machine Learning, 2023

Quality-Similar Diversity via Population Based Reinforcement Learning.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning.

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Dynamic Handover: Throw and Catch with Bimanual Hands.

Proceedings of the Conference on Robot Learning, 2023

Is Nash Equilibrium Approximator Learnable?

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Learning to Shape Rewards Using a Game of Two Partners.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Online Double Oracle.

Le Cong Dinh

Trans. Mach. Learn. Res., 2022

Illiquidity Comovement and Market Crisis.

J. Syst. Sci. Complex., 2022

Contextual Transformer for Offline Meta Reinforcement Learning.

CoRR, 2022

MARLlib: Extending RLlib for Multi-agent Reinforcement Learning.

CoRR, 2022

End-to-End Affordance Learning for Robotic Manipulation.

CoRR, 2022

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL.

CoRR, 2022

Fully Decentralized Model-based Policy Optimization for Networked Systems.

CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.

Hao Dong

Zongqing Lu

Song-Chun Zhu

CoRR, 2022

Learning Risk-Averse Equilibria in Multi-Agent Systems.

CoRR, 2022

A Review of Safe Reinforcement Learning: Methods, Theory and Applications.

CoRR, 2022

Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning.

Zehao Dou

Jakub Grudzien Kuba

CoRR, 2022

Settling the Communication Complexity for Distributed Offline Reinforcement Learning.

Juliusz Krysztof Ziomek

CoRR, 2022

Efficient Policy Space Response Oracles.

CoRR, 2022

Measuring the Non-Transitivity in Chess.

Ricky Sanjaya

Algorithms, 2022

Debias the Black-Box: A Fair Ranking Framework via Knowledge Distillation.

Proceedings of the Web Information Systems Engineering - WISE 2022, 2022

Constrained Update Projection Approach to Safe Policy Optimization.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scalable Model-based Policy Optimization for Decentralized Networked Systems.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

On the Convergence of Fictitious Play: A Decomposition Approach.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning.

Proceedings of the Tenth International Conference on Learning Representations, 2022

A Game-Theoretic Approach to Multi-agent Trust Region Optimization.

Proceedings of the Distributed Artificial Intelligence - 4th International Conference, 2022

2021

Many-agent reinforcement learning.

PhD thesis, 2021

On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games.

Electron. Colloquium Comput. Complex., 2021

Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning.

CoRR, 2021

Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks.

CoRR, 2021

A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers.

CoRR, 2021

DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention.

CoRR, 2021

Multi-Agent Constrained Policy Optimisation.

CoRR, 2021

Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics.

CoRR, 2021

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.

CoRR, 2021

Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games.

CoRR, 2021

Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games.

CoRR, 2021

Learning to Shape Rewards using a Game of Switching Controls.

CoRR, 2021

Modelling Behavioural Diversity for Learning in Open-Ended Games.

CoRR, 2021

Online Double Oracle.

CoRR, 2021

Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Settling the Variance of Multi-Agent Policy Gradients.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Auto-Curricula in Two-Player Zero-Sum Games.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MyoChallenge 2022: Learning contact-rich manipulation using a musculoskeletal hand.

Proceedings of the NeurIPS 2022 Competition Track, 2021

Modelling Behavioural Diversity for Learning in Open-Ended Games.

Proceedings of the 38th International Conference on Machine Learning, 2021

Learning in Nonzero-Sum Stochastic Games with Potentials.

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Order Execution Probability and Order Queue in Limit Order Markets.

J. Syst. Sci. Complex., 2020

Can deep learning predict risky retail investors? A case study in financial risk behavior forecasting.

Johnnie E. V. Johnson

Eur. J. Oper. Res., 2020

An Overview of Multi-Agent Reinforcement Learning from Game Theoretical Perspective.

CoRR, 2020

Replica-Exchange Nosé-Hoover Dynamics for Bayesian Learning on Large Datasets.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning.

Ying Wen

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Multi-Agent Determinantal Q-Learning.

Proceedings of the 37th International Conference on Machine Learning, 2020

Learning to Infer User Hidden States for Online Sequential Advertising.

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

α^α-Rank: Practically Scaling α-Rank through Stochastic Optimisation.

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Sequential Advertising Agent with Interpretable User Hidden Intents.

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Bi-Level Actor-Critic for Multi-Agent Coordination.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning.

CoRR, 2019

Multi-Agent Generalized Recursive Reasoning.

CoRR, 2019

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning.

Proceedings of the World Wide Web Conference, 2019

Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning.

Proceedings of the 7th International Conference on Learning Representations, 2019

Adversarial Variational Bayes Methods for Tweedie Compound Poisson Mixed Models.
