2024

Dynamics-Adaptive Continual Reinforcement Learning via Progressive Contextualization.

Tiantian Zhang

Zichuan Lin

IEEE Trans. Neural Networks Learn. Syst., October, 2024

Optimizing Latent Goal by Learning from Trajectory Preference.

CoRR, 2024

Playable Game Generation.

CoRR, 2024

Diverse Policies Recovering via Pointwise Mutual Information Weighted Imitation Learning.

CoRR, 2024

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.

CoRR, 2024

Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination.

CoRR, 2024

More Agents Is All You Need.

CoRR, 2024

Enhance Reasoning for Large Language Models in the Game Werewolf.

CoRR, 2024

Affordable Generative Agents.

CoRR, 2024

Automatically designing counterfactual regret minimization algorithms for solving imperfect-information games.

Artif. Intell., 2024

Opponent Modeling with In-context Search.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Efficient Multi-task Reinforcement Learning with Cross-Task Policy Guidance.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent.

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Dynamic Discounted Counterfactual Regret Minimization.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Maximum Entropy Heterogeneous-Agent Reinforcement Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Towards Offline Opponent Modeling with In-context Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

RLTF: Reinforcement Learning from Unit Test Feedback.

Trans. Mach. Learn. Res., 2023

Not All Tasks Are Equally Difficult: Multi-Task Reinforcement Learning with Dynamic Depth Routing.

CoRR, 2023

Diversity from Human Feedback.

CoRR, 2023

Maximum Entropy Heterogeneous-Agent Mirror Learning.

CoRR, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.

CoRR, 2023

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization.

CoRR, 2023

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning.

CoRR, 2023

Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Policy Space Diversity for Non-Transitive Games.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Robust and Opponent-Aware League Training Method for StarCraft II.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Multi-objective Optimization-based Selection for Quality-Diversity by Non-surrounded-dominated Sorting.

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Future-conditioned Unsupervised Pretraining for Decision Transformer.

Proceedings of the International Conference on Machine Learning, 2023

Opponent-Limited Online Search for Imperfect Information Games.

Proceedings of the International Conference on Machine Learning, 2023

Quality-Similar Diversity via Population Based Reinforcement Learning.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Curriculum-based Co-design of Morphology and Control of Voxel-based Soft Robots.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

PreCo: Enhancing Generalization in Co-Design of Modular Soft Robots via Brain-Body Pre-Training.

Proceedings of the Conference on Robot Learning, 2023

Sequential Cooperative Multi-Agent Reinforcement Learning.

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

RLogist: Fast Observation Strategy on Whole-Slide Images with Deep Reinforcement Learning.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Supervised Learning Achieves Human-Level Performance in MOBA Games: A Case Study of Honor of Kings.

IEEE Trans. Neural Networks Learn. Syst., 2022

Revisiting Discrete Soft Actor-Critic.

CoRR, 2022

Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Greedy when Sure and Conservative when Uncertain about the Opponents.

Proceedings of the International Conference on Machine Learning, 2022

Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Speedup Training Artificial Intelligence for Mahjong via Reward Variance Reduction.

Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Which Heroes to Pick? Learning to Draft in MOBA Games With Neural Networks and Tree Search.

IEEE Trans. Games, 2021

MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned.

Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, 2021

Learning Diverse Policies in MOBA Games via Macro-Goals.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Combining Tree Search and Action Prediction for State-of-the-Art Performance in DouDiZhu.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Boosting Offline Reinforcement Learning with Residual Generative Modeling.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020

Towards Playing Full MOBA Games with Deep Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Mastering Complex Control in MOBA Games with Deep Reinforcement Learning.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2018

Hierarchical Macro Strategy Model for MOBA Game AI.

CoRR, 2018