Chao Yu

ORCID: 0000-0002-4371-3663

  • Dalian University of Technology, School of Computer Science and Technology, China
  • Sun Yat-Sen University, School of Data and Computer Science, Guangzhou, China
  • University of Wollongong, Australia (PhD 2014)

According to our database, Chao Yu authored at least 73 papers between 2011 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.



Hierarchical Multi-Agent Meta-Reinforcement Learning for Cross-Channel Bidding.
IEEE Trans. Knowl. Data Eng., March, 2025

Rapid Learning in Constrained Minimax Games with Negative Momentum.
CoRR, January, 2025

Hierarchical task network-enhanced multi-agent reinforcement learning: Toward efficient cooperative strategies.
Neural Networks, 2025

Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games.
IEEE Trans. Games, June, 2024

An Offline-Transfer-Online Framework for Cloud-Edge Collaborative Distributed Reinforcement Learning.
IEEE Trans. Parallel Distributed Syst., May, 2024

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization.
CoRR, 2024

Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines.
CoRR, 2024

An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Policy-regularized Offline Multi-objective Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Spatiotemporal Relationship Cognitive Learning for Multirobot Air Combat.
IEEE Trans. Cogn. Dev. Syst., December, 2023

Towards more efficient and robust evaluation of sepsis treatment with deep reinforcement learning.
BMC Medical Informatics Decis. Mak., December, 2023

Reinforcement Learning in Healthcare: A Survey.
ACM Comput. Surv., 2023

Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning.
CoRR, 2023

Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey.
CoRR, 2023

Causal Deep Reinforcement Learning Using Observational Data.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Multi-Agent Transfer Reinforcement Learning With Multi-View Encoder for Adaptive Traffic Signal Control.
IEEE Trans. Intell. Transp. Syst., 2022

Lifelong reinforcement learning with temporal logic formulas and reward machines.
Knowl. Based Syst., 2022

Offline reinforcement learning with representations for actions.
Inf. Sci., 2022

Plan To Predict: Learning an Uncertainty-Foreseeing Model For Model-Based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Unified Diversity Measure for Multiagent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Creativity of AI: Automatic Symbolic Option Discovery for Facilitating Deep Reinforcement Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines.
CoRR, 2021

Coordinated Proximal Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Combining Model-Based and Model-Free Reinforcement Learning Policies for More Efficient Sepsis Treatment.
Proceedings of the Bioinformatics Research and Applications - 17th International Symposium, 2021

Distributed Multiagent Coordinated Learning for Autonomous Driving in Highways Based on Dynamic Coordination Graphs.
IEEE Trans. Intell. Transp. Syst., 2020

Supervised-actor-critic reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.
BMC Medical Informatics Decis. Mak., 2020

Two-stage Automatic Image Annotation Based on Latent Semantic Scene Classification.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

D3PG: Decomposed Deep Deterministic Policy Gradient for Continuous Control.
Proceedings of the Distributed Artificial Intelligence - Second International Conference, 2020

Interactive RL via Online Human Demonstrations.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Decomposed Deep Reinforcement Learning for Robotic Control.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Inverse reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.
BMC Medical Informatics Decis. Mak., 2019

Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV.
BMC Medical Informatics Decis. Mak., 2019

Execution allowance based fixed priority scheduling for probabilistic real-time systems.
J. Syst. Softw., 2019

Reinforcement Learning in Healthcare: A Survey.
CoRR, 2019

Adversarial Examples for CNN-Based Malware Detectors.
IEEE Access, 2019

Multi-Grained Cascade AdaBoost Extreme Learning Machine for Feature Representation.
Proceedings of the International Joint Conference on Neural Networks, 2019

The Price of Governance: A Middle Ground Solution to Coordination in Organizational Control.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Deep Reinforcement Learning Framework.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Deep Inverse Reinforcement Learning for Sepsis Treatment.
Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019

Workload-Aware Harmonic Partitioned Scheduling of Periodic Real-Time Tasks with Constrained Deadlines.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019

Reinforcement Learning for Cooperative Overtaking.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Coordinated Multiagent Reinforcement Learning for Teams of Mobile Sensing Robots.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Efficient and Robust Emergence of Norms through Heuristic Collective Learning.
ACM Trans. Auton. Adapt. Syst., 2018

Saliency Detection Based on Surroundedness and Markov Model (基于被包围状态和马尔可夫模型的显著性检测).
Computer Science (计算机科学), 2018

Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning.
CoRR, 2018

A Saliency Map Fusion Method Based on Weighted DS Evidence Theory.
IEEE Access, 2018

Packet Multicast in Cognitive Radio Ad Hoc Networks: A Method Based on Random Network Coding.
IEEE Access, 2018

Adaptively Shaping Reinforcement Learning Agents via Human Reward.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

Decentralized Multiagent Reinforcement Learning for Efficient Robotic Control by Coordination Graphs.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

Fair Transmission Rate Adjustment in Cooperative Vehicle Safety Systems Based on Multi-Agent Model Predictive Control.
IEEE Trans. Veh. Technol., 2017

An Efficient and QoS Supported Multichannel MAC Protocol for Vehicular Ad Hoc Networks.
Sensors, 2017

APDM: An adaptive multi-priority distributed multichannel MAC protocol for vehicular ad hoc networks in unsaturated conditions.
Comput. Commun., 2017

Neural learning for the emergence of social norms in multiagent systems.
Proceedings of the IEEE International Conference on Agents, 2017

Dynamic Feedback Power Control for Cooperative Vehicle Safety Systems.
Wirel. Pers. Commun., 2016

Model Reference Adaptive Power Control for Cooperative Vehicle Safety Systems.
J. Inf. Sci. Eng., 2016

Fair Channel Access in Cooperative Vehicle Safety Systems.
J. Inf. Sci. Eng., 2016

Adaptive Learning for Efficient Emergence of Social Norms in Networked Multiagent Systems.
Proceedings of the PRICAI 2016: Trends in Artificial Intelligence, 2016

Accelerating Norm Emergence Through Hierarchical Heuristic Learning.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

An Adaptive Learning Framework for Efficient Emergence of Social Norms: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Emotional Multiagent Reinforcement Learning in Spatial Social Dilemmas.
IEEE Trans. Neural Networks Learn. Syst., 2015

Multiagent Learning of Coordination in Loosely Coupled Multiagent Systems.
IEEE Trans. Cybern., 2015

Hierarchical Learning for Emergence of Social Norms in Networked Multiagent Systems.
Proceedings of the AI 2015: Advances in Artificial Intelligence, 2015

A Multi-Agent Approach for Decentralized Voltage Regulation by Considering Distributed Generators.
Proceedings of the AI 2015: Advances in Artificial Intelligence, 2015

Heuristic Collective Learning for Efficient and Robust Emergence of Social Norms.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

Collective Learning for the Emergence of Social Norms in Networked Multiagent Systems.
IEEE Trans. Cybern., 2014

Coordinated learning by exploiting sparse interaction in multiagent systems.
Concurr. Comput. Pract. Exp., 2014

An Adaptive Bilateral Negotiation Model Based on Bayesian Learning.
Proceedings of the Complex Automated Negotiations: Theories, 2013

Emotional Multiagent Reinforcement Learning in Social Dilemmas.
Proceedings of the PRIMA 2013: Principles and Practice of Multi-Agent Systems, 2013

Emergence of social norms through collective learning in networked agent societies.
Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems, 2013

Exploiting Independent Relationships in Multiagent Systems for Coordinated Learning.
Proceedings of the PRICAI 2012: Trends in Artificial Intelligence, 2012

Coordinated Learning for Loosely Coupled Agents with Sparse Interactions.
Proceedings of the AI 2011: Advances in Artificial Intelligence, 2011
