Runyu Zhang

Orcid: 0000-0002-9333-3554

Affiliations:
  • Harvard University, John A. Paulson School of Engineering and Applied Sciences, MA, USA


According to our database1, Runyu Zhang authored at least 19 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Gradient Play in Stochastic Games: Stationary Points, Convergence, and Sample Complexity.
IEEE Trans. Autom. Control., October, 2024

Scalable spectral representations for network multiagent control.
CoRR, 2024

Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework.
CoRR, 2024

Multi-agent coverage control with transient behavior consideration.
Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis.
Proceedings of the American Control Conference, 2024

Scalable Reinforcement Learning for Linear-Quadratic Control of Networks.
Proceedings of the American Control Conference, 2024

2023
Regularized Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity.
CoRR, 2023

Markov Games with Decoupled Dynamics: Price of Anarchy and Sample Complexity.
CoRR, 2023

Multi-Agent Reinforcement Learning with Reward Delays.
Proceedings of the Learning for Dynamics and Control Conference, 2023

Markov Games with Decoupled Dynamics: Price of Anarchy and Sample Complexity.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

On the Optimal Control of Network LQR with Spatially-Exponential Decaying Structure.
Proceedings of the American Control Conference, 2023

2022
Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach.
IEEE Trans. Autom. Control., 2022

Multi-Agent Reinforcement Learning with Reward Delays.
CoRR, 2022

On the Effect of Log-Barrier Regularization in Decentralized Softmax Gradient Play in Multiagent Systems.
CoRR, 2022

On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Policy Optimization for Markov Games: Unified Framework and Faster Convergence.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Gradient Play in Multi-Agent Markov Stochastic Games: Stationary Points and Convergence.
CoRR, 2021

On the Regret Analysis of Online LQR Control with Predictions.
Proceedings of the 2021 American Control Conference, 2021


  Loading...