Chengdong Ma

Orcid: 0000-0002-7963-3024

According to our database, Chengdong Ma authored at least 13 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games.
CoRR, 2024

Conflux-PSRO: Effectively Leveraging Collective Advantages in Policy Space Response Oracles.
CoRR, 2024

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment.
CoRR, 2024

Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning.
CoRR, 2024

A Survey on Self-play Methods in Reinforcement Learning.
CoRR, 2024

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles.
CoRR, 2024

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects.
CoRR, 2024

Panacea: Pareto Alignment via Preference Adaptation for LLMs.
CoRR, 2024

2023
Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models.
CoRR, 2023

Confrontation and Obstacle-Avoidance of Unmanned Vehicles Based on Progressive Reinforcement Learning.
IEEE Access, 2023

2022
Fully Decentralized Model-based Policy Optimization for Networked Systems.
CoRR, 2022

Scalable Model-based Policy Optimization for Decentralized Networked Systems.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

2018
Design of a Low-Power Cold Chain Logistics Internet of Things System.
Proceedings of the Advances in Internet, 2018

