Muning Wen

ORCID: 0009-0000-7868-1262

According to our database, Muning Wen authored at least 20 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Safe Multiagent Learning With Soft Constrained Policy Optimization in Real Robot Control.
IEEE Trans. Ind. Informatics, September, 2024

RoMAT: Role-based multi-agent transformer for generalizable heterogeneous cooperation.
Neural Networks, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models.
CoRR, 2024

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking.
CoRR, 2024

Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation.
CoRR, 2024

P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for Optimizing LLM Training.
CoRR, 2024

Reinforcing Language Agents via Policy Optimization with Action Decomposition.
CoRR, 2024

Entropy-Regularized Token-Level Policy Optimization for Large Language Models.
CoRR, 2024

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Large sequence models for sequential decision-making: a survey.
Frontiers Comput. Sci., December, 2023

Offline Pre-trained Multi-agent Decision Transformer.
Mach. Intell. Res., April, 2023

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
J. Mach. Learn. Res., 2023

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training.
CoRR, 2023

2022
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks.
CoRR, 2021

Multi-Agent Constrained Policy Optimisation.
CoRR, 2021

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
CoRR, 2021

Settling the Variance of Multi-Agent Policy Gradients.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

