Hongming Zhang

Orcid: 0000-0003-4905-6569

Affiliations:
  • Chinese Academy of Sciences, Center for Research on Intelligent System and Engineering, Beijing, China


According to our database1, Hongming Zhang authored at least 11 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation.
CoRR, 2024

Monte Carlo Tree Search in the Presence of Transition Uncertainty.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Build generally reusable agent-environment interaction models.
CoRR, 2022

2021
A Simple Unified Framework for Anomaly Detection in Deep Reinforcement Learning.
CoRR, 2021

Efficient Reinforcement Learning Development with RLzoo.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
RLzoo: A Comprehensive and Adaptive Reinforcement Learning Library.
CoRR, 2020

2019
Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning.
CoRR, 2019

RevCuT Tree Search Method in Complex Single-player Game with Continuous Search Space.
Proceedings of the International Joint Conference on Neural Networks, 2019

2018
A Logarithmic Barrier Method For Proximal Policy Optimization.
CoRR, 2018


  Loading...