Hongming Zhang

Orcid: 0000-0003-4905-6569

Affiliations:

Chinese Academy of Sciences, Center for Research on Intelligent System and Engineering, Beijing, China

According to our database¹, Hongming Zhang authored at least 13 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

β-DQN: Improving Deep Q-Learning By Evolving the Behavior.

[BibT_eX]

[DOI]

CoRR, January, 2025

2024

Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation.

[BibT_eX]

[DOI]

CoRR, 2024

Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Monte Carlo Tree Search in the Presence of Transition Uncertainty.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Build generally reusable agent-environment interaction models.

[BibT_eX]

[DOI]

Jun Jin

Hongming Zhang

Jun Luo

CoRR, 2022

2021

A Simple Unified Framework for Anomaly Detection in Deep Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Efficient Reinforcement Learning Development with RLzoo.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020

RLzoo: A Comprehensive and Adaptive Reinforcement Learning Library.

[BibT_eX]

[DOI]

CoRR, 2020

2019

Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2019

RevCuT Tree Search Method in Complex Single-player Game with Continuous Search Space.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2019

2018

A Logarithmic Barrier Method For Proximal Policy Optimization.

[BibT_eX]

[DOI]

Cheng Zeng

Hongming Zhang

CoRR, 2018

Hongming Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...