Wenjia Meng

Orcid: 0000-0002-5784-7187

According to our database1, Wenjia Meng authored at least 6 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline.
CoRR, 2024

2023
Off-Policy Proximal Policy Optimization.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., 2022

2020
Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network.
IEEE Trans. Neural Networks Learn. Syst., 2020

2018
A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
Two-Bit Networks for Deep Learning on Resource-Constrained Embedded Devices.
CoRR, 2017


  Loading...