Huizhen Yu
Orcid: 0000-0002-3673-0094
According to our database1,
Huizhen Yu
authored at least 34 papers
between 2001 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies.
Math. Oper. Res., 2024
CoRR, 2024
On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes.
CoRR, 2024
2023
A Note on Stability in Asynchronous Stochastic Approximation without Communication Delays.
CoRR, 2023
2022
On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs.
Math. Oper. Res., 2022
2020
Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies.
SIAM J. Control. Optim., 2020
On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs.
SIAM J. Control. Optim., 2020
Research on the Structural Impact of the Disappearance of China's Demographic Dividend on the Education Industry.
Proceedings of the ICETM 2020: 3rd International Conference on Education Technology Management, 2020
2018
J. Mach. Learn. Res., 2018
Two geometric input transformation methods for fast online reinforcement learning with neural nets.
CoRR, 2018
2017
On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning.
CoRR, 2017
2016
Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize.
J. Mach. Learn. Res., 2016
CoRR, 2016
2015
On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes.
SIAM J. Control. Optim., 2015
A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies.
Math. Oper. Res., 2015
Proceedings of The 28th Conference on Learning Theory, 2015
2013
Math. Oper. Res., 2013
Ann. Oper. Res., 2013
2012
SIAM J. Control. Optim., 2012
Math. Oper. Res., 2012
2011
SIAM J. Optim., 2011
2010
Math. Oper. Res., 2010
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010
Proceedings of the 48th Annual Allerton Conference on Communication, 2010
2009
IEEE Trans. Autom. Control., 2009
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009
2008
Math. Oper. Res., 2008
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008
2006
2005
A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies.
Proceedings of the UAI '05, 2005
2004
Proceedings of the UAI '04, 2004
2001
Proceedings of the Advances in Multimedia Information Processing, 2001