On The Sample Complexity Bounds In Bilevel Reinforcement Learning.
CoRR, March, 2025
On The Global Convergence Of Online RLHF With Neural Parametrization.
CoRR, 2024
Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization.
CoRR, 2023
On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization.
Proceedings of the International Conference on Machine Learning, 2023