2025

On the Sample Complexity Bounds in Bilevel Reinforcement Learning.

CoRR, March 2025

2024

On the Global Convergence of Online RLHF with Neural Parametrization.

CoRR, 2024

Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization.

CoRR, 2023

On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization.

Proceedings of the International Conference on Machine Learning, 2023