Jalaj Bhandari

Orcid: 0000-0002-7115-8986

According to our database1, Jalaj Bhandari authored at least 10 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Global Optimality Guarantees for Policy Gradient Methods.
Oper. Res., 2024

2023
Pearl: A Production-ready Reinforcement Learning Agent.
CoRR, 2023

Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning.
Proceedings of the 17th ACM Conference on Recommender Systems, 2023

2021
A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation.
Oper. Res., 2021

On the Linear Convergence of Policy Gradient Methods for Finite MDPs.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Optimization Foundations of Reinforcement Learning.
PhD thesis, 2020

A Note on the Linear Convergence of Policy Gradient Methods.
CoRR, 2020

2017
Annular Augmentation Sampling.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
On the tightness of an LP relaxation for rational optimization and its applications.
Oper. Res. Lett., 2016

Elliptical Slice Sampling with Expectation Propagation.
Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016


  Loading...