Priyank Agrawal

Orcid: 0000-0002-0644-6703

According to our database1, Priyank Agrawal authored at least 10 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms.
CoRR, 2024

Optimistic Q-learning for average reward and episodic reinforcement learning.
CoRR, 2024

Policy Gradient with Tree Search (PGTS) in Reinforcement Learning Evades Local Maxima.
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024

2023
A tractable online learning algorithm for the multinomial logit contextual bandit.
Eur. J. Oper. Res., October, 2023

2022
Learning-Augmented Mechanism Design: Leveraging Predictions for Facility Location.
Proceedings of the EC '22: The 23rd ACM Conference on Economics and Computation, Boulder, CO, USA, July 11, 2022

2021
Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Improved Optimistic Algorithm For The Multinomial Logit Contextual Bandit.
CoRR, 2020

Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Incentivising Exploration and Recommendations for Contextual Bandits with Payments.
Proceedings of the Multi-Agent Systems and Agreement Technologies, 2020

2018
Bandits with Temporal Stochastic Constraints.
CoRR, 2018


  Loading...