Gugan Thoppe

ORCID: 0000-0001-5066-6589

According to our database, Gugan Thoppe authored at least 25 papers between 2012 and 2024.

Bibliography

2024
Reinforcement Learning with Quasi-Hyperbolic Discounting.
CoRR, 2024

Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries.
CoRR, 2024

Online Learning of Weakly Coupled MDP Policies for Load Balancing and Auto Scaling.
Proceedings of the IFIP Networking Conference, 2024

Risk Estimation in a Markov Cost Process: Lower and Upper Bounds.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
VaR and CVaR Estimation in a Markov Cost Process: Lower and Upper Bounds.
CoRR, 2023

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search.
CoRR, 2023

Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking.
Proceedings of the Performance Evaluation Methodologies and Tools, 2023

Does Momentum Help in Stochastic Optimization? A Sample Complexity Analysis.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Online Learning with Adversaries: A Differential-Inclusion Analysis.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

2022
Online algorithms for estimating change rates of web pages.
Perform. Evaluation, 2022

Approximate Q-learning and SARSA(0) under the ε-greedy Policy: a Differential Inclusion Analysis.
CoRR, 2022

2021
Does Momentum Help? A Sample Complexity Analysis.
CoRR, 2021

Scale Invariant Solutions for Overdetermined Linear Systems with Applications to Reinforcement Learning.
CoRR, 2021

A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Randomly Weighted d-Complexes: Minimal Spanning Acycles and Persistence Diagrams.
Electron. J. Comb., 2020

Change Rate Estimation and Optimal Freshness in Web Page Crawling.
Proceedings of the VALUETOOLS 2020: 13th EAI International Conference on Performance Evaluation Methodologies and Tools, 2020

A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2018
Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning.
Proceedings of the Conference On Learning Theory, 2018

Finite Sample Analyses for TD(0) With Function Approximation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Finite Sample Analysis for TD(0) with Linear Function Approximation.
CoRR, 2017

Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning.
CoRR, 2017

2016
A concentration result for stochastic approximation (Extended abstract).
Proceedings of the 2016 Information Theory and Applications Workshop, 2016

2014
An Improved Block Coordinate Descent Method for Quadratic Programming in High Dimensions.
CoRR, 2014

A stochastic Kaczmarz algorithm for network tomography.
Autom., 2014

2012
Generalized network tomography.
Proceedings of the 50th Annual Allerton Conference on Communication, 2012

