Prashanth L. A.
Orcid: 0000-0003-0362-6730Affiliations:
- University of Maryland
- INRIA Lille - Nord Europe
- Indian Institute of Science, Department of Computer Science and Automation
According to our database1,
Prashanth L. A.
authored at least 66 papers
between 2008 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on isr.umd.edu
On csauthors.net:
Bibliography
2024
Finite Time Analysis of Temporal Difference Learning for Mean-Variance in a Discounted MDP.
CoRR, 2024
Truncated Cauchy random perturbations for smoothed functional-based stochastic optimization.
Autom., 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
2023
IEEE Trans. Autom. Control., March, 2023
CoRR, 2023
Proceedings of the Uncertainty in Artificial Intelligence, 2023
Generalized Simultaneous Perturbation Stochastic Approximation with Reduced Estimator Bias.
Proceedings of the 57th Annual Conference on Information Sciences and Systems, 2023
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023
2022
J. Mach. Learn. Res., 2022
Found. Trends Mach. Learn., 2022
A Gradient Smoothed Functional Algorithm with Truncated Cauchy Random Perturbations for Stochastic Optimization.
CoRR, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
2021
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint.
Syst. Control. Lett., 2021
Concentration bounds for temporal difference learning with linear function approximation: the case of batch data and uniform sampling.
Mach. Learn., 2021
Likelihood ratio-based policy gradient methods for distorted risk measures: A non-asymptotic analysis.
CoRR, 2021
CoRR, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
IEEE Trans. Autom. Control., 2020
Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions.
Proceedings of the 37th International Conference on Machine Learning, 2020
2019
Oper. Res. Lett., 2019
Improved Concentration Bounds for Conditional Value-at-Risk and Cumulative Prospect Theory using Wasserstein distance.
CoRR, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
2018
IEEE Trans. Autom. Control., 2018
CoRR, 2018
2017
IEEE Trans. Autom. Control., 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
A constrained optimization perspective on actor-critic algorithms and application to network routing.
Syst. Control. Lett., 2016
Mach. Learn., 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
Proceedings of the 55th IEEE Conference on Decision and Control, 2016
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016
2015
Simul., 2015
J. Optim. Theory Appl., 2015
CoRR, 2015
Adaptive system optimization using (simultaneous) random directions stochastic approximation.
CoRR, 2015
On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence.
Proceedings of the 32nd International Conference on Machine Learning, 2015
Two-Timescale Algorithms for Learning Nash Equilibria in General-Sum Stochastic Games.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015
Fast Gradient Descent for Drifting Least Squares Regression, with Application to Bandits.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
Two timescale convergent Q-learning for sleep-scheduling in wireless sensor networks.
Wirel. Networks, 2014
Fast LSTD Using Stochastic Approximation: Finite Time Analysis and Application to Traffic Control.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014
Proceedings of the Sixth International Conference on Communication Systems and Networks, 2014
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014
Proceedings of the Algorithmic Learning Theory - 25th International Conference, 2014
2013
Analysis of stochastic approximation for efficient least squares regression and LSTD.
CoRR, 2013
Online gradient descent for least squares regression: Non-asymptotic bounds and application to bandits.
CoRR, 2013
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013
2012
IEEE Trans. Veh. Technol., 2012
2011
IEEE Trans. Intell. Transp. Syst., 2011
Reinforcement learning with average cost for adaptive control of traffic lights at intersections.
Proceedings of the 14th International IEEE Conference on Intelligent Transportation Systems, 2011
Proceedings of the Service-Oriented Computing - 9th International Conference, 2011
2008
OFDM-MAC algorithms and their impact on TCP performance in next generation mobile networks.
Proceedings of the Third International Conference on COMmunication System softWAre and MiddlewaRE (COMSWARE 2008), 2008
Proceedings of the 5th IEEE Consumer Communications and Networking Conference, 2008