Vivek S. Borkar

Orcid: 0000-0003-0756-5402

According to our database1, Vivek S. Borkar authored at least 206 papers between 1991 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Tabular and Deep Learning for the Whittle Index.
ACM Trans. Model. Perform. Evaluation Comput. Syst., September, 2024

Controlled Martingale Problems and Their Markov Mimics.
SIAM J. Control. Optim., 2024

Reinforcement learning in non-Markovian environments.
Syst. Control. Lett., 2024

Whittle Index Based User Association in Dense Millimeter Wave Networks.
CoRR, 2024

Minkowski Descent: An Algorithm for Stochastic Global Optimization.
Proceedings of the 60th Annual Allerton Conference on Communication, 2024

2023
Stochastic control and stability with applications: Ari Arapostathis memorial issue.
Syst. Control. Lett., April, 2023

Special Issue : Multi-Agent Dynamic Decision Making and Learning.
Dyn. Games Appl., March, 2023

Ergodic risk-sensitive control - A survey.
Annu. Rev. Control., January, 2023

A concentration bound for LSPE(λ).
Syst. Control. Lett., 2023

Actor-Critic or Critic-Actor? A Tale of Two Time Scales.
IEEE Control. Syst. Lett., 2023

A Concentration Bound for TD(0) with Function Approximation.
CoRR, 2023

Approximation of Convex Envelope Using Reinforcement Learning.
CoRR, 2023

Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion.
CoRR, 2023

Node Cardinality Estimation in the Internet of Things Using Privileged Feature Distillation.
CoRR, 2023

Full Gradient Deep Reinforcement Learning for Average-Reward Criterion.
Proceedings of the Learning for Dynamics and Control Conference, 2023

Online Parameter Estimation in Partially Observed Markov Decision Processes.
Proceedings of the 59th Annual Allerton Conference on Communication, 2023

2022
User Association in Dense mmWave Networks as Restless Bandits.
IEEE Trans. Veh. Technol., 2022

Opinion Shaping in Social Networks Using Reinforcement Learning.
IEEE Trans. Control. Netw. Syst., 2022

Dynamic Social Learning Under Graph Constraints.
IEEE Trans. Control. Netw. Syst., 2022

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes.
IEEE Trans. Autom. Control., 2022

QWI: Q-learning with Whittle Index.
SIGMETRICS Perform. Evaluation Rev., 2022

Revisiting SIR in the Age of COVID-19: Explicit Solutions and Control Problems.
SIAM J. Control. Optim., 2022

ERRATUM: LP Formulations of Discrete Time Long-Run Average Optimal Control Problems: The Nonergodic Case.
SIAM J. Control. Optim., 2022

Corrigendum to "A concentration bound for contractive stochastic approximation" [Syst. Control Lett. 153 (2021) 104947].
Syst. Control. Lett., 2022

Learning to cooperate in agent-based control of queueing networks.
Queueing Syst. Theory Appl., 2022

Reinforcement Learning in Non-Markovian Environments.
CoRR, 2022

Concentration bounds for SSP Q-learning for average cost MDPs.
CoRR, 2022

Whittle index based Q-learning for restless bandits with average reward.
Autom., 2022

Whittle indexability in egalitarian processor sharing systems.
Ann. Oper. Res., 2022

Scheduling in Wireless Networks using Whittle Index Theory.
Proceedings of the 27th National Conference on Communications, 2022

A Concentration Bound for Distributed Stochastic Approximation.
Proceedings of the 58th Annual Allerton Conference on Communication, 2022

2021
Prospect-theoretic Q-learning.
Syst. Control. Lett., 2021

A concentration bound for contractive stochastic approximation.
Syst. Control. Lett., 2021

Scheduling in wireless networks with spatial reuse of spectrum as restless bandits.
Perform. Evaluation, 2021

The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning.
CoRR, 2021

Concentration of Contractive Stochastic Approximation and Reinforcement Learning.
CoRR, 2021

Full Gradient DQN Reinforcement Learning: A Provably Convergent Scheme.
CoRR, 2021

Maximizing Conditional Entropy for Batch-Mode Active Learning of Perceptual Metrics.
CoRR, 2021

Accelerating MCMC by Rare Intermittent Resets.
Proceedings of the Performance Evaluation Methodologies and Tools, 2021

A Unified Batch Selection Policy for Active Metric Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

Linear and Dynamic Programs for Risk-Sensitive Cost Minimization.
Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2020
Low Complexity Online Radio Access Technology Selection Algorithm in LTE-WiFi HetNet.
IEEE Trans. Mob. Comput., 2020

A Variational Characterization of the Risk-Sensitive Average Reward for Controlled Diffusions on ℝ<sup>d</sup>.
SIAM J. Control. Optim., 2020

2019
LP Formulations of Discrete Time Long-Run Average Optimal Control Problems: The NonErgodic Case.
SIAM J. Control. Optim., 2019

Stochastic approximation algorithms for rumor source inference on graphs.
Perform. Evaluation, 2019

Non-asymptotic error bounds for constant stepsize stochastic approximation for tracking mobile agents.
Math. Control. Signals Syst., 2019

Linear Programming Formulation of Long-Run Average Optimal Control Problem.
J. Optim. Theory Appl., 2019

Aerial monitoring of slow moving convoys using elliptical orbits.
Eur. J. Control, 2019

Metastability in Stochastic Replicator Dynamics.
Dyn. Games Appl., 2019

Average cost optimal control under weak hypotheses: Relative value iterations.
CoRR, 2019

A Structure-aware Online Learning Algorithm for Markov Decision Processes.
Proceedings of the 12th EAI International Conference on Performance Evaluation Methodologies and Tools, 2019

On LP Formulations of Optimal Control Problems with Time Averaging and Time Discounting Criteria in Non-Ergodic Case.
Proceedings of the 58th IEEE Conference on Decision and Control, 2019

A learning algorithm for the Whittle index policy for scheduling web crawlers.
Proceedings of the 57th Annual Allerton Conference on Communication, 2019

2018
Opportunistic Scheduling as Restless Bandits.
IEEE Trans. Control. Netw. Syst., 2018

Whittle Index Policy for Crawling Ephemeral Content.
IEEE Trans. Control. Netw. Syst., 2018

Distributed Stochastic Approximation with Local Projections.
SIAM J. Optim., 2018

Q-learning for Markov decision processes with a satisfiability criterion.
Syst. Control. Lett., 2018

Distributed Algorithms: Tsitsiklis and Beyond.
Proceedings of the 2018 Information Theory and Applications Workshop, 2018

Concentration bounds for two time scale stochastic approximation.
Proceedings of the 56th Annual Allerton Conference on Communication, 2018

2017
Temporally Agnostic Rumor-Source Detection.
IEEE Trans. Signal Inf. Process. over Networks, 2017

Distributed Reinforcement Learning via Gossip.
IEEE Trans. Autom. Control., 2017

Whittle Index for Partially Observed Binary Markov Decision Processes.
IEEE Trans. Autom. Control., 2017

A Distributed Boyle-Dykstra-Han Scheme.
SIAM J. Optim., 2017

A Correction to "A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions".
SIAM J. Control. Optim., 2017

A Variational Formula for Risk-Sensitive Reward.
SIAM J. Control. Optim., 2017

Gradient Estimation with Simultaneous Perturbation and Compressive Sensing.
J. Mach. Learn. Res., 2017

Approachability in Stackelberg Stochastic Games with Vector Costs.
Dyn. Games Appl., 2017

Distributed Server Allocation for Content Delivery Networks.
CoRR, 2017

MCMC Approaches to Rumor Source Inference using Pairwise Information.
Proceedings of the 11th EAI International Conference on Performance Evaluation Methodologies and Tools, 2017

A dynamic programming framework for optimal home scheduling.
Proceedings of the 2017 IEEE Innovative Smart Grid Technologies - Asia, 2017

Vector field guidance for convoy monitoring using elliptical orbits.
Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017

Linear and dynamic programming approaches to degenerate risk-sensitive reward processes.
Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017

Controlling G-AIMD by index policy.
Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017

2016
Actor-Critic Algorithms with Online Feature Adaptation.
ACM Trans. Model. Comput. Simul., 2016

Fast rumor source identification via random walks.
Soc. Netw. Anal. Min., 2016

Nonlinear Gossip.
SIAM J. Control. Optim., 2016

Gaussian approximations in high dimensional estimation.
Syst. Control. Lett., 2016

An Adaptive Window Scheme for Backoff in 802.11 MAC Protocol.
EAI Endorsed Trans. Wirel. Spectr., 2016

Distributed and Asynchronous Methods for Semi-supervised Learning.
Proceedings of the Algorithms and Models for the Web Graph - 13th International Workshop, 2016

Randomized Kaczmarz for rank aggregation from pairwise comparisons.
Proceedings of the 2016 IEEE Information Theory Workshop, 2016

A concentration result for stochastic approximation (Extended abstract).
Proceedings of the 2016 Information Theory and Applications Workshop, 2016

Comparison of Random Walk Based Techniques for Estimating Network Averages.
Proceedings of the Computational Social Networks - 5th International Conference, 2016

Edge conductance estimation using MCMC.
Proceedings of the 54th Annual Allerton Conference on Communication, 2016

2015
Manufacturing Consent.
IEEE Trans. Autom. Control., 2015

Parallel and Distributed Approaches for Graph Based Semi-supervised Learning.
CoRR, 2015

A variational formula for risk-sensitive reward.
CoRR, 2015

Rocking in two by two: From Collatz-Wielandt to Donsker-Varadhan.
Proceedings of the 2015 Information Theory and Applications Workshop, 2015

2014
Risk-Constrained Markov Decision Processes.
IEEE Trans. Autom. Control., 2014

Convergence of the Relative Value Iteration for the Ergodic Control Problem of Nondegenerate Diffusions under Near-Monotone Costs.
SIAM J. Control. Optim., 2014

Asynchronous Gossip for Averaging and Spectral Ranking.
IEEE J. Sel. Top. Signal Process., 2014

An Improved Block Coordinate Descent Method for Quadratic Programming in High Dimensions.
CoRR, 2014

Empirical Q-Value Iteration.
CoRR, 2014

A Learning Scheme for Approachability in MDPs and Stackelberg Stochastic Games.
CoRR, 2014

A stochastic Kaczmarz algorithm for network tomography.
Autom., 2014

Submodularity in Team Formation Problem.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Reinforcement learning, particle filters and the EM algorithm.
Proceedings of the 2014 Information Theory and Applications Workshop, 2014

Reinforcement Learning for Matrix Computations: PageRank as an Example.
Proceedings of the Distributed Computing and Internet Technology, 2014

Blackwell's approachability in stackelberg stochastic games: A learning version.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

An actor critic algorithm based on Grassmanian search.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

From Collatz-Weilandt to Donkser-Varadhan via Krein-Rutman.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

Stabilization of a quasi-linear parabolic Cauchy problem associated with ergodic control of diffusions.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

2013
On the diffusion of messages in on-line social networks.
Perform. Evaluation, 2013

Feature Search in the Grassmanian in Online Reinforcement Learning.
IEEE J. Sel. Top. Signal Process., 2013

Markov chains, Hamiltonian cycles and volumes of convex bodies.
J. Glob. Optim., 2013

Model-based clock synchronization protocol for wireless sensor networks.
CoRR, 2013

How to gossip if you must.
CoRR, 2013

Gossip with transmission constraints.
Proceedings of the 2013 Information Theory and Applications Workshop, 2013

2012
A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions.
SIAM J. Control. Optim., 2012

Stochastic approximation with long range dependent and heavy tailed noise.
Queueing Syst. Theory Appl., 2012

Relative Value Iteration for Stochastic Differential Games
CoRR, 2012

Oja's algorithm for graph clustering, Markov spectral decomposition, and risk sensitive control.
Autom., 2012

A relative value iteration for controlled diffusions under ergodic cost.
Proceedings of the 51th IEEE Conference on Decision and Control, 2012

Who is the fairest of them all?
Proceedings of the 50th Annual Allerton Conference on Communication, 2012

2011
Optimal Control of Sleep Periods for Wireless Terminals.
IEEE J. Sel. Areas Commun., 2011

Stochastic approximation with 'bad' noise.
Proceedings of the Information Theory and Applications Workshop, 2011

Risk sensitive optimal control framework applied to delay tolerant networks.
Proceedings of the INFOCOM 2011. 30th IEEE International Conference on Computer Communications, 2011

Convergence analysis for an online recommendation system.
Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011

Asymptotics of the invariant measure in mean field models with jumps.
Proceedings of the 49th Annual Allerton Conference on Communication, 2011

Controlled gossip.
Proceedings of the 49th Annual Allerton Conference on Communication, 2011

2010
A Stable Online Algorithm for Energy-Efficient Multiuser Scheduling.
IEEE Trans. Mob. Comput., 2010

Singular Perturbations in Risk-Sensitive Stochastic Control.
SIAM J. Control. Optim., 2010

ERRATUM: White-Noise Representations in Stochastic Realization Theory.
SIAM J. Control. Optim., 2010

Uniform Recurrence Properties of Controlled Diffusions and Applications to Optimal Control.
SIAM J. Control. Optim., 2010

On a controlled eigenvalue problem.
Syst. Control. Lett., 2010

Application of nonlinear filtering to credit risk.
Oper. Res. Lett., 2010

Quasi-stationary distributions as centrality measures for the giant strongly connected component of a reducible graph.
J. Comput. Appl. Math., 2010

Manufacturing consent.
Proceedings of the 48th Annual Allerton Conference on Communication, 2010

2009
A novel ACO algorithm for optimization via reinforcement and initial bias.
Swarm Intell., 2009

Cooperative dynamics and Wardrop equilibria.
Syst. Control. Lett., 2009

On the Hamiltonicity Gap and doubly stochastic matrices.
Random Struct. Algorithms, 2009

A New Learning Algorithm for Optimal Stopping.
Discret. Event Dyn. Syst., 2009

A Stable On-line Algorithm for Energy Efficient Multi-user Scheduling
CoRR, 2009

Finite dimensional approximation and Newton-based algorithm for stochastic approximation in Hilbert space.
Autom., 2009

Vacation policy optimization with application to IEEE 802.16e power saving mechanism.
Proceedings of the 2nd IFIP Wireless Days, 2009

A Theory of QoS for Wireless.
Proceedings of the INFOCOM 2009. 28th IEEE International Conference on Computer Communications, 2009

Convergence of population dynamics in symmetric routing games with a finite number of playersz.
Proceedings of the 1st International Conference on Game Theory for Networks, 2009

A model-based approach to clock synchronization.
Proceedings of the 48th IEEE Conference on Decision and Control, 2009

Optimal sampling for state change detection with application to the control of sleep mode.
Proceedings of the 48th IEEE Conference on Decision and Control, 2009

2008
Distributed topology control of wireless networks.
Wirel. Networks, 2008

Structural Properties of Optimal Transmission Policies Over a Randomly Varying Channel.
IEEE Trans. Autom. Control., 2008

Stochastic Control with Imperfect Models.
SIAM J. Control. Optim., 2008

A note on linear function approximation using random projections.
Syst. Control. Lett., 2008

A Learning Algorithm for Risk-Sensitive Cost.
Math. Oper. Res., 2008

An on-line learning algorithm for energy efficient delay constrained scheduling over a fading channel.
IEEE J. Sel. Areas Commun., 2008

Oja's algorithm for graph clustering and Markov spectral decomposition.
Proceedings of the 3rd International ICST Conference on Performance Evaluation Methodologies and Tools, 2008

Opportunistic Transmission over Randomly Varying Channels.
Proceedings of the Network Control and Optimization, Second Euro-NF Workshop, 2008

Index Policies for Real-Time Multicast Scheduling for Wireless Broadcast Systems.
Proceedings of the INFOCOM 2008. 27th IEEE International Conference on Computer Communications, 2008

A learning scheme for stationary probabilities of large markov chains with examples.
Proceedings of the 46th Annual Allerton Conference on Communication, 2008

2007
Singular Perturbations in Ergodic Control of Diffusions.
SIAM J. Control. Optim., 2007

Dynamic Programming for Ergodic Control of Markov Chains under Partial Observations: A Correction.
SIAM J. Control. Optim., 2007

Common randomness and distributed control: A counterexample.
Syst. Control. Lett., 2007

Power Efficient Scheduling under Delay Constraints over Multi-user Wireless Channels
CoRR, 2007

Quasi-stationary distributions as centrality measures of reducible graphs
CoRR, 2007

Some Examples of Stochastic Approximation in Communications.
Proceedings of the Network Control and Optimization, 2007

An Information Theoretic View of Stochastic Resonance.
Proceedings of the IEEE International Symposium on Information Theory, 2007

2006
Distributed self-tuning of sensor networks.
Wirel. Networks, 2006

Erratum to "Avoidance of traps in stochastic approximation" [Systems Control Letters 50 (2003) 1-9].
Syst. Control. Lett., 2006

Stochastic approximation with 'controlled Markov' noise.
Syst. Control. Lett., 2006

A Simulation-Based Algorithm for Ergodic Control of Markov Chains Conditioned on Rare Events.
J. Mach. Learn. Res., 2006

Adaptive Importance Sampling Technique for Markov Chains Using Stochastic Approximation.
Oper. Res., 2006

Power Optimal Opportunistic Scheduling.
Proceedings of the Global Telecommunications Conference, 2006. GLOBECOM '06, San Francisco, CA, USA, 27 November, 2006

A New Distributed Time Synchronization Protocol for Multihop Wireless Networks.
Proceedings of the 45th IEEE Conference on Decision and Control, 2006

Existence of limit occupational measures set used for averaging of singularly perturbed controlled stochastic differential equations.
Proceedings of the 45th IEEE Conference on Decision and Control, 2006

2005
On Existence of Limit Occupational Measures Set of a Controlled Stochastic Differential Equation.
SIAM J. Control. Optim., 2005

An actor-critic algorithm for constrained Markov decision processes.
Syst. Control. Lett., 2005

Closed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device.
Proceedings of the 3rd International Symposium on Modeling and Optimization in Mobile, 2005

Sequential Source Coding: An optimization viewpoint.
Proceedings of the 44th IEEE IEEE Conference on Decision and Control and 8th European Control Conference Control, 2005

HJB Equations for Ergodic Control Problems for Constrained Diffusions in Polyhedral Domains.
Proceedings of the 44th IEEE IEEE Conference on Decision and Control and 8th European Control Conference Control, 2005

2004
Ergodic Control for Constrained Diffusions: Characterization Using HJB Equations.
SIAM J. Control. Optim., 2004

Directed graphs, Hamiltonicity and doubly stochastic matrices.
Random Struct. Algorithms, 2004

Charge-based control of DiffServ-like queues.
Autom., 2004

Optimal random access in networks with two-way traffic.
Proceedings of the IEEE 15th International Symposium on Personal, 2004

Some Game-Theoretic Problems in Wireless Ad-Hoc Networks.
Proceedings of the Wireless Systems and Mobility in Next Generation Internet, 2004

2003
Dynamic Cesaro-Wardrop equilibration in networks.
IEEE Trans. Autom. Control., 2003

Multiscale Chaotic SPSA and Smoothed Functional Algorithms for Simulation Optimization.
Simul., 2003

Avoidance of traps in stochastic approximation.
Syst. Control. Lett., 2003

Peformance Analysis Conditioned on Rare Events: An Adaptive Simulation Scheme.
Commun. Inf. Syst., 2003

Algorithmic Mechanisms for Secure Multi-Auction Systems.
Proceedings of the 1st Indian International Conference on Artificial Intelligence, 2003

Value functions and performance evaluation in stochastic network models.
Proceedings of the 42nd IEEE Conference on Decision and Control, 2003

2002
Stochastic Approximation for Nonexpansive Maps: Application to Q-Learning Algorithms.
SIAM J. Control. Optim., 2002

Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost.
Math. Oper. Res., 2002

Q-Learning for Risk-Sensitive Control.
Math. Oper. Res., 2002

Mathematical Programming Embeddings of Logic.
J. Autom. Reason., 2002

On The Lock-In Probability Of Stochastic Approximation.
Comb. Probab. Comput., 2002

Reinforcement Learning in Markovian Evolutionary Games.
Adv. Complex Syst., 2002

Differential Join Prices for Parallel Queues: Social Optimality, Dynamic Pricing Algorithms and Application to Internet Pricing.
Proceedings of the Proceedings IEEE INFOCOM 2002, 2002

2001
Optimal Sequential Vector Quantization of Markov Sources.
SIAM J. Control. Optim., 2001

Learning Algorithms for Markov Decision Processes with Average Cost.
SIAM J. Control. Optim., 2001

A sensitivity formula for risk-sensitive cost and the actor-critic algorithm.
Syst. Control. Lett., 2001

Markov control problems under communication constraints.
Commun. Inf. Syst., 2001

2000
The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning.
SIAM J. Control. Optim., 2000

Erratum: Asynchronous Stochastic Approximations.
SIAM J. Control. Optim., 2000

Average Cost Dynamic Programming Equations For Controlled Markov Chains With Partial Observations.
SIAM J. Control. Optim., 2000

A learning algorithm for Markov decision processes with adaptive state aggregation.
Proceedings of the 39th IEEE Conference on Decision and Control, 2000

1999
Randomized neural networks for learning stochastic dependences.
IEEE Trans. Syst. Man Cybern. Part B, 1999

Actor-Critic - Type Learning Algorithms for Markov Decision Processes.
SIAM J. Control. Optim., 1999

1998
A unified framework for hybrid control: model and optimal control theory.
IEEE Trans. Autom. Control., 1998

Dynamics of individual specialization and global diversification in communities.
Complex., 1998

Generalized Replicator Dynamics as a Model of Specialization and Diversity in Societies.
Adv. Complex Syst., 1998

1997
Learning Decentralized Goal-based Vector Quantization.
Complex Syst., 1997

1996
Stochastic processes that generate polygonal and related random fields.
IEEE Trans. Inf. Theory, 1996

1995
A Convex Analytic Framework for Ergodic Control of Semi-Markov Processes.
Math. Oper. Res., 1995

Epsilon-Approximation of Differential Inclusions.
Proceedings of the Hybrid Systems III: Verification and Control, 1995

1992
Ergodic and adaptive control of nearest-neighbor motions.
Math. Control. Signals Syst., 1992

Weak convergence methods and singularly perturbed stochastic control and filtering problems: Harold J. Kushner.
Autom., 1992

1991
A remark on control of partially observed Markov chains.
Ann. Oper. Res., 1991


  Loading...