Vivek S. Borkar

Urtzi Ayesta

ACM Trans. Model. Perform. Evaluation Comput. Syst., September, 2024

Controlled Martingale Problems and Their Markov Mimics.

[BibT_eX]

[DOI]

Siva Athreya

Nitya Gadhiwala

SIAM J. Control. Optim., 2024

Reinforcement learning in non-Markovian environments.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2024

Whittle Index Based User Association in Dense Millimeter Wave Networks.

[BibT_eX]

[DOI]

Mandar R. Nalavade

CoRR, 2024

Minkowski Descent: An Algorithm for Stochastic Global Optimization.

[BibT_eX]

[DOI]

Keshav Patel Keval

Vivek Shripad Borkar

Ananya Singhal

Proceedings of the 60th Annual Allerton Conference on Communication, 2024

2023

Stochastic control and stability with applications: Ari Arapostathis memorial issue.

[BibT_eX]

[DOI]

Syst. Control. Lett., April, 2023

Special Issue : Multi-Agent Dynamic Decision Making and Learning.

[BibT_eX]

[DOI]

U. Jayakrishnan Nair

Dyn. Games Appl., March, 2023

Ergodic risk-sensitive control - A survey.

[BibT_eX]

[DOI]

Anup Biswas

Annu. Rev. Control., January, 2023

A concentration bound for LSPE(λ).

[BibT_eX]

[DOI]

Harsh Dolhare

Syst. Control. Lett., 2023

Actor-Critic or Critic-Actor? A Tale of Two Time Scales.

[BibT_eX]

[DOI]

Soumyajit Guin

IEEE Control. Syst. Lett., 2023

A Concentration Bound for TD(0) with Function Approximation.

[BibT_eX]

[DOI]

CoRR, 2023

Approximation of Convex Envelope Using Reinforcement Learning.

[BibT_eX]

[DOI]

Adit Akarsh

CoRR, 2023

Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion.

[BibT_eX]

[DOI]

Keshav P. Keval

CoRR, 2023

Node Cardinality Estimation in the Internet of Things Using Privileged Feature Distillation.

[BibT_eX]

[DOI]

CoRR, 2023

Full Gradient Deep Reinforcement Learning for Average-Reward Criterion.

[BibT_eX]

[DOI]

Tejas Pagare

Proceedings of the Learning for Dynamics and Control Conference, 2023

Online Parameter Estimation in Partially Observed Markov Decision Processes.

[BibT_eX]

[DOI]

Sai Sumedh R. Hindupur

Proceedings of the 59th Annual Allerton Conference on Communication, 2023

2022

User Association in Dense mmWave Networks as Restless Bandits.

[BibT_eX]

[DOI]

Santosh Kumar Singh

IEEE Trans. Veh. Technol., 2022

Opinion Shaping in Social Networks Using Reinforcement Learning.

[BibT_eX]

[DOI]

Alexandre Reiffers-Masson

IEEE Trans. Control. Netw. Syst., 2022

Dynamic Social Learning Under Graph Constraints.

[BibT_eX]

[DOI]

Sharayu Moharir

Suhail Mohmad Shah

IEEE Trans. Control. Netw. Syst., 2022

Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes.

[BibT_eX]

[DOI]

IEEE Trans. Autom. Control., 2022

QWI: Q-learning with Whittle Index.

[BibT_eX]

[DOI]

Francisco Robledo

Urtzi Ayesta

SIGMETRICS Perform. Evaluation Rev., 2022

Revisiting SIR in the Age of COVID-19: Explicit Solutions and Control Problems.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2022

ERRATUM: LP Formulations of Discrete Time Long-Run Average Optimal Control Problems: The Nonergodic Case.

[BibT_eX]

[DOI]

Ilya Shvartsman

SIAM J. Control. Optim., 2022

Corrigendum to "A concentration bound for contractive stochastic approximation" [Syst. Control Lett. 153 (2021) 104947].

[BibT_eX]

[DOI]

Syst. Control. Lett., 2022

Learning to cooperate in agent-based control of queueing networks.

[BibT_eX]

[DOI]

Queueing Syst. Theory Appl., 2022

Reinforcement Learning in Non-Markovian Environments.

[BibT_eX]

[DOI]

Parth Dodhia

CoRR, 2022

Concentration bounds for SSP Q-learning for average cost MDPs.

[BibT_eX]

[DOI]

Shaan Ul Haque

CoRR, 2022

Whittle index based Q-learning for restless bandits with average reward.

[BibT_eX]

[DOI]

Autom., 2022

Whittle indexability in egalitarian processor sharing systems.

[BibT_eX]

[DOI]

Ann. Oper. Res., 2022

Scheduling in Wireless Networks using Whittle Index Theory.

[BibT_eX]

[DOI]

Karthik G. V. B.

Proceedings of the 27th National Conference on Communications, 2022

A Concentration Bound for Distributed Stochastic Approximation.

[BibT_eX]

[DOI]

Harsh Dolhare

Proceedings of the 58th Annual Allerton Conference on Communication, 2022

2021

Prospect-theoretic Q-learning.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2021

A concentration bound for contractive stochastic approximation.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2021

Scheduling in wireless networks with spatial reuse of spectrum as restless bandits.

[BibT_eX]

[DOI]

Perform. Evaluation, 2021

The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Concentration of Contractive Stochastic Approximation and Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Full Gradient DQN Reinforcement Learning: A Provably Convergent Scheme.

[BibT_eX]

[DOI]

H. P. Dolhare

K. Patil

CoRR, 2021

Maximizing Conditional Entropy for Batch-Mode Active Learning of Perceptual Metrics.

[BibT_eX]

[DOI]

CoRR, 2021

Accelerating MCMC by Rare Intermittent Resets.

[BibT_eX]

[DOI]

Syomantak Chaudhuri

Proceedings of the Performance Evaluation Methodologies and Tools, 2021

A Unified Batch Selection Policy for Active Metric Learning.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2021

Linear and Dynamic Programs for Risk-Sensitive Cost Minimization.

[BibT_eX]

[DOI]

Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2020

Low Complexity Online Radio Access Technology Selection Algorithm in LTE-WiFi HetNet.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., 2020

A Variational Characterization of the Risk-Sensitive Average Reward for Controlled Diffusions on ℝ<sup>d</sup>.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2020

2019

LP Formulations of Discrete Time Long-Run Average Optimal Control Problems: The NonErgodic Case.

[BibT_eX]

[DOI]

Ilya Shvartsman

SIAM J. Control. Optim., 2019

Stochastic approximation algorithms for rumor source inference on graphs.

[BibT_eX]

[DOI]

Anand Kalvit

Perform. Evaluation, 2019

Non-asymptotic error bounds for constant stepsize stochastic approximation for tracking mobile agents.

[BibT_eX]

[DOI]

Bhumesh Kumar

Akhil Shetty

Math. Control. Signals Syst., 2019

Linear Programming Formulation of Long-Run Average Optimal Control Problem.

[BibT_eX]

[DOI]

J. Optim. Theory Appl., 2019

Aerial monitoring of slow moving convoys using elliptical orbits.

[BibT_eX]

[DOI]

Aseem Vivek Borkar

Arpita Sinha

Eur. J. Control, 2019

Metastability in Stochastic Replicator Dynamics.

[BibT_eX]

[DOI]

Dyn. Games Appl., 2019

Average cost optimal control under weak hypotheses: Relative value iterations.

[BibT_eX]

[DOI]

CoRR, 2019

A Structure-aware Online Learning Algorithm for Markov Decision Processes.

[BibT_eX]

[DOI]

Proceedings of the 12th EAI International Conference on Performance Evaluation Methodologies and Tools, 2019

On LP Formulations of Optimal Control Problems with Time Averaging and Time Discounting Criteria in Non-Ergodic Case.

[BibT_eX]

[DOI]

Ilya Shvartsman

Proceedings of the 58th IEEE Conference on Decision and Control, 2019

A learning algorithm for the Whittle index policy for scheduling web crawlers.

[BibT_eX]

[DOI]

Proceedings of the 57th Annual Allerton Conference on Communication, 2019

2018

Opportunistic Scheduling as Restless Bandits.

[BibT_eX]

[DOI]

IEEE Trans. Control. Netw. Syst., 2018

Whittle Index Policy for Crawling Ephemeral Content.

[BibT_eX]

[DOI]

IEEE Trans. Control. Netw. Syst., 2018

Distributed Stochastic Approximation with Local Projections.

[BibT_eX]

[DOI]

Suhail Mohmad Shah

SIAM J. Optim., 2018

Q-learning for Markov decision processes with a satisfiability criterion.

[BibT_eX]

[DOI]

Suhail M. Shah

Syst. Control. Lett., 2018

Distributed Algorithms: Tsitsiklis and Beyond.

[BibT_eX]

[DOI]

Suhail Mohmad Shah

Proceedings of the 2018 Information Theory and Applications Workshop, 2018

Concentration bounds for two time scale stochastic approximation.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Allerton Conference on Communication, 2018

2017

Temporally Agnostic Rumor-Source Detection.

[BibT_eX]

[DOI]

Ankit Kumar

IEEE Trans. Signal Inf. Process. over Networks, 2017

Distributed Reinforcement Learning via Gossip.

[BibT_eX]

[DOI]

Adwaitvedant S. Mathkar

IEEE Trans. Autom. Control., 2017

Whittle Index for Partially Observed Binary Markov Decision Processes.

[BibT_eX]

[DOI]

IEEE Trans. Autom. Control., 2017

A Distributed Boyle-Dykstra-Han Scheme.

[BibT_eX]

[DOI]

Soham R. Phade

SIAM J. Optim., 2017

A Correction to "A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions".

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2017

A Variational Formula for Risk-Sensitive Reward.

[BibT_eX]

[DOI]

V. Anantharam

SIAM J. Control. Optim., 2017

Gradient Estimation with Simultaneous Perturbation and Compressive Sensing.

[BibT_eX]

[DOI]

Vikranth Reddy Dwaracherla

Neeraja Sahasrabudhe

J. Mach. Learn. Res., 2017

Approachability in Stackelberg Stochastic Games with Vector Costs.

[BibT_eX]

[DOI]

Dyn. Games Appl., 2017

Distributed Server Allocation for Content Delivery Networks.

[BibT_eX]

[DOI]

CoRR, 2017

MCMC Approaches to Rumor Source Inference using Pairwise Information.

[BibT_eX]

[DOI]

Anand Kalvit

Proceedings of the 11th EAI International Conference on Performance Evaluation Methodologies and Tools, 2017

A dynamic programming framework for optimal home scheduling.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Innovative Smart Grid Technologies - Asia, 2017

Vector field guidance for convoy monitoring using elliptical orbits.

[BibT_eX]

[DOI]

Aseem Vivek Borkar

Arpita Sinha

Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017

Linear and dynamic programming approaches to degenerate risk-sensitive reward processes.

[BibT_eX]

[DOI]

Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017

Controlling G-AIMD by index policy.

[BibT_eX]

[DOI]

Proceedings of the 56th IEEE Annual Conference on Decision and Control, 2017

2016

Actor-Critic Algorithms with Online Feature Adaptation.

[BibT_eX]

[DOI]

Prabuchandran K. J.

ACM Trans. Model. Comput. Simul., 2016

Fast rumor source identification via random walks.

[BibT_eX]

[DOI]

Alankar Jain

Dinesh Garg

Soc. Netw. Anal. Min., 2016

Nonlinear Gossip.

[BibT_eX]

[DOI]

Adwaitvedant S. Mathkar

SIAM J. Control. Optim., 2016

Gaussian approximations in high dimensional estimation.

[BibT_eX]

[DOI]

Raaz Dwivedi

Neeraja Sahasrabudhe

Syst. Control. Lett., 2016

An Adaptive Window Scheme for Backoff in 802.11 MAC Protocol.

[BibT_eX]

[DOI]

Priyesh Y. Shetty

EAI Endorsed Trans. Wirel. Spectr., 2016

Distributed and Asynchronous Methods for Semi-supervised Learning.

[BibT_eX]

[DOI]

Krishnakant V. Saboo

Proceedings of the Algorithms and Models for the Web Graph - 13th International Workshop, 2016

Randomized Kaczmarz for rank aggregation from pairwise comparisons.

[BibT_eX]

[DOI]

Sharad Mirani

Proceedings of the 2016 IEEE Information Theory Workshop, 2016

A concentration result for stochastic approximation (Extended abstract).

[BibT_eX]

[DOI]

Gugan Thoppe

Proceedings of the 2016 Information Theory and Applications Workshop, 2016

Comparison of Random Walk Based Techniques for Estimating Network Averages.

[BibT_eX]

[DOI]

Jithin Kazuthuveettil Sreedharan

Arun Kadavankandy

Proceedings of the Computational Social Networks - 5th International Conference, 2016

Edge conductance estimation using MCMC.

[BibT_eX]

[DOI]

Proceedings of the 54th Annual Allerton Conference on Communication, 2016

2015

Manufacturing Consent.

[BibT_eX]

[DOI]

IEEE Trans. Autom. Control., 2015

Parallel and Distributed Approaches for Graph Based Semi-supervised Learning.

[BibT_eX]

[DOI]

Krishnakant V. Saboo

CoRR, 2015

A variational formula for risk-sensitive reward.

[BibT_eX]

[DOI]

Venkatachalam Anantharam

Vivek Shripad Borkar

CoRR, 2015

Rocking in two by two: From Collatz-Wielandt to Donsker-Varadhan.

[BibT_eX]

[DOI]

V. Anantharam

Proceedings of the 2015 Information Theory and Applications Workshop, 2015

2014

Risk-Constrained Markov Decision Processes.

[BibT_eX]

[DOI]

IEEE Trans. Autom. Control., 2014

Convergence of the Relative Value Iteration for the Ergodic Control Problem of Nondegenerate Diffusions under Near-Monotone Costs.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2014

Asynchronous Gossip for Averaging and Spectral Ranking.

[BibT_eX]

[DOI]

Rahul Makhijani

IEEE J. Sel. Top. Signal Process., 2014

An Improved Block Coordinate Descent Method for Quadratic Programming in High Dimensions.

[BibT_eX]

[DOI]

Gugan Thoppe

Dinesh Garg

CoRR, 2014

Empirical Q-Value Iteration.

[BibT_eX]

[DOI]

CoRR, 2014

A Learning Scheme for Approachability in MDPs and Stackelberg Stochastic Games.

[BibT_eX]

[DOI]

CoRR, 2014

A stochastic Kaczmarz algorithm for network tomography.

[BibT_eX]

[DOI]

Gugan Thoppe

Autom., 2014

Submodularity in Team Formation Problem.

[BibT_eX]

[DOI]

Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Reinforcement learning, particle filters and the EM algorithm.

[BibT_eX]

[DOI]

Ankushkumar Jain

Proceedings of the 2014 Information Theory and Applications Workshop, 2014

Reinforcement Learning for Matrix Computations: PageRank as an Example.

[BibT_eX]

[DOI]

Adwaitvedant S. Mathkar

Proceedings of the Distributed Computing and Internet Technology, 2014

Blackwell's approachability in stackelberg stochastic games: A learning version.

[BibT_eX]

[DOI]

Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

An actor critic algorithm based on Grassmanian search.

[BibT_eX]

[DOI]

Prabuchandran K. J.

Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

From Collatz-Weilandt to Donkser-Varadhan via Krein-Rutman.

[BibT_eX]

[DOI]

Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

Stabilization of a quasi-linear parabolic Cauchy problem associated with ergodic control of diffusions.

[BibT_eX]

[DOI]

Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

2013

On the diffusion of messages in on-line social networks.

[BibT_eX]

[DOI]

Aditya Karnik

Atul Saroop

Perform. Evaluation, 2013

Feature Search in the Grassmanian in Online Reinforcement Learning.

[BibT_eX]

[DOI]

Prabuchandran K. J.

IEEE J. Sel. Top. Signal Process., 2013

Markov chains, Hamiltonian cycles and volumes of convex bodies.

[BibT_eX]

[DOI]

Jerzy A. Filar

J. Glob. Optim., 2013

Model-based clock synchronization protocol for wireless sensor networks.

[BibT_eX]

[DOI]

Nikolaos M. Freris

P. R. Kumar

CoRR, 2013

How to gossip if you must.

[BibT_eX]

[DOI]

Rahul Makhijani

CoRR, 2013

Gossip with transmission constraints.

[BibT_eX]

[DOI]

Rahul Makhijani

Proceedings of the 2013 Information Theory and Applications Workshop, 2013

2012

A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2012

Stochastic approximation with long range dependent and heavy tailed noise.

[BibT_eX]

[DOI]

Queueing Syst. Theory Appl., 2012

Relative Value Iteration for Stochastic Differential Games

[BibT_eX]

[DOI]

CoRR, 2012

Oja's algorithm for graph clustering, Markov spectral decomposition, and risk sensitive control.

[BibT_eX]

[DOI]

Autom., 2012

A relative value iteration for controlled diffusions under ergodic cost.

[BibT_eX]

[DOI]

Proceedings of the 51th IEEE Conference on Decision and Control, 2012

Who is the fairest of them all?

[BibT_eX]

[DOI]

Rahul M. Makhijani

Proceedings of the 50th Annual Allerton Conference on Communication, 2012

2011

Optimal Control of Sleep Periods for Wireless Terminals.

[BibT_eX]

[DOI]

IEEE J. Sel. Areas Commun., 2011

Stochastic approximation with 'bad' noise.

[BibT_eX]

[DOI]

Proceedings of the Information Theory and Applications Workshop, 2011

Risk sensitive optimal control framework applied to delay tolerant networks.

[BibT_eX]

[DOI]

Eitan Altman

Veeraruna Kavitha

Francesco De Pellegrini

Vijay Kamble

Proceedings of the INFOCOM 2011. 30th IEEE International Conference on Computer Communications, 2011

Convergence analysis for an online recommendation system.

[BibT_eX]

[DOI]

Anh Truong

Negar Kiyavash

Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011

Asymptotics of the invariant measure in mean field models with jumps.

[BibT_eX]

[DOI]

Vivek Shripad Borkar

Proceedings of the 49th Annual Allerton Conference on Communication, 2011

Controlled gossip.

[BibT_eX]

[DOI]

Aditya Karnik

Proceedings of the 49th Annual Allerton Conference on Communication, 2011

2010

A Stable Online Algorithm for Energy-Efficient Multiuser Scheduling.

[BibT_eX]

[DOI]

Nitin Salodkar

IEEE Trans. Mob. Comput., 2010

Singular Perturbations in Risk-Sensitive Stochastic Control.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2010

ERRATUM: White-Noise Representations in Stochastic Realization Theory.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2010

Uniform Recurrence Properties of Controlled Diffusions and Applications to Optimal Control.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2010

On a controlled eigenvalue problem.

[BibT_eX]

[DOI]

Anup Biswas

Syst. Control. Lett., 2010

Application of nonlinear filtering to credit risk.

[BibT_eX]

[DOI]

Mrinal K. Ghosh

Govindan Rangarajan

Oper. Res. Lett., 2010

Quasi-stationary distributions as centrality measures for the giant strongly connected component of a reducible graph.

[BibT_eX]

[DOI]

Danil Nemirovsky

J. Comput. Appl. Math., 2010

Manufacturing consent.

[BibT_eX]

[DOI]

Jayakrishnan Nair

Sanketh Nalli

Proceedings of the 48th Annual Allerton Conference on Communication, 2010

2009

A novel ACO algorithm for optimization via reinforcement and initial bias.

[BibT_eX]

[DOI]

Dibyajyoti Das

Swarm Intell., 2009

Cooperative dynamics and Wardrop equilibria.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2009

On the Hamiltonicity Gap and doubly stochastic matrices.

[BibT_eX]

[DOI]

Vladimir Ejov

Jerzy A. Filar

Random Struct. Algorithms, 2009

A New Learning Algorithm for Optimal Stopping.

[BibT_eX]

[DOI]

Jervis Pinto

Tarun Prabhu

Discret. Event Dyn. Syst., 2009

A Stable On-line Algorithm for Energy Efficient Multi-user Scheduling

[BibT_eX]

[DOI]

Nitin Salodkar

CoRR, 2009

Finite dimensional approximation and Newton-based algorithm for stochastic approximation in Hilbert space.

[BibT_eX]

[DOI]

Ankur A. Kulkarni

Autom., 2009

Vacation policy optimization with application to IEEE 802.16e power saving mechanism.

[BibT_eX]

[DOI]

Proceedings of the 2nd IFIP Wireless Days, 2009

A Theory of QoS for Wireless.

[BibT_eX]

[DOI]

I-Hong Hou

P. R. Kumar

Proceedings of the INFOCOM 2009. 28th IEEE International Conference on Computer Communications, 2009

Convergence of population dynamics in symmetric routing games with a finite number of playersz.

[BibT_eX]

[DOI]

Eitan Altman

Vijay Kamble

Proceedings of the 1st International Conference on Game Theory for Networks, 2009

A model-based approach to clock synchronization.

[BibT_eX]

[DOI]

Nikolaos M. Freris

P. R. Kumar

Proceedings of the 48th IEEE Conference on Decision and Control, 2009

Optimal sampling for state change detection with application to the control of sleep mode.

[BibT_eX]

[DOI]

Proceedings of the 48th IEEE Conference on Decision and Control, 2009

2008

Distributed topology control of wireless networks.

[BibT_eX]

[DOI]

Wirel. Networks, 2008

Structural Properties of Optimal Transmission Policies Over a Randomly Varying Channel.

[BibT_eX]

[DOI]

Mukul Agarwal

IEEE Trans. Autom. Control., 2008

Stochastic Control with Imperfect Models.

[BibT_eX]

[DOI]

Arnab Basu

SIAM J. Control. Optim., 2008

A note on linear function approximation using random projections.

[BibT_eX]

[DOI]

Kishor Barman

Syst. Control. Lett., 2008

A Learning Algorithm for Risk-Sensitive Cost.

[BibT_eX]

[DOI]

Arnab Basu

Tirthankar Bhattacharyya

Math. Oper. Res., 2008

An on-line learning algorithm for energy efficient delay constrained scheduling over a fading channel.

[BibT_eX]

[DOI]

IEEE J. Sel. Areas Commun., 2008

Oja's algorithm for graph clustering and Markov spectral decomposition.

[BibT_eX]

[DOI]

Proceedings of the 3rd International ICST Conference on Performance Evaluation Methodologies and Tools, 2008

Opportunistic Transmission over Randomly Varying Channels.

[BibT_eX]

[DOI]

Proceedings of the Network Control and Optimization, Second Euro-NF Workshop, 2008

Index Policies for Real-Time Multicast Scheduling for Wireless Broadcast Systems.

[BibT_eX]

[DOI]

Proceedings of the INFOCOM 2008. 27th IEEE International Conference on Computer Communications, 2008

A learning scheme for stationary probabilities of large markov chains with examples.

[BibT_eX]

[DOI]

Proceedings of the 46th Annual Allerton Conference on Communication, 2008

2007

Singular Perturbations in Ergodic Control of Diffusions.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2007

Dynamic Programming for Ergodic Control of Markov Chains under Partial Observations: A Correction.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2007

Common randomness and distributed control: A counterexample.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2007

Power Efficient Scheduling under Delay Constraints over Multi-user Wireless Channels

[BibT_eX]

[DOI]

Nitin Salodkar

CoRR, 2007

Quasi-stationary distributions as centrality measures of reducible graphs

[BibT_eX]

[DOI]

Danil Nemirovsky

CoRR, 2007

Some Examples of Stochastic Approximation in Communications.

[BibT_eX]

[DOI]

Proceedings of the Network Control and Optimization, 2007

An Information Theoretic View of Stochastic Resonance.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Information Theory, 2007

2006

Distributed self-tuning of sensor networks.

[BibT_eX]

[DOI]

Aditya Karnik

Anurag Kumar

Wirel. Networks, 2006

Erratum to "Avoidance of traps in stochastic approximation" [Systems Control Letters 50 (2003) 1-9].

[BibT_eX]

[DOI]

Syst. Control. Lett., 2006

Stochastic approximation with 'controlled Markov' noise.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2006

A Simulation-Based Algorithm for Ergodic Control of Markov Chains Conditioned on Rare Events.

[BibT_eX]

[DOI]

Madhukar Akarapu

J. Mach. Learn. Res., 2006

Adaptive Importance Sampling Technique for Markov Chains Using Stochastic Approximation.

[BibT_eX]

[DOI]

T. P. I. Ahamed

S. Juneja

Oper. Res., 2006

Power Optimal Opportunistic Scheduling.

[BibT_eX]

[DOI]

Abhijeet Bhorkar

Proceedings of the Global Telecommunications Conference, 2006. GLOBECOM '06, San Francisco, CA, USA, 27 November, 2006

A New Distributed Time Synchronization Protocol for Multihop Wireless Networks.

[BibT_eX]

[DOI]

Roberto Solis-Robles

Panganamala R. Kumar

Proceedings of the 45th IEEE Conference on Decision and Control, 2006

Existence of limit occupational measures set used for averaging of singularly perturbed controlled stochastic differential equations.

[BibT_eX]

[DOI]

Proceedings of the 45th IEEE Conference on Decision and Control, 2006

2005

On Existence of Limit Occupational Measures Set of a Controlled Stochastic Differential Equation.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2005

An actor-critic algorithm for constrained Markov decision processes.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2005

Closed and Open Loop Optimal Control of Buffer and Energy of a Wireless Device.

[BibT_eX]

[DOI]

Arzad Alam Kherani

B. J. Prabhu

Proceedings of the 3rd International Symposium on Modeling and Optimization in Mobile, 2005

Sequential Source Coding: An optimization viewpoint.

[BibT_eX]

[DOI]

Proceedings of the 44th IEEE IEEE Conference on Decision and Control and 8th European Control Conference Control, 2005

HJB Equations for Ergodic Control Problems for Constrained Diffusions in Polyhedral Domains.

[BibT_eX]

[DOI]

Amarjit Budhiraja

Proceedings of the 44th IEEE IEEE Conference on Decision and Control and 8th European Control Conference Control, 2005

2004

Ergodic Control for Constrained Diffusions: Characterization Using HJB Equations.

[BibT_eX]

[DOI]

Amarjit Budhiraja

SIAM J. Control. Optim., 2004

Directed graphs, Hamiltonicity and doubly stochastic matrices.

[BibT_eX]

[DOI]

Vladimir Ejov

Jerzy A. Filar

Random Struct. Algorithms, 2004

Charge-based control of DiffServ-like queues.

[BibT_eX]

[DOI]

Autom., 2004

Optimal random access in networks with two-way traffic.

[BibT_eX]

[DOI]

Eitan Altman

Arzad Alam Kherani

Proceedings of the IEEE 15th International Symposium on Personal, 2004

Some Game-Theoretic Problems in Wireless Ad-Hoc Networks.

[BibT_eX]

[DOI]

Proceedings of the Wireless Systems and Mobility in Next Generation Internet, 2004

2003

Dynamic Cesaro-Wardrop equilibration in networks.

[BibT_eX]

[DOI]

Panganamala Ramana Kumar

IEEE Trans. Autom. Control., 2003

Multiscale Chaotic SPSA and Smoothed Functional Algorithms for Simulation Optimization.

[BibT_eX]

[DOI]

Simul., 2003

Avoidance of traps in stochastic approximation.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2003

Peformance Analysis Conditioned on Rare Events: An Adaptive Simulation Scheme.

[BibT_eX]

[DOI]

S. Juneja

Arzad Alam Kherani

Commun. Inf. Syst., 2003

Algorithmic Mechanisms for Secure Multi-Auction Systems.

[BibT_eX]

Mehul S. Dave

R. K. Shyamasundar

Proceedings of the 1st Indian International Conference on Artificial Intelligence, 2003

Value functions and performance evaluation in stochastic network models.

[BibT_eX]

[DOI]

Proceedings of the 42nd IEEE Conference on Decision and Control, 2003

2002

Stochastic Approximation for Nonexpansive Maps: Application to Q-Learning Algorithms.

[BibT_eX]

[DOI]

Jinane Abounadi

Dimitri P. Bertsekas

SIAM J. Control. Optim., 2002

Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost.

[BibT_eX]

[DOI]

Math. Oper. Res., 2002

Q-Learning for Risk-Sensitive Control.

[BibT_eX]

[DOI]

Math. Oper. Res., 2002

Mathematical Programming Embeddings of Logic.

[BibT_eX]

[DOI]

Vijay Chandru

J. Autom. Reason., 2002

On The Lock-In Probability Of Stochastic Approximation.

[BibT_eX]

[DOI]

Comb. Probab. Comput., 2002

Reinforcement Learning in Markovian Evolutionary Games.

[BibT_eX]

[DOI]

Adv. Complex Syst., 2002

Differential Join Prices for Parallel Queues: Social Optimality, Dynamic Pricing Algorithms and Application to Internet Pricing.

[BibT_eX]

[DOI]

Parijat Dube

Proceedings of the Proceedings IEEE INFOCOM 2002, 2002

2001

Optimal Sequential Vector Quantization of Markov Sources.

[BibT_eX]

[DOI]

Sekhar Tatikonda

SIAM J. Control. Optim., 2001

Learning Algorithms for Markov Decision Processes with Average Cost.

[BibT_eX]

[DOI]

Jinane Abounadi

Dimitri P. Bertsekas

SIAM J. Control. Optim., 2001

A sensitivity formula for risk-sensitive cost and the actor-critic algorithm.

[BibT_eX]

[DOI]

Syst. Control. Lett., 2001

Markov control problems under communication constraints.

[BibT_eX]

[DOI]

Sekhar Tatikonda

Commun. Inf. Syst., 2001

2000

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2000

Erratum: Asynchronous Stochastic Approximations.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2000

Average Cost Dynamic Programming Equations For Controlled Markov Chains With Partial Observations.

[BibT_eX]

[DOI]

SIAM J. Control. Optim., 2000

A learning algorithm for Markov decision processes with adaptive state aggregation.

[BibT_eX]

[DOI]

John S. Baras

Proceedings of the 39th IEEE Conference on Decision and Control, 2000

1999

Randomized neural networks for learning stochastic dependences.

[BibT_eX]

[DOI]

Piyush Gupta

IEEE Trans. Syst. Man Cybern. Part B, 1999

Actor-Critic - Type Learning Algorithms for Markov Decision Processes.

[BibT_eX]

[DOI]

Vijaymohan Konda

SIAM J. Control. Optim., 1999

1998

A unified framework for hybrid control: model and optimal control theory.

[BibT_eX]

[DOI]

Michael S. Branicky

IEEE Trans. Autom. Control., 1998

Dynamics of individual specialization and global diversification in communities.

[BibT_eX]

[DOI]

Sanjay Jain

Govindan Rangarajan

Complex., 1998

Generalized Replicator Dynamics as a Model of Specialization and Diversity in Societies.

[BibT_eX]

[DOI]

Sanjay Jain

Govindan Rangarajan

Adv. Complex Syst., 1998

1997

Learning Decentralized Goal-based Vector Quantization.

[BibT_eX]

[DOI]

Piyush Gupta

Complex Syst., 1997

1996

Stochastic processes that generate polygonal and related random fields.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 1996

1995

A Convex Analytic Framework for Ergodic Control of Semi-Markov Processes.

[BibT_eX]

[DOI]

Math. Oper. Res., 1995

Epsilon-Approximation of Differential Inclusions.

[BibT_eX]

[DOI]

Anuj Puri

Pravin Varaiya

Proceedings of the Hybrid Systems III: Verification and Control, 1995

1992

Ergodic and adaptive control of nearest-neighbor motions.

[BibT_eX]

[DOI]

Mrinal K. Ghosh

Math. Control. Signals Syst., 1992

Weak convergence methods and singularly perturbed stochastic control and filtering problems: Harold J. Kushner.

[BibT_eX]

[DOI]

Autom., 1992

1991

A remark on control of partially observed Markov chains.

[BibT_eX]

[DOI]