CoRR, January, 2025

2024

On Learning Whittle Index Policy for Restless Bandits With Scalable Regret.

[DOI]

IEEE Trans. Control. Netw. Syst., September, 2024

Strong Consistency and Rate of Convergence of Switched Least Squares System Identification for Autonomous Markov Jump Linear Systems.

[DOI]

IEEE Trans. Autom. Control., June, 2024

Two families of indexable partially observable restless bandits and Whittle index computation.

[DOI]

Perform. Evaluation, 2024

Concentration of Cumulative Reward in Markov Decision Processes.

[DOI]

Peter E. Caines

CoRR, 2024

Model approximation in MDPs with unbounded per-step cost.

[DOI]

CoRR, 2024

Periodic agent-state based Q-learning for POMDPs.

[DOI]

Matthieu Geist

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Bridging State and History Representations: Understanding Self-Predictive RL.

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Agent-state based policies in POMDPs: Beyond belief-state MDPs.

[DOI]

Proceedings of the 63rd IEEE Conference on Decision and Control, 2024

Asymptotic Normality of Cumulative Cost in Linear Quadratic Regulators.

[DOI]

Peter E. Caines

Proceedings of the 63rd IEEE Conference on Decision and Control, 2024

A vector almost-supermartingale convergence theorem and its applications.

[DOI]

Silviu-Iulian Niculescu

Mathukumalli Vidyasagar

Proceedings of the 63rd IEEE Conference on Decision and Control, 2024

Constant step-size stochastic approximation with delayed updates.

[DOI]

Silviu-Iulian Niculescu

Mathukumalli Vidyasagar

Proceedings of the 63rd IEEE Conference on Decision and Control, 2024

On learning history-based policies for controlling Markov decision processes.

[DOI]

Gandharv Patil

Doina Precup

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023

Decentralized Linear Quadratic Systems With Major and Minor Agents and Non-Gaussian Noise.

[DOI]

IEEE Trans. Autom. Control., August, 2023

Scalable Regret for Learning to Control Network-Coupled Subsystems With Unknown Dynamics.

[DOI]

IEEE Trans. Control. Netw. Syst., March, 2023

Robustness and Sample Complexity of Model-Based MARL for General-Sum Markov Games.

[DOI]

Dyn. Games Appl., March, 2023

Mean-field games among teams.

[DOI]

Akshat Kumar

CoRR, 2023

Approximate information state based convergence analysis of recurrent Q-learning.

[DOI]

CoRR, 2023

Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning.

[DOI]

Hadi Nekoei

Akilesh Badrinaaraayanan

Mohammad Amini

Janarthanan Rajendran

Sarath Chandar

Proceedings of the Conference on Lifelong Learning Agents, 2023

Asymmetric Actor-Critic with Approximate Information State.

[DOI]

Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Relative Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems.

[DOI]

Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Weighted-Norm Bounds on Model Approximation in MDPs with Unbounded Per-Step Cost.

[DOI]

Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

2022

Optimal Control of Network-Coupled Subsystems: Spectral Decomposition and Low-Dimensional Solutions.

[DOI]

Shuang Gao

IEEE Trans. Control. Netw. Syst., 2022

Scalable Operator Allocation for Multirobot Assistance: A Restless Bandit Approach.

[DOI]

IEEE Trans. Control. Netw. Syst., 2022

Structure-Aware Reinforcement Learning for Node-Overload Protection in Mobile Edge Computing.

[DOI]

IEEE Trans. Cogn. Commun. Netw., 2022

Approximate Information State for Approximate Planning and Reinforcement Learning in Partially Observed Systems.

[DOI]

Raihan Seraj

J. Mach. Learn. Res., 2022

Consistency and Rate of Convergence of Switched Least Squares System Identification for Autonomous Markov Jump Linear Systems.

[DOI]

Proceedings of the 61st IEEE Conference on Decision and Control, 2022

Thompson-Sampling Based Reinforcement Learning for Networked Control of Unknown Linear Systems.

[DOI]

Proceedings of the 61st IEEE Conference on Decision and Control, 2022

A modified Thompson sampling-based learning algorithm for unknown linear systems.

[DOI]

Proceedings of the 61st IEEE Conference on Decision and Control, 2022

Partially observable restless bandits with restarts: indexability and computation of Whittle index.

[DOI]

Proceedings of the 61st IEEE Conference on Decision and Control, 2022

2021

Multi-Agent Estimation and Filtering for Minimizing Team Mean-Squared Error.

[DOI]

IEEE Trans. Signal Process., 2021

Consistency and Rate of Convergence of Switched Least Squares System Identification for Autonomous Switched Linear Systems.

[DOI]

CoRR, 2021

Scalable Operator Allocation for Multi-Robot Assistance: A Restless Bandit Approach.

[DOI]

CoRR, 2021

A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems.

[DOI]

CoRR, 2021

Maintenance of a collection of machines under partial observability: Indexability and computation of Whittle index.

[DOI]

CoRR, 2021

Mean-field approximation for large-population beauty-contest games.

[DOI]

Raihan Seraj

Jerome Le Ny

Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

Decision Referrals in Human-Automation Teams.

[DOI]

Kesav Kaza

Jerome Le Ny

Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

Thompson sampling for linear quadratic mean-field teams.

[DOI]

Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2020

Counterexamples on the Monotonicity of Delay Optimal Strategies for Energy Harvesting Transmitters.

[DOI]

IEEE Wirel. Commun. Lett., 2020

Renewal Monte Carlo: Renewal Theory-Based Reinforcement Learning.

[DOI]

IEEE Trans. Autom. Control., 2020

Remote Estimation Over a Packet-Drop Channel With Markovian State.

[DOI]

IEEE Trans. Autom. Control., 2020

Optimal Local and Remote Controllers With Unreliable Uplink Channels: An Elementary Proof.

[DOI]

IEEE Trans. Autom. Control., 2020

Restless bandits: indexability and computation of Whittle index.

[DOI]

CoRR, 2020

Cross-layer communication over fading channels with adaptive decision feedback.

[DOI]

Edmund Yeh

Proceedings of the 18th International Symposium on Modeling and Optimization in Mobile, 2020

Completeness Seeking Probabilistic Coverage Estimation using Uncertain State Estimates.

[DOI]

Stephen Rock

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

2019

Dynamic spectrum access under partial observations: A restless bandit approach.

[DOI]

Proceedings of the 16th Canadian Workshop on Information Theory, 2019

Approximate information state for partially observed systems.

[DOI]

Proceedings of the 58th IEEE Conference on Decision and Control, 2019

Networked control of coupled subsystems: Spectral decomposition and low-dimensional solutions.

[DOI]

Shuang Gao

Proceedings of the 58th IEEE Conference on Decision and Control, 2019

Restless bandits with controlled restarts: Indexability and computation of Whittle index.

[DOI]

Proceedings of the 58th IEEE Conference on Decision and Control, 2019

Reinforcement Learning in Stationary Mean-field Games.

[DOI]

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018

Information-Theoretic Privacy for Smart Metering Systems with a Rechargeable Battery.

[DOI]

Simon Li

IEEE Trans. Inf. Theory, 2018

Sufficient Conditions for the Value Function and Optimal Strategy to be Even and Quasi-Convex.

[DOI]

IEEE Trans. Autom. Control., 2018

On Controllability of Leader-Follower Dynamics over a Directed Graph.

[DOI]

Aditya A. Paranjape

Proceedings of the 57th IEEE Conference on Decision and Control, 2018

Optimal sampling of multiple linear processes over a shared medium.

[DOI]

Sebin Mathew

Karl H. Johannson

Proceedings of the 57th IEEE Conference on Decision and Control, 2018

Team Optimal Decentralized State Estimation.

[DOI]

Proceedings of the 57th IEEE Conference on Decision and Control, 2018

2017

Fundamental Limits of Remote Estimation of Autoregressive Markov Processes Under Communication Constraints.

[DOI]

IEEE Trans. Autom. Control., 2017

Remote Estimation over Control Area Networks.

[DOI]

Proceedings of the 86th IEEE Vehicular Technology Conference, 2017

Structure of optimal strategies for remote estimation over Gilbert-Elliott channel with feedback.

[DOI]

Proceedings of the 2017 IEEE International Symposium on Information Theory, 2017

On privacy in smart metering systems with periodically time-varying input distribution.

[DOI]

Yu Liu

Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017

Multi-armed bandits for efficient lifetime estimation in MPSoC design.

[DOI]

Calvin Ma

Brett H. Meyer

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

Stochastic approximation based methods for computing the optimal thresholds in remote-state estimation with packet drops.

[DOI]

Proceedings of the 2017 American Control Conference, 2017

2016

Decentralized stochastic control.

[DOI]

Mehnaz Mannan

Ann. Oper. Res., 2016

Structural results for two-user interactive communication.

[DOI]

Proceedings of the IEEE International Symposium on Information Theory, 2016

Privacy-optimal strategies for smart metering systems with a rechargeable battery.

[DOI]

Simon Li

Proceedings of the 2016 American Control Conference, 2016

2015

Sufficient Statistics for Linear Control Strategies in Decentralized Systems With Partial History Sharing.

[DOI]

IEEE Trans. Autom. Control., 2015

Fundamental limits of remote estimation of Markov processes under communication constraints.

[DOI]

CoRR, 2015

An algorithmic approach to identify irrelevant information in sequential teams.

[DOI]

Autom., 2015

Structure of optimal privacy-preserving policies in smart-metered systems with a rechargeable battery.

[DOI]

Simon Li

Proceedings of the 16th IEEE International Workshop on Signal Processing Advances in Wireless Communications, 2015

Distortion-transmission trade-off in real-time transmission of Markov sources.

[DOI]

Proceedings of the 2015 IEEE Information Theory Workshop, 2015

Distortion-transmission trade-off in real-time transmission of Gauss-Markov sources.

[DOI]

Proceedings of the IEEE International Symposium on Information Theory, 2015

On computing optimal thresholds in decentralized sequential hypothesis testing.

[DOI]

Can Cui

Proceedings of the 54th IEEE Conference on Decision and Control, 2015

Team-optimal solution of finite number of mean-field coupled LQG subsystems.

[DOI]

Jalal Arabneydi

Proceedings of the 54th IEEE Conference on Decision and Control, 2015

An estimation based allocation rule with super-linear regret and finite lock-on time for time-dependent multi-armed bandit processes.

[DOI]

Prokopis C. Prokopiou

Peter E. Caines

Proceedings of the IEEE 28th Canadian Conference on Electrical and Computer Engineering, 2015

Reinforcement learning in decentralized stochastic control systems with partial history sharing.

[DOI]

Jalal Arabneydi

Proceedings of the American Control Conference, 2015

2014

Simultaneous real-time transmission of multiple Markov sources over a shared channel.

[DOI]

Mehnaz Mannan

Proceedings of the 2014 IEEE International Symposium on Information Theory, Honolulu, HI, USA, June 29, 2014

On the optimal thresholds in remote state estimation with communication costs.

[DOI]

Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

Team optimal control of coupled subsystems with mean-field sharing.

[DOI]

Jalal Arabneydi

Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

Average cost optimal threshold strategies for remote estimation with communication cost.

[DOI]

Proceedings of the 52nd Annual Allerton Conference on Communication, 2014

2013

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach.

[DOI]

IEEE Trans. Autom. Control., 2013

Optimal Decentralized Control of Coupled Subsystems With Control Sharing.

[DOI]

IEEE Trans. Autom. Control., 2013

Forensic Analysis of Instant Messenger Applications on Android Devices

[DOI]

M. S. Dahiya

Hitesh P. Sanghvi

CoRR, 2013

Static LQG teams with countably infinite players.

[DOI]

Nuno C. Martins

Serdar Yüksel

Proceedings of the 52nd IEEE Conference on Decision and Control, 2013

2012

Opportunistic Capacity and Error Exponent Region for the Compound Channel With Feedback.

[DOI]

IEEE Trans. Inf. Theory, 2012

Age Based User Interface in Mobile Operating System

[DOI]

CoRR, 2012

Structure of optimal policies in active sensing.

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

On optimal block Markov coding schemes for multiple-access channel with feedback.

[DOI]

Proceedings of the 46th Annual Conference on Information Sciences and Systems, 2012

Information structures in optimal decentralized control.

[DOI]

Proceedings of the 51th IEEE Conference on Decision and Control, 2012

2011

Optimal Control Strategies in Delayed Sharing Information Structures.

[DOI]

IEEE Trans. Autom. Control., 2011

2010

A Wireless Soil Moisture Smart Sensor Web Using Physics-Based Optimal Control: Concept and Initial Demonstrations.

[DOI]

IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2010

Measurement Scheduling for Soil Moisture Sensing: From Physical Models to Optimal Control.

[DOI]

Mahta Moghaddam

Dara Entekhabi

Proc. IEEE, 2010

Optimal sequential transmission over broadcast channel with nested feedback

[DOI]

CoRR, 2010

Opportunistic capacity and error exponent regions for compound channel with feedback.

[DOI]

Proceedings of the IEEE International Symposium on Information Theory, 2010

Opportunistic capacity and error exponent regions for variable length communication over compound channel with feedback.

[DOI]

Proceedings of the 44th Annual Conference on Information Sciences and Systems, 2010

A structural result for delayed sharing information structures.

[DOI]

Proceedings of the American Control Conference, 2010

Measure and cost dependent properties of information structures.

[DOI]

Serdar Yüksel

Proceedings of the American Control Conference, 2010

Optimal transmission policies for two-user multiple access broadcast using dynamic team theory.

[DOI]

Proceedings of the 48th Annual Allerton Conference on Communication, 2010

2009

Optimal design of sequential real-time communication systems.

[DOI]

IEEE Trans. Inf. Theory, 2009

Optimal Performance of Networked Control Systems with Nonclassical Information Structures.

[DOI]

SIAM J. Control. Optim., 2009

A Graphical modeling approach to simplifying sequential teams.

[DOI]

Proceedings of the 7th International Symposium on Modeling and Optimization in Mobile, 2009

A training based scheme for communicating over unknown channels with feedback.

[DOI]

Proceedings of the 47th Annual Allerton Conference on Communication, 2009

Sequential team form and its simplification using graphical models.

[DOI]

Proceedings of the 47th Annual Allerton Conference on Communication, 2009

2008

On the design of globally optimal communication strategies for real-time noisy communication systems with noisy feedback.

[DOI]

IEEE J. Sel. Areas Commun., 2008

A Soil Moisture Smart Sensor Web using Data Assimilation and Optimal Control: Formulation and First Laboratory Demonstration.

[DOI]

Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2008

Identifying tractable decentralized control problems on the basis of information structure.

[DOI]

Proceedings of the 46th Annual Allerton Conference on Communication, 2008

2007

On Real-Time Communication Systems with Noisy Feedback

[DOI]

CoRR, 2007

2006

A novel method for down-conversion of multiple bandpass signals.

[DOI]

Manu Agarwal

Ajit Kumar Chaturvedi

IEEE Trans. Wirel. Commun., 2006

Fixed Delay Optimal Joint Source-Channel Coding for Finite-Memory Systems.

[DOI]

Proceedings of the Proceedings 2006 IEEE International Symposium on Information Theory, 2006

Optimal Performance of Feedback Control Systems with Limited Communication over Noisy Channels.

[DOI]

Proceedings of the 45th IEEE Conference on Decision and Control, 2006

2005

An improved interpretation of depletion approximation in p-n-junctions.

[DOI]

Baquer Mazhari

IEEE Trans. Educ., 2005

A Decision Theoretic Framework for Real-Time Communication

[DOI]