2025
A Theoretical Justification for Asymmetric Actor-Critic Algorithms.
CoRR, January, 2025
2024
On Learning Whittle Index Policy for Restless Bandits With Scalable Regret.
IEEE Trans. Control. Netw. Syst., September, 2024
Strong Consistency and Rate of Convergence of Switched Least Squares System Identification for Autonomous Markov Jump Linear Systems.
IEEE Trans. Autom. Control., June, 2024
Two families of indexable partially observable restless bandits and Whittle index computation.
Perform. Evaluation, 2024
Concentration of Cumulative Reward in Markov Decision Processes.
CoRR, 2024
Model approximation in MDPs with unbounded per-step cost.
CoRR, 2024
Periodic agent-state based Q-learning for POMDPs.
Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Bridging State and History Representations: Understanding Self-Predictive RL.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Agent-state based policies in POMDPs: Beyond belief-state MDPs.
Proceedings of the 63rd IEEE Conference on Decision and Control, 2024
Asymptotic Normality of Cumulative Cost in Linear Quadratic Regulators.
Proceedings of the 63rd IEEE Conference on Decision and Control, 2024
A vector almost-supermartingale convergence theorem and its applications.
Proceedings of the 63rd IEEE Conference on Decision and Control, 2024
Constant step-size stochastic approximation with delayed updates.
Proceedings of the 63rd IEEE Conference on Decision and Control, 2024
On learning history-based policies for controlling Markov decision processes.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024
2023
Decentralized Linear Quadratic Systems With Major and Minor Agents and Non-Gaussian Noise.
IEEE Trans. Autom. Control., August, 2023
Scalable Regret for Learning to Control Network-Coupled Subsystems With Unknown Dynamics.
IEEE Trans. Control. Netw. Syst., March, 2023
Robustness and Sample Complexity of Model-Based MARL for General-Sum Markov Games.
Dyn. Games Appl., March, 2023
Mean-field games among teams.
CoRR, 2023
Approximate information state based convergence analysis of recurrent Q-learning.
CoRR, 2023
Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023
Asymmetric Actor-Critic with Approximate Information State.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023
Relative Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023
Weighted-Norm Bounds on Model Approximation in MDPs with Unbounded Per-Step Cost.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023
2022
Optimal Control of Network-Coupled Subsystems: Spectral Decomposition and Low-Dimensional Solutions.
IEEE Trans. Control. Netw. Syst., 2022
Scalable Operator Allocation for Multirobot Assistance: A Restless Bandit Approach.
IEEE Trans. Control. Netw. Syst., 2022
Structure-Aware Reinforcement Learning for Node-Overload Protection in Mobile Edge Computing.
IEEE Trans. Cogn. Commun. Netw., 2022
Approximate Information State for Approximate Planning and Reinforcement Learning in Partially Observed Systems.
J. Mach. Learn. Res., 2022
Consistency and Rate of Convergence of Switched Least Squares System Identification for Autonomous Markov Jump Linear Systems.
Proceedings of the 61st IEEE Conference on Decision and Control, 2022
Thompson-Sampling Based Reinforcement Learning for Networked Control of Unknown Linear Systems.
Proceedings of the 61st IEEE Conference on Decision and Control, 2022
A modified Thompson sampling-based learning algorithm for unknown linear systems.
Proceedings of the 61st IEEE Conference on Decision and Control, 2022
Partially observable restless bandits with restarts: indexability and computation of Whittle index.
Proceedings of the 61st IEEE Conference on Decision and Control, 2022
2021
Multi-Agent Estimation and Filtering for Minimizing Team Mean-Squared Error.
IEEE Trans. Signal Process., 2021
Consistency and Rate of Convergence of Switched Least Squares System Identification for Autonomous Switched Linear Systems.
CoRR, 2021
Scalable Operator Allocation for Multi-Robot Assistance: A Restless Bandit Approach.
CoRR, 2021
A relaxed technical assumption for posterior sampling-based reinforcement learning for control of unknown linear systems.
CoRR, 2021
Maintenance of a collection of machines under partial observability: Indexability and computation of Whittle index.
CoRR, 2021
Mean-field approximation for large-population beauty-contest games.
Proceedings of the 60th IEEE Conference on Decision and Control, 2021
Decision Referrals in Human-Automation Teams.
Proceedings of the 60th IEEE Conference on Decision and Control, 2021
Thompson sampling for linear quadratic mean-field teams.
Proceedings of the 60th IEEE Conference on Decision and Control, 2021
2020
Counterexamples on the Monotonicity of Delay Optimal Strategies for Energy Harvesting Transmitters.
IEEE Wirel. Commun. Lett., 2020
Renewal Monte Carlo: Renewal Theory-Based Reinforcement Learning.
IEEE Trans. Autom. Control., 2020
Remote Estimation Over a Packet-Drop Channel With Markovian State.
IEEE Trans. Autom. Control., 2020
Optimal Local and Remote Controllers With Unreliable Uplink Channels: An Elementary Proof.
IEEE Trans. Autom. Control., 2020
Restless bandits: indexability and computation of Whittle index.
CoRR, 2020
Cross-layer communication over fading channels with adaptive decision feedback.
Proceedings of the 18th International Symposium on Modeling and Optimization in Mobile, 2020
Completeness Seeking Probabilistic Coverage Estimation using Uncertain State Estimates.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020
2019
Dynamic spectrum access under partial observations: A restless bandit approach.
Proceedings of the 16th Canadian Workshop on Information Theory, 2019
Approximate information state for partially observed systems.
Proceedings of the 58th IEEE Conference on Decision and Control, 2019
Networked control of coupled subsystems: Spectral decomposition and low-dimensional solutions.
Proceedings of the 58th IEEE Conference on Decision and Control, 2019
Restless bandits with controlled restarts: Indexability and computation of Whittle index.
Proceedings of the 58th IEEE Conference on Decision and Control, 2019
Reinforcement Learning in Stationary Mean-field Games.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
2018
Information-Theoretic Privacy for Smart Metering Systems with a Rechargeable Battery.
IEEE Trans. Inf. Theory, 2018
Sufficient Conditions for the Value Function and Optimal Strategy to be Even and Quasi-Convex.
IEEE Trans. Autom. Control., 2018
On Controllability of Leader-Follower Dynamics over a Directed Graph.
Proceedings of the 57th IEEE Conference on Decision and Control, 2018
Optimal sampling of multiple linear processes over a shared medium.
Proceedings of the 57th IEEE Conference on Decision and Control, 2018
Team Optimal Decentralized State Estimation.
Proceedings of the 57th IEEE Conference on Decision and Control, 2018
2017
Fundamental Limits of Remote Estimation of Autoregressive Markov Processes Under Communication Constraints.
IEEE Trans. Autom. Control., 2017
Remote Estimation over Control Area Networks.
Proceedings of the 86th IEEE Vehicular Technology Conference, 2017
Structure of optimal strategies for remote estimation over Gilbert-Elliott channel with feedback.
Proceedings of the 2017 IEEE International Symposium on Information Theory, 2017
On privacy in smart metering systems with periodically time-varying input distribution.
Proceedings of the 2017 IEEE Global Conference on Signal and Information Processing, 2017
Multi-armed bandits for efficient lifetime estimation in MPSoC design.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017
Stochastic approximation based methods for computing the optimal thresholds in remote-state estimation with packet drops.
Proceedings of the 2017 American Control Conference, 2017
2016
Decentralized stochastic control.
Ann. Oper. Res., 2016
Structural results for two-user interactive communication.
Proceedings of the IEEE International Symposium on Information Theory, 2016
Privacy-optimal strategies for smart metering systems with a rechargeable battery.
Proceedings of the 2016 American Control Conference, 2016
2015
Sufficient Statistics for Linear Control Strategies in Decentralized Systems With Partial History Sharing.
IEEE Trans. Autom. Control., 2015
Fundamental limits of remote estimation of Markov processes under communication constraints.
CoRR, 2015
An algorithmic approach to identify irrelevant information in sequential teams.
Autom., 2015
Structure of optimal privacy-preserving policies in smart-metered systems with a rechargeable battery.
Proceedings of the 16th IEEE International Workshop on Signal Processing Advances in Wireless Communications, 2015
Distortion-transmission trade-off in real-time transmission of Markov sources.
Proceedings of the 2015 IEEE Information Theory Workshop, 2015
Distortion-transmission trade-off in real-time transmission of Gauss-Markov sources.
Proceedings of the IEEE International Symposium on Information Theory, 2015
On computing optimal thresholds in decentralized sequential hypothesis testing.
Proceedings of the 54th IEEE Conference on Decision and Control, 2015
Team-optimal solution of finite number of mean-field coupled LQG subsystems.
Proceedings of the 54th IEEE Conference on Decision and Control, 2015
An estimation based allocation rule with super-linear regret and finite lock-on time for time-dependent multi-armed bandit processes.
Proceedings of the IEEE 28th Canadian Conference on Electrical and Computer Engineering, 2015
Reinforcement learning in decentralized stochastic control systems with partial history sharing.
Proceedings of the American Control Conference, 2015
2014
Simultaneous real-time transmission of multiple Markov sources over a shared channel.
Proceedings of the 2014 IEEE International Symposium on Information Theory, Honolulu, HI, USA, June 29, 2014
On the optimal thresholds in remote state estimation with communication costs.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014
Team optimal control of coupled subsystems with mean-field sharing.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014
Average cost optimal threshold strategies for remote estimation with communication cost.
Proceedings of the 52nd Annual Allerton Conference on Communication, 2014
2013
Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach.
IEEE Trans. Autom. Control., 2013
Optimal Decentralized Control of Coupled Subsystems With Control Sharing.
IEEE Trans. Autom. Control., 2013
Forensic Analysis of Instant Messenger Applications on Android Devices.
CoRR, 2013
Static LQG teams with countably infinite players.
Proceedings of the 52nd IEEE Conference on Decision and Control, 2013
2012
Opportunistic Capacity and Error Exponent Region for the Compound Channel With Feedback.
IEEE Trans. Inf. Theory, 2012
Age Based User Interface in Mobile Operating System.
CoRR, 2012
Structure of optimal policies in active sensing.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
On optimal block Markov coding schemes for multiple-access channel with feedback.
Proceedings of the 46th Annual Conference on Information Sciences and Systems, 2012
Information structures in optimal decentralized control.
Proceedings of the 51st IEEE Conference on Decision and Control, 2012
2011
Optimal Control Strategies in Delayed Sharing Information Structures.
IEEE Trans. Autom. Control., 2011
2010
A Wireless Soil Moisture Smart Sensor Web Using Physics-Based Optimal Control: Concept and Initial Demonstrations.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2010
Measurement Scheduling for Soil Moisture Sensing: From Physical Models to Optimal Control.
Proc. IEEE, 2010
Optimal sequential transmission over broadcast channel with nested feedback.
CoRR, 2010
Opportunistic capacity and error exponent regions for compound channel with feedback.
Proceedings of the IEEE International Symposium on Information Theory, 2010
Opportunistic capacity and error exponent regions for variable length communication over compound channel with feedback.
Proceedings of the 44th Annual Conference on Information Sciences and Systems, 2010
A structural result for delayed sharing information structures.
Proceedings of the American Control Conference, 2010
Measure and cost dependent properties of information structures.
Proceedings of the American Control Conference, 2010
Optimal transmission policies for two-user multiple access broadcast using dynamic team theory.
Proceedings of the 48th Annual Allerton Conference on Communication, 2010
2009
Optimal design of sequential real-time communication systems.
IEEE Trans. Inf. Theory, 2009
Optimal Performance of Networked Control Systems with Nonclassical Information Structures.
SIAM J. Control. Optim., 2009
A Graphical modeling approach to simplifying sequential teams.
Proceedings of the 7th International Symposium on Modeling and Optimization in Mobile, 2009
A training based scheme for communicating over unknown channels with feedback.
Proceedings of the 47th Annual Allerton Conference on Communication, 2009
Sequential team form and its simplification using graphical models.
Proceedings of the 47th Annual Allerton Conference on Communication, 2009
2008
On the design of globally optimal communication strategies for real-time noisy communication systems with noisy feedback.
IEEE J. Sel. Areas Commun., 2008
A Soil Moisture Smart Sensor Web using Data Assimilation and Optimal Control: Formulation and First Laboratory Demonstration.
Proceedings of the IEEE International Geoscience & Remote Sensing Symposium, 2008
Identifying tractable decentralized control problems on the basis of information structure.
Proceedings of the 46th Annual Allerton Conference on Communication, 2008
2007
On Real-Time Communication Systems with Noisy Feedback.
CoRR, 2007
2006
A novel method for down-conversion of multiple bandpass signals.
IEEE Trans. Wirel. Commun., 2006
Fixed Delay Optimal Joint Source-Channel Coding for Finite-Memory Systems.
Proceedings of the 2006 IEEE International Symposium on Information Theory, 2006
Optimal Performance of Feedback Control Systems with Limited Communication over Noisy Channels.
Proceedings of the 45th IEEE Conference on Decision and Control, 2006
2005
An improved interpretation of depletion approximation in p-n-junctions.
IEEE Trans. Educ., 2005
A Decision Theoretic Framework for Real-Time Communication.
CoRR, 2005