Mohammad Sadegh Talebi

Orcid: 0000-0002-1934-7421

According to our database, Mohammad Sadegh Talebi authored at least 62 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.







Scaling Power Management in Cloud Data Centers: A Multi-Level Continuous-Time MDP Approach.
IEEE Trans. Serv. Comput., 2024

Provably Efficient Exploration in Reward Machines with Low Regret.
CoRR, 2024

No-regret Exploration in Shuffle Private Reinforcement Learning.
CoRR, 2024

Tractable Offline Learning of Regular Decision Processes.
CoRR, 2024

How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?
CoRR, 2024

Double Graph Attention Networks for Visual Semantic Navigation.
Neural Process. Lett., December, 2023

Scaling Up Q-Learning via Exploiting State-Action Equivalence.
Entropy, April, 2023

Provably Efficient Offline Reinforcement Learning in Regular Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Exploration in Reward Machines with Low Regret.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Logarithmic regret in communicating MDPs: Leveraging known dynamics with bandits.
Proceedings of the Asian Conference on Machine Learning, 2023

SIFTER: Space-Efficient Value Iteration for Finite-Horizon MDPs.
Proc. VLDB Endow., 2022

Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks With Strict-Sense Stationary and Non-Stationary Wireless Communication Channels.
IEEE Open J. Commun. Soc., 2022

Improved Exploration in Factored Average-Reward MDPs.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Adversarial Bandits with Corruptions: Regret Lower Bound and No-regret Algorithm.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Tightening Exploration in Upper Confidence Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Learning Multiple Markov Chains via Adaptive Allocation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Model-Based Reinforcement Learning Exploiting State-Action Equivalence.
Proceedings of The 11th Asian Conference on Machine Learning, 2019

Multiperiod Network Rate Allocation With End-to-End Delay Constraints.
IEEE Trans. Control. Netw. Syst., 2018

Stochastic Online Shortest Path Routing: The Value of Feedback.
IEEE Trans. Autom. Control., 2018

Learning Proportionally Fair Allocations with Low Regret.
Proc. ACM Meas. Anal. Comput. Syst., 2018

Competitive Online Scheduling Algorithms with Applications in Deadline-Constrained EV Charging.
Proceedings of the 26th IEEE/ACM International Symposium on Quality of Service, 2018

Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs.
Proceedings of the Algorithmic Learning Theory, 2018

An Optimal Algorithm for Stochastic Matroid Bandit Optimization.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Scalable and Fair Admission Control for On-Chip Nanophotonic Crossbars.
CoRR, 2015

Stochastic and Adversarial Combinatorial Bandits.
CoRR, 2015

Combinatorial Bandits Revisited.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Utility-optimal dynamic rate allocation under average end-to-end delay requirements.
Proceedings of the 54th IEEE Conference on Decision and Control, 2015

Joint multipath rate control and scheduling for SVC streams in wireless mesh networks.
Int. J. Ad Hoc Ubiquitous Comput., 2014

Maximizing Quality of Aggregation in Delay-Constrained Wireless Sensor Networks.
IEEE Commun. Lett., 2013

Uncoupled Learning Rules for Seeking Equilibria in Repeated Plays: An Overview.
CoRR, 2013

Spectrum bandit optimization.
Proceedings of the 2013 IEEE Information Theory Workshop, 2013

Content-aware rate allocation for efficient video streaming via dynamic network utility maximization.
J. Netw. Comput. Appl., 2012

A key-distribution mechanism for wireless sensor networks using Zig-Zag product.
Int. J. Ad Hoc Ubiquitous Comput., 2012

Exploring playback continuity and delay trade-off in peer-to-peer streaming.
Proceedings of the 2012 IEEE Symposium on Computers and Communications, 2012

NUM-based rate allocation for streaming traffic via Sequential Convex Programming.
Proceedings of IEEE International Conference on Communications, 2012

Quasi-Optimal Network Utility Maximization for Scalable Video Streaming.
CoRR, 2011

Cost-aware monitoring of network-wide aggregates in wireless sensor networks.
Comput. Networks, 2011

Proportionally fair flow control mechanism for best effort traffic in network-on-chip architectures.
Int. J. Parallel Emergent Distributed Syst., 2010

Utility-proportional bandwidth sharing for multimedia transmission supporting scalable video coding.
Comput. Commun., 2010

Cost-aware reactive monitoring in resource-constrained wireless sensor networks.
Proceedings of the 2009 IEEE Wireless Communications and Networking Conference, 2009

Rate Control for Scalable Multimedia Applications in Network-on-Chips.
Proceedings of the International Conference on Scalable Computing and Communications / Eighth International Conference on Embedded Computing, 2009

Optimization bandwidth sharing for multimedia transmission supporting scalable video coding.
Proceedings of the 34th Annual IEEE Conference on Local Computer Networks, 2009

Throughput-fairness tradeoff in Best Effort flow control for on-chip architectures.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Flooding-Assisted Threshold Assignment for Aggregate Monitoring in Sensor Networks.
Proceedings of the Distributed Computing and Networking, 10th International Conference, 2009

A Suboptimal Network Utility Maximization Approach for Scalable Multimedia Applications.
Proceedings of the Global Communications Conference, 2009. GLOBECOM 2009, Honolulu, Hawaii, USA, 30 November, 2009

On the Connectivity of Key-Distribution Strategies in Wireless Sensor Networks.
Proceedings of the Global Communications Conference, 2009. GLOBECOM 2009, Honolulu, Hawaii, USA, 30 November, 2009

Distributed Threshold Selection for Aggregate Threshold Monitoring in Sensor Networks.
Proceedings of the 6th IEEE Consumer Communications and Networking Conference, 2009

Loss-Aware Geographic Routing for Unreliable Wireless Sensor Networks.
Proceedings of the 6th IEEE Consumer Communications and Networking Conference, 2009

Source Location Anonymity for Sensor Networks.
Proceedings of the 6th IEEE Consumer Communications and Networking Conference, 2009

On the Stability of Best Effort Flow Control Mechanisms in On-Chip Architectures.
Proceedings of the 16th International Symposium on Modeling, 2008

A Novel Congestion Control Scheme in Network-on-Chip Based on Best Effort Delay-Sum Optimization.
Proceedings of the 9th International Symposium on Parallel Architectures, 2008

Proportionally-fair best effort flow control in network-on-chip architectures.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Maximizing Download Bandwidth for File Sharing in BitTorrent-like Peer-to-Peer Networks.
Proceedings of the 14th International Conference on Parallel and Distributed Systems, 2008

Mathematical analysis of buffer sizing for Network-on-Chips under multimedia traffic.
Proceedings of the 26th International Conference on Computer Design, 2008

Max-Min-Fair Best Effort Flow Control in Network-on-Chip Architectures.
Proceedings of the Computational Science, 2008

Best Effort Flow Control in Network-on-Chip.
Proceedings of the Advances in Computer Science and Engineering, 2008

A Novel Flow Control Scheme for Best Effort Traffic in NoC Based on Source Rate Utility Maximization.
Proceedings of the 15th International Symposium on Modeling, 2007

Secure consensus averaging for secure information fusion in sensor networks.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

A Novel Congestion Control Scheme for Elastic Flows in Network-on-Chip Based on Sum-Rate Optimization.
Proceedings of the Computational Science and Its Applications, 2007

Utility Constrained Energy Minimization In Aloha Networks.
Proceedings of the 4th IEEE Consumer Communications and Networking Conference, 2007

On Secure Consensus Information Fusion over Sensor Networks.
Proceedings of the 2007 IEEE/ACS International Conference on Computer Systems and Applications (AICCSA 2007), 2007

Adaptive Consensus Averaging for Information Fusion over Sensor Networks.
Proceedings of the IEEE 3rd International Conference on Mobile Adhoc and Sensor Systems, 2006
