Longbo Huang

ORCID: 0000-0002-7341-447X

According to our database, Longbo Huang authored at least 138 papers between 2008 and 2024.

Bibliography

2024
When Lyapunov Drift Based Queue Scheduling Meets Adversarial Bandit Learning.
IEEE/ACM Trans. Netw., August, 2024

Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning.
IEEE/ACM Trans. Netw., June, 2024

Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration.
CoRR, 2024

uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs.
CoRR, 2024

Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks.
CoRR, 2024

Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training.
CoRR, 2024

Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks.
CoRR, 2024

Mixed Sparsity Training: Achieving 4× FLOP Reduction for Transformer Pretraining.
CoRR, 2024

Provably Efficient Partially Observable Risk-sensitive Reinforcement Learning with Hindsight Observation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

A Quadratic Synchronization Rule for Distributed Deep Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Learning-based Optimal Quantum Switch Scheduling.
SIGMETRICS Perform. Evaluation Rev., September, 2023

Network Topology Optimization via Deep Reinforcement Learning.
IEEE Trans. Commun., May, 2023

Online Min-max Problems with Non-convexity and Non-stationarity.
Trans. Mach. Learn. Res., 2023

LCM-LoRA: A Universal Stable-Diffusion Acceleration Module.
CoRR, 2023

One is More: Diverse Perspectives within a Single Network for Efficient DRL.
CoRR, 2023

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference.
CoRR, 2023

Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning.
CoRR, 2023

Queue Scheduling with Adversarial Bandit Learning.
CoRR, 2023

Stochastic Generative Flow Networks.
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2023

Provably Safe Reinforcement Learning with Step-wise Violation Constraints.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning.
Proceedings of the International Conference on Machine Learning, 2023

Multi-task Representation Learning for Pure Exploration in Linear Bandits.
Proceedings of the International Conference on Machine Learning, 2023

RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generative Augmented Flow Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Towards Minimax Optimal Reward-free Reinforcement Learning in Linear MDPs.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Why (and When) does Local SGD Generalize Better than SGD?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Collaborative Pure Exploration in Kernel Bandit.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

RePreM: Representation Pre-training with Masked Model for Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Quantum Network: Security Assessment and Key Management.
IEEE/ACM Trans. Netw., 2022

A Unified Framework for User Identification Across Online and Offline Data.
IEEE Trans. Knowl. Data Eng., 2022

Addendum and Erratum to "The MDS Queue: Analysing the Latency Performance of Erasure Codes".
IEEE Trans. Inf. Theory, 2022

Risk-Sensitive Reinforcement Learning: Iterated CVaR and the Worst Path.
CoRR, 2022

RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch.
CoRR, 2022

Provable Generalization of Overparameterized Meta-learning Trained with SGD.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Effective multi-user delay-constrained scheduling with deep recurrent reinforcement learning.
Proceedings of the MobiHoc '22: The Twenty-third International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, Seoul, Republic of Korea, October 17, 2022

Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification.
Proceedings of the International Conference on Machine Learning, 2022

Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably).
Proceedings of the International Conference on Machine Learning, 2022

Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits.
Proceedings of the International Conference on Machine Learning, 2022

Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation.
Proceedings of the International Conference on Machine Learning, 2022

Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Simultaneously achieving sublinear regret and constraint violations for online convex optimization with time-varying constraints.
Perform. Evaluation, 2021

What Makes Multimodal Learning Better than Single (Provably).
CoRR, 2021

Softmax with Regularization: Better Value Estimation in Multi-Agent Reinforcement Learning.
CoRR, 2021

Exploration in policy optimization through multiple paths.
Auton. Agents Multi Agent Syst., 2021

Robust Wireless Scheduling under Arbitrary Channel Dynamics and Feedback Delay (Invited Paper).
Proceedings of the 33rd International Teletraffic Congress ITC 2021, Avignon, France, August 31, 2021

Regularized Softmax Deep Multi-Agent Q-Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-Agent Reinforcement Learning in Stochastic Networked Systems.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

What Makes Multi-Modal Learning Better than Single (Provably).
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Fast Federated Learning in the Presence of Arbitrary Device Unavailability.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Continuous Mean-Covariance Bandits.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

A One-Size-Fits-All Solution to Conservative Bandit Problems.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Fast-Convergent Learning-Aided Control in Energy Harvesting Networks.
IEEE Trans. Mob. Comput., 2020

Heavy traffic analysis of approximate max-weight matching algorithms for input-queued switches.
Perform. Evaluation, 2020

Loyalty programs in the sharing economy: Optimality and competition.
Perform. Evaluation, 2020

Distributed Reinforcement Learning in Multi-Agent Networked Systems.
CoRR, 2020

Exploration by Maximizing Rényi Entropy for Zero-Shot Meta RL.
CoRR, 2020

Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Softmax Deep Double Deterministic Policy Gradients.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

RTCP - Reduce Delay Variability with an End-to-end Approach.
Proceedings of the 2020 IFIP Networking Conference, 2020

Terminator: An Efficient and Light-weight Fault Localization Framework.
Proceedings of the 39th IEEE Conference on Computer Communications, 2020

Reinforcement Learning with Dynamic Boltzmann Softmax Updates.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Combinatorial Pure Exploration for Dueling Bandit.
Proceedings of the 37th International Conference on Machine Learning, 2020

Multi-Path Policy Optimization.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Dueling Bandits: From Two-dueling to Multi-dueling.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

2019
Prices and subsidies in the sharing economy.
Perform. Evaluation, 2019

Security assessment and key management in a quantum network.
CoRR, 2019

Reinforcement Learning with Dynamic Boltzmann Softmax Updates.
CoRR, 2019

Double Quantization for Communication-Efficient Distributed Optimization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Two-Scale Stochastic Control for Integrated Multipoint Communication Systems With Renewables.
IEEE Trans. Smart Grid, 2018

Learning-Aided Stochastic Network Optimization With State Prediction.
IEEE/ACM Trans. Netw., 2018

Timely-Throughput Optimal Scheduling With Prediction.
IEEE/ACM Trans. Netw., 2018

A New Analysis of Variance Reduced Stochastic Proximal Methods for Composite Optimization with Serial and Asynchronous Realizations.
CoRR, 2018

Rebalancing Dockless Bike Sharing Systems.
CoRR, 2018

Multi-armed Bandits with Compensation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Social Interaction Activity based Time-Varying User Vectorization Method for Online Social Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Beyond the Click-Through Rate: Web Link Selection with Multi-level Feedback.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
Proactive Serving Decreases User Delay Exponentially: The Light-Tailed Service Time Case.
IEEE/ACM Trans. Netw., 2017

Intelligence of Smart Systems: Model, Bounds, and Algorithms.
IEEE/ACM Trans. Netw., 2017

The Value-of-Information in Matching With Queues.
IEEE/ACM Trans. Netw., 2017

Optimal Sleep-Wake Scheduling for Energy Harvesting Smart Mobile Devices.
IEEE Trans. Mob. Comput., 2017

The MDS Queue: Analysing the Latency Performance of Erasure Codes.
IEEE Trans. Inf. Theory, 2017

Time-Average Optimization With Nonconvex Decision Set and Its Convergence.
IEEE Trans. Autom. Control., 2017

Human-in-the-Loop Mobile Networks: A Survey of Recent Advancements.
IEEE J. Sel. Areas Commun., 2017

Learning-aided Stochastic Network Optimization with Imperfect State Prediction.
Proceedings of the 18th ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2017

Fast Stochastic Variance Reduced ADMM for Stochastic Composition Optimization.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Multi-level Feedback Web Links Selection Problem: Learning and Optimization.
Proceedings of the 2017 IEEE International Conference on Data Mining, 2017

2016
When Backpressure Meets Predictive Scheduling.
IEEE/ACM Trans. Netw., 2016

Power-Delay Tradeoff With Predictive Scheduling in Integrated Cellular and Wi-Fi Networks.
IEEE J. Sel. Areas Commun., 2016

Two-Scale Stochastic Control for Multipoint Communication Systems with Renewables.
CoRR, 2016

Boosting Sharing Economy: Social Welfare or Revenue Driven?
CoRR, 2016

Learning-aided scheduling for mobile virtual network operators with QoS constraints.
Proceedings of the 14th International Symposium on Modeling and Optimization in Mobile, 2016

System intelligence: model, bounds and algorithms.
Proceedings of the 17th ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2016

Age-of-information in the presence of error.
Proceedings of the IEEE International Symposium on Information Theory, 2016

Two-Scale Stochastic Control for Smart-Grid Powered Coordinated Multi-Point Systems.
Proceedings of the 2016 IEEE Global Communications Conference, 2016

User identification in cyber-physical space: a case study on mobile query logs and trajectories.
Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS 2016, Burlingame, California, USA, October 31, 2016

Increasing large-scale data center capacity by statistical power control.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

Online microgrid energy generation scheduling revisited: the benefits of randomization and interval prediction.
Proceedings of the Seventh International Conference on Future Energy Systems, Waterloo, ON, Canada, June 21, 2016

Market share analysis with brand effect.
Proceedings of the 55th IEEE Conference on Decision and Control, 2016

Market Share Analysis with Brand Effect: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015
A Comment on "Power Cost Reduction in Distributed Data Centers: A Two Time Scale Approach for Delay Tolerant Workloads".
IEEE Trans. Parallel Distributed Syst., 2015

Proactive Serving Decreases User Delay Exponentially.
SIGMETRICS Perform. Evaluation Rev., 2015

Receding learning-aided control in stochastic networks.
Perform. Evaluation, 2015

A Methodology for Designing the Control of Energy Harvesting Sensor Nodes.
IEEE J. Sel. Areas Commun., 2015

Optimizing power consumption of mobile games.
Proceedings of the Workshop on Power-Aware Computing and Systems, 2015

Optimizing age-of-information in a multi-class queueing system.
Proceedings of the IEEE International Symposium on Information Theory, 2015

Online Energy Management Strategy for Hybrid Electric Vehicle.
Proceedings of the 2015 ACM Sixth International Conference on Future Energy Systems, 2015

2014
Power Cost Reduction in Distributed Data Centers: A Two-Time-Scale Approach for Delay Tolerant Workloads.
IEEE Trans. Parallel Distributed Syst., 2014

Effect of proactive serving on user delay reduction in service systems.
Proceedings of the ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems, 2014

The power of online learning in stochastic network optimization.
Proceedings of the ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems, 2014

The multi-shop ski rental problem.
Proceedings of the ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems, 2014

Delay-aware Predictive Network Selection in data offloading.
Proceedings of the 2014 IEEE INFOCOM Workshops, Toronto, ON, Canada, April 27, 2014

When queueing meets coding: Optimal-latency data retrieving scheme in storage clouds.
Proceedings of the 2014 IEEE Conference on Computer Communications, 2014

Predictive delay-aware network selection in data offloading.
Proceedings of the IEEE Global Communications Conference, 2014

Control of systems that store renewable energy.
Proceedings of the Fifth International Conference on Future Energy Systems, 2014

Optimizing your online-advertisement asynchronously.
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014

2013
Utility Optimal Scheduling in Energy-Harvesting Networks.
IEEE/ACM Trans. Netw., 2013

LIFO-Backpressure Achieves Near-Optimal Utility-Delay Tradeoff.
IEEE/ACM Trans. Netw., 2013

Optimal distributed broadcasting with per-neighbor queues in acyclic overlay networks with arbitrary underlay capacity constraints.
Proceedings of the 2013 IEEE International Symposium on Information Theory, 2013

A Benes packet network.
Proceedings of the IEEE INFOCOM 2013, Turin, Italy, April 14-19, 2013

2012
Optimal demand response with energy storage management.
Proceedings of the IEEE Third International Conference on Smart Grid Communications, 2012

Optimal smart grid tariffs.
Proceedings of the 2012 Information Theory and Applications Workshop, 2012

Codes can reduce queueing delay in data centers.
Proceedings of the 2012 IEEE International Symposium on Information Theory, 2012

Data centers power reduction: A two time scale approach for delay tolerant workloads.
Proceedings of the IEEE INFOCOM 2012, Orlando, FL, USA, March 25-30, 2012

A multi-layer market for vehicle-to-grid energy trading in the smart grid.
Proceedings of the 2012 IEEE INFOCOM Workshops, 2012

2011
Delay reduction via Lagrange multipliers in stochastic network optimization.
IEEE Trans. Autom. Control., 2011

Utility optimal scheduling in processing networks.
Perform. Evaluation, 2011

Delay efficient scheduling via redundant constraints in multihop networks.
Perform. Evaluation, 2011

2010
The optimality of two prices: maximizing revenue in a stochastic communication system.
IEEE/ACM Trans. Netw., 2010

Dynamic product assembly and inventory control for maximum profit.
Proceedings of the 49th IEEE Conference on Decision and Control, 2010

2008
A negotiation game for multichannel access in cognitive radio networks.
Proceedings of the 4th Annual International Conference on Wireless Internet, 2008

