Sihan Zeng

Orcid: 0000-0002-0061-5780

According to our database1, Sihan Zeng authored at least 23 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning.
SIAM J. Optim., March, 2024

Partially Observable Contextual Bandits with Linear Payoffs.
CoRR, 2024

Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning.
CoRR, 2024

QCQP-Net: Reliably learning feasible alternating current optimal power flow solutions under constraints.
Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

Fast two-time-scale stochastic gradient method with applications in reinforcement learning.
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

2023
Finite-Time Convergence Rates of Decentralized Stochastic Approximation With Applications in Multi-Agent and Multi-Task Learning.
IEEE Trans. Autom. Control., May, 2023

Near-Optimal Fair Resource Allocation for Strategic Agents without Money: A Data-Driven Approach.
CoRR, 2023

InkGAN: Generative Adversarial Networks for Ink-And-Wash Style Transfer of Photographs.
Adv. Artif. Intell. Mach. Learn., 2023

Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Sequential Fair Resource Allocation under a Markov Decision Process Framework.
Proceedings of the 4th ACM International Conference on AI in Finance, 2023

2022
An unsupervised font style transfer model based on generative adversarial networks.
Multim. Tools Appl., 2022

Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes.
Proceedings of the 61st IEEE Conference on Decision and Control, 2022

2021
Predictability and Prediction of Human Mobility Based on Application-Collected Location Data.
IEEE Trans. Mob. Comput., 2021

A Reinforcement Learning Approach to Parameter Selection for Distributed Optimization in Power Systems.
CoRR, 2021

A decentralized policy gradient approach to multi-task reinforcement learning.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Finite-Time Analysis of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning.
Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2020
Human Mobility Prediction Using Sparse Trajectory Data.
IEEE Trans. Veh. Technol., 2020

H-DrunkWalk: Collaborative and Adaptive Navigation for Heterogeneous MAV Swarm.
ACM Trans. Sens. Networks, 2020

2019
An approach for medical event detection in Chinese clinical notes of electronic health records.
BMC Medical Informatics Decis. Mak., 2019

Modeling Spatio-Temporal App Usage for a Large User Population.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2019

Fast Compressive Sensing Recovery Using Generative Models with Structured Latent Variables.
Proceedings of the IEEE International Conference on Acoustics, 2019

Toward A High-Performance Emulation Platformfor Brain-Inspired Intelligent SystemsExploring Dataflow-Based Execution Model and Beyond.
Proceedings of the 43rd IEEE Annual Computer Software and Applications Conference, 2019


  Loading...