Sihan Zeng

SIAM J. Optim., March, 2024

Approximate Equivariance in Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

Partially Observable Contextual Bandits with Linear Payoffs.

[BibT_eX]

[DOI]

CoRR, 2024

Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

QCQP-Net: Reliably learning feasible alternating current optimal power flow solutions under constraints.

[BibT_eX]

[DOI]

Proceedings of the 6th Annual Learning for Dynamics & Control Conference, 2024

Fast two-time-scale stochastic gradient method with applications in reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

2023

Finite-Time Convergence Rates of Decentralized Stochastic Approximation With Applications in Multi-Agent and Multi-Task Learning.

[BibT_eX]

[DOI]

IEEE Trans. Autom. Control., May, 2023

Near-Optimal Fair Resource Allocation for Strategic Agents without Money: A Data-Driven Approach.

[BibT_eX]

[DOI]

CoRR, 2023

InkGAN: Generative Adversarial Networks for Ink-And-Wash Style Transfer of Photographs.

[BibT_eX]

[DOI]

Adv. Artif. Intell. Mach. Learn., 2023

Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Sequential Fair Resource Allocation under a Markov Decision Process Framework.

[BibT_eX]

[DOI]

Proceedings of the 4th ACM International Conference on AI in Finance, 2023

2022

An unsupervised font style transfer model based on generative adversarial networks.

[BibT_eX]

[DOI]

Zhongliang Pan

Multim. Tools Appl., 2022

Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes.

[BibT_eX]

[DOI]

Proceedings of the 61st IEEE Conference on Decision and Control, 2022

2021

Predictability and Prediction of Human Mobility Based on Application-Collected Location Data.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., 2021

A Reinforcement Learning Approach to Parameter Selection for Distributed Optimization in Power Systems.

[BibT_eX]

[DOI]

CoRR, 2021

A decentralized policy gradient approach to multi-task reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Finite-Time Analysis of Decentralized Stochastic Approximation with Applications in Multi-Agent and Multi-Task Learning.

[BibT_eX]

[DOI]

Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2020

Human Mobility Prediction Using Sparse Trajectory Data.

[BibT_eX]

[DOI]

IEEE Trans. Veh. Technol., 2020

H-DrunkWalk: Collaborative and Adaptive Navigation for Heterogeneous MAV Swarm.

[BibT_eX]

[DOI]

Xinlei Chen

Carlos Ruiz Dominguez

ACM Trans. Sens. Networks, 2020

2019

An approach for medical event detection in Chinese clinical notes of electronic health records.

[BibT_eX]

[DOI]

BMC Medical Informatics Decis. Mak., 2019

Modeling Spatio-Temporal App Usage for a Large User Population.

[BibT_eX]

[DOI]

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2019

Fast Compressive Sensing Recovery Using Generative Models with Structured Latent Variables.

[BibT_eX]

[DOI]

Shaojie Xu

Proceedings of the IEEE International Conference on Acoustics, 2019

Toward A High-Performance Emulation Platformfor Brain-Inspired Intelligent SystemsExploring Dataflow-Based Execution Model and Beyond.

[BibT_eX]

[DOI]