Niao He

Orcid: 0000-0003-4225-7536

According to our database1, Niao He authored at least 104 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Momentum-Based Policy Gradient with Second-Order Information.
Trans. Mach. Learn. Res., 2024

Finite-Time Analysis of Entropy-Regularized Neural Natural Actor-Critic Algorithm.
Trans. Mach. Learn. Res., 2024

Finite-Time Analysis of Natural Actor-Critic for POMDPs.
SIAM J. Math. Data Sci., 2024

Convergence of Entropy-Regularized Natural Policy Gradient with Linear Function Approximation.
SIAM J. Optim., 2024

On the Crucial Role of Initialization for Matrix Factorization.
CoRR, 2024

Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems.
CoRR, 2024

From Gradient Clipping to Normalization for Heavy Tailed SGD.
CoRR, 2024

Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning.
CoRR, 2024

Multi-level Monte-Carlo Gradient Methods for Stochastic Optimization with Biased Oracles.
CoRR, 2024

Independent Policy Mirror Descent for Markov Potential Games: Scaling to Large Number of Players.
CoRR, 2024

Complexity of Minimizing Projected-Gradient-Dominated Functions with Stochastic First-order Oracles.
CoRR, 2024

Learning to Steer Markovian Agents under Model Uncertainty.
CoRR, 2024

Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes.
CoRR, 2024

A Hessian-Aware Stochastic Differential Equation for Modelling SGD.
CoRR, 2024

Primal Methods for Variational Inequality Problems with Functional Constraints.
CoRR, 2024

Stochastic Optimization under Hidden Convexity.
CoRR, 2024

DPZero: Private Fine-Tuning of Language Models without Backpropagation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Truly No-Regret Learning in Constrained MDPs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

When is Mean-Field Reinforcement Learning Tractable and Relevant?
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Provably Learning Nash Policies in Constrained Markov Potential Games.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Generalization Bounds of Nonconvex-(Strongly)-Concave Stochastic Minimax Optimization.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Independent Learning in Constrained Markov Potential Games.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Parameter-Agnostic Optimization under Relaxed Smoothness.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

On the Statistical Efficiency of Mean-Field Reinforcement Learning with General Function Approximation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Taming Nonconvex Stochastic Mirror Descent with General Bregman Divergence.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Automated Design of Affine Maximizer Mechanisms in Dynamic Settings.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Discrete-Time Switching System Analysis of Q-Learning.
SIAM J. Control. Optim., June, 2023

Sample Complexity and Overparameterization Bounds for Temporal-Difference Learning With Neural Network Approximation.
IEEE Trans. Autom. Control., May, 2023

Provably Convergent Policy Optimization via Metric-aware Trust Region Methods.
Trans. Mach. Learn. Res., 2023

Learning Best Response Policies in Dynamic Auctions via Deep Reinforcement Learning.
CoRR, 2023

Efficiently Escaping Saddle Points for Non-Convex Policy Optimization.
CoRR, 2023

DPZero: Dimension-Independent and Differentially Private Zeroth-Order Optimization.
CoRR, 2023

A Convex Framework for Confounding Robust Inference.
CoRR, 2023

Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes.
CoRR, 2023

Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Two Sides of One Coin: the Limits of Untuned SGD and the Power of Adaptive Methods.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On Imitation in Mean-field Games.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Robust Knowledge Transfer in Tiered Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games.
Proceedings of the International Conference on Machine Learning, 2023

Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies.
Proceedings of the International Conference on Machine Learning, 2023

Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space.
Proceedings of the International Conference on Machine Learning, 2023

TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity.
Proceedings of the 62nd IEEE Conference on Decision and Control, 2023

Kernel Conditional Moment Constraints for Confounding Robust Inference.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Learning to Optimize with Stochastic Dominance Constraints.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Learning to Optimize with Stochastic Dominance Constraints.
CoRR, 2022

Uniform Convergence and Generalization for Nonconvex Stochastic Minimax Problems.
CoRR, 2022

Stochastic Second-Order Methods Provably Beat SGD For Gradient-Dominated Functions.
CoRR, 2022

Adaptive Momentum-Based Policy Gradient with Second-Order Information.
CoRR, 2022

Learning to Control Partially Observed Systems with Finite Memory.
CoRR, 2022

Bring Your Own Algorithm for Optimal Differentially Private Stochastic Minimax Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Nest Your Adaptive Algorithm for Parameter-Agnostic Nonconvex Minimax Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Functions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sharp Analysis of Stochastic Optimization under Global Kurdyka-Lojasiewicz Inequality.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Natural Actor-Critic Framework for Zero-Sum Markov Games.
Proceedings of the International Conference on Machine Learning, 2022

Faster Single-loop Algorithms for Minimax Optimization without Strong Concavity.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Lifted Primal-Dual Method for Bilinearly Coupled Smooth Minimax Optimization.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
Linear Convergence of Entropy-Regularized Natural Policy Gradient with Linear Function Approximation.
CoRR, 2021

Simulation Studies on Deep Reinforcement Learning for Building Control with Human Interaction.
CoRR, 2021

Sample Complexity and Overparameterization Bounds for Projection-Free Neural TD Learning.
CoRR, 2021

The complexity of nonconvex-strongly-concave minimax optimization.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

On the Bias-Variance-Cost Tradeoff of Stochastic Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Optimization for Reinforcement Learning: From a single agent to cooperative agents.
IEEE Signal Process. Mag., 2020

Sample Complexity of Sample Average Approximation for Conditional Stochastic Optimization.
SIAM J. Optim., 2020

Quadratic Decomposable Submodular Function Minimization: Theory and Practice.
J. Mach. Learn. Res., 2020

Provably-Efficient Double Q-Learning.
CoRR, 2020

Biased Stochastic Gradient Descent for Conditional Stochastic Optimization.
CoRR, 2020

Global Convergence and Variance-Reduced Optimization for a Class of Nonconvex-Nonconcave Minimax Problems.
CoRR, 2020

A Catalyst Framework for Minimax Optimization.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

The Devil is in the Detail: A Framework for Macroscopic Prediction via Microscopic Models.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Global Convergence and Variance Reduction for a Class of Nonconvex-Nonconcave Minimax Problems.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

The Mean-Squared Error of Double Q-Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Biased Stochastic First-Order Methods for Conditional Stochastic Optimization and Applications in Meta Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Periodic Q-Learning.
Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, 2020

2019
A Unified Switching System Perspective and O.D.E. Analysis of Q-Learning Algorithms.
CoRR, 2019

Optimization and Learning Algorithms for Stochastic and Adversarial Power Control.
Proceedings of the International Symposium on Modeling and Optimization in Mobile, 2019

Learning Positive Functions with Pseudo Mirror Descent.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Exponential Family Estimation via Adversarial Dynamics Embedding.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Target-Based Temporal-Difference Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Dynamic Programming for POMDP with Jointly Discrete and Continuous State-Spaces.
Proceedings of the 2019 American Control Conference, 2019

Stochastic Primal-Dual Q-Learning Algorithm For Discounted MDPs.
Proceedings of the 2019 American Control Conference, 2019

Kernel Exponential Family Estimation via Doubly Dual Embedding.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Predictive Approximate Bayesian Computation via Saddle Points.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Quadratic Decomposable Submodular Function Minimization.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Coupled Variational Bayes via Optimization Embedding.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation.
Proceedings of the 35th International Conference on Machine Learning, 2018

Boosting the Actor with Dual Critic.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Smoothed Dual Embedding Control.
CoRR, 2017

Online Learning for Multivariate Hawkes Processes.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Stochastic Generative Hashing.
Proceedings of the 34th International Conference on Machine Learning, 2017

Learning from Conditional Distributions via Dual Embeddings.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
Fast and Simple Optimization for Poisson Likelihood Models.
CoRR, 2016

Learning from Conditional Distributions via Dual Kernel Embeddings.
CoRR, 2016

Provable Bayesian Inference via Particle Mirror Descent.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
Scalable Bayesian Inference via Particle Mirror Descent.
CoRR, 2015

Mirror Prox algorithm for multi-term composite minimization and semi-separable problems.
Comput. Optim. Appl., 2015

Semi-Proximal Mirror-Prox for Nonsmooth Composite Minimization.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Time-Sensitive Recommendation From Recurrent User Activities.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014
Scalable Kernel Methods via Doubly Stochastic Gradients.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

2013
Mirror Prox Algorithm for Multi-Term Composite Minimization and Alternating Directions.
CoRR, 2013

Stochastic Alternating Direction Method of Multipliers.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
Stochastic ADMM for Nonsmooth Optimization
CoRR, 2012


  Loading...