Dongruo Zhou

PhD thesis, 2023

Provably efficient representation selection in Low-rank Markov Decision Processes: from online to offline RL.

[BibT_eX]

[DOI]

Proceedings of the Uncertainty in Artificial Intelligence, 2023

Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency.

[BibT_eX]

[DOI]

Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

2022

Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium.

[BibT_eX]

[DOI]

CoRR, 2022

Bandit Learning with General Function Classes: Heteroscedastic Noise and Variance-dependent Regret Bounds.

[BibT_eX]

[DOI]

CoRR, 2022

Learning Contextual Bandits Through Perturbed Rewards.

[BibT_eX]

[DOI]

CoRR, 2022

Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Two-Player Markov Games: Neural Function Approximation and Correlated Equilibrium.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Dimension-free Complexity Bounds for High-order Nonconvex Finite-sum Optimization.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Learning Neural Contextual Bandits through Perturbed Rewards.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games.

[BibT_eX]

[DOI]

Zixiang Chen

Proceedings of the International Conference on Algorithmic Learning Theory, 29 March, 2022

Faster Perturbed Stochastic Gradient Methods for Finding Local Minima.

[BibT_eX]

[DOI]

Zixiang Chen

Proceedings of the International Conference on Algorithmic Learning Theory, 29 March, 2022

Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation.

[BibT_eX]

[DOI]

Yue Wu

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Linear Contextual Bandits with Adversarial Corruptions.

[BibT_eX]

[DOI]

Heyang Zhao

CoRR, 2021

Provably Efficient Representation Learning in Low-rank Markov Decision Processes.

[BibT_eX]

[DOI]

CoRR, 2021

Batched Neural Bandits.

[BibT_eX]

[DOI]

CoRR, 2021

Nearly Optimal Regret for Learning Adversarial MDPs with Linear Function Approximation.

[BibT_eX]

[DOI]

CoRR, 2021

Almost Optimal Algorithms for Two-player Markov Games with Linear Function Approximation.

[BibT_eX]

[DOI]

Zixiang Chen

CoRR, 2021

Pure Exploration in Kernel and Neural Bandits.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation.

[BibT_eX]

[DOI]

Weitong Zhang

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Iterative Teacher-Aware Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Provably Efficient Reinforcement Learning with Linear Function Approximation under Adaptivity Constraints.

[BibT_eX]

[DOI]

Tianhao Wang

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Variance-Aware Off-Policy Evaluation with Linear Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Logarithmic Regret for Reinforcement Learning with Linear Function Approximation.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Neural Thompson Sampling.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes.

[BibT_eX]

[DOI]

Csaba Szepesvári

Proceedings of the Conference on Learning Theory, 2021

2020

Gradient descent optimizes over-parameterized deep ReLU networks.

[BibT_eX]

[DOI]

Mach. Learn., 2020

Stochastic Nested Variance Reduction for Nonconvex Optimization.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2020

Provable Multi-Objective Reinforcement Learning with Generative Models.

[BibT_eX]

[DOI]

Jiahao Chen

CoRR, 2020

Minimax Optimal Reinforcement Learning for Discounted MDPs.

[BibT_eX]

[DOI]

CoRR, 2020

Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Neural Contextual Bandits with UCB-based Exploration.

[BibT_eX]

[DOI]

Lihong Li

Proceedings of the 37th International Conference on Machine Learning, 2020

Stochastic Recursive Variance-Reduced Cubic Regularization Methods.

[BibT_eX]

[DOI]

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Accelerated Factored Gradient Descent for Low-Rank Matrix Factorization.

[BibT_eX]

[DOI]

Yuan Cao

Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

A Frank-Wolfe Framework for Efficient and Effective Adversarial Attacks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Stochastic Variance-Reduced Cubic Regularization Methods.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2019

Neural Contextual Bandits with Upper Confidence Bound-Based Exploration.

[BibT_eX]

[DOI]

Lihong Li

CoRR, 2019

Lower Bounds for Smooth Nonconvex Finite-Sum Optimization.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Sample Efficient Stochastic Variance-Reduced Cubic Regularization Method.

[BibT_eX]

[DOI]

CoRR, 2018

Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks.

[BibT_eX]

[DOI]

CoRR, 2018

On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization.

[BibT_eX]

[DOI]

CoRR, 2018

Finding Local Minima via Stochastic Nested Variance Reduction.

[BibT_eX]

[DOI]

CoRR, 2018

Stochastic Nested Variance Reduced Gradient Descent for Nonconvex Optimization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Stochastic Variance-Reduced Cubic Regularized Newton Method.

[BibT_eX]

[DOI]