Simon S. Du
Affiliations:- University of Washington, USA
- Carnegie Mellon University, Machine Learning Department (former)
According to our database1,
Simon S. Du
authored at least 156 papers
between 2016 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on cs.cmu.edu
On csauthors.net:
Bibliography
2024
Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques.
CoRR, 2024
CoRR, 2024
Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models.
CoRR, 2024
CoRR, 2024
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation.
CoRR, 2024
Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning.
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation (Extended Abstract).
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Integrating the traffic science with representation learning for city-wide network congestion prediction.
Inf. Fusion, November, 2023
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization.
Trans. Mach. Learn. Res., 2023
Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning.
Trans. Mach. Learn. Res., 2023
CoRR, 2023
Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments.
Proceedings of the International Conference on Machine Learning, 2023
Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes.
Proceedings of the International Conference on Machine Learning, 2023
On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the International Conference on Machine Learning, 2023
Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023
2022
Understanding the acceleration phenomenon via high-resolution differential equations.
Math. Program., 2022
Provable General Function Class Representation Learning in Multitask Bandits and MDPs.
CoRR, 2022
Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems.
CoRR, 2022
TransFollower: Long-Sequence Car-Following Trajectory Prediction through Transformer.
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Provable General Function Class Representation Learning in Multitask Bandits and MDP.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the International Conference on Machine Learning, 2022
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes.
Proceedings of the International Conference on Machine Learning, 2022
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies.
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
AdaLoss: A Computationally-Efficient and Provably Convergent Adaptive Gradient Method.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
INFORMS J. Comput., 2021
CoRR, 2021
CoRR, 2021
Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP.
CoRR, 2021
A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost.
CoRR, 2021
When is particle filtering efficient for planning in partially observed linear dynamical systems?
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP.
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon.
Proceedings of the Conference on Learning Theory, 2021
Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap.
Proceedings of the Conference on Learning Theory, 2021
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
2020
On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics.
J. Mach. Learn. Res., 2020
Is Long Horizon Reinforcement Learning More Difficult Than Short Horizon Reinforcement Learning?
CoRR, 2020
Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity.
CoRR, 2020
Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Agnostic $Q$-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Proceedings of the 37th International Conference on Machine Learning, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
2019
PhD thesis, 2019
CoRR, 2019
Hitting Time of Stochastic Gradient Langevin Dynamics to Stationary Points: A Direct Analysis.
CoRR, 2019
Global Convergence of Adaptive Gradient Methods for An Over-parameterized Neural Network.
CoRR, 2019
Acceleration via Symplectic Discretization of High-Resolution Differential Equations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks.
Proceedings of the 36th International Conference on Machine Learning, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Linear Convergence of the Primal-Dual Gradient Method for Convex-Concave Saddle Point Problems without Strong Convexity.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019
2018
CoRR, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Fast and Sample Efficient Inductive Matrix Completion via Multi-Phase Procrustes Flow.
Proceedings of the 35th International Conference on Machine Learning, 2018
Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms.
Proceedings of the 35th International Conference on Machine Learning, 2018
Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima.
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 35th International Conference on Machine Learning, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018
2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the Field and Service Robotics, 2017
Proceedings of the 30th Conference on Learning Theory, 2017
2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the 29th Conference on Learning Theory, 2016