Simon S. Du

Affiliations:
  • University of Washington, USA
  • Carnegie Mellon University, Machine Learning Department (former)


According to our database1, Simon S. Du authored at least 156 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Transformers are Efficient Compilers, Provably.
CoRR, 2024

The Crucial Role of Samplers in Online Direct Preference Optimization.
CoRR, 2024

Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques.
CoRR, 2024

Understanding the Gains from Repeated Self-Distillation.
CoRR, 2024

Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning.
CoRR, 2024

Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models.
CoRR, 2024

Decoding-Time Language Model Alignment with Multiple Objectives.
CoRR, 2024

CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.
CoRR, 2024

Transferable Reinforcement Learning via Generalized Occupancy Models.
CoRR, 2024

Offline Multi-task Transfer RL with Representational Penalization.
CoRR, 2024

Learning Optimal Tax Design in Nonatomic Congestion Games.
CoRR, 2024

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation.
CoRR, 2024

Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning.
CoRR, 2024

Rethinking Transformers in Solving POMDPs.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Horizon-Free Regret for Linear Markov Decision Processes.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP and Attention.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Optimal Multi-Distribution Learning.
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

Settling the sample complexity of online reinforcement learning.
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

Refined Sample Complexity for Markov Games with Independent Linear Function Approximation (Extended Abstract).
Proceedings of the Thirty Seventh Annual Conference on Learning Theory, June 30, 2024

Reflect-RL: Two-Player Online RL Fine-Tuning for LMs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Integrating the traffic science with representation learning for city-wide network congestion prediction.
Inf. Fusion, November, 2023

Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization.
Trans. Mach. Learn. Res., 2023

Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning.
CoRR, 2023

Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Reduction-based Framework for Sequential Decision Making with Delayed Feedback.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Active representation learning for general task space with applications in robotics.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments.
Proceedings of the International Conference on Machine Learning, 2023

Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes.
Proceedings of the International Conference on Machine Learning, 2023

On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness.
Proceedings of the International Conference on Machine Learning, 2023

Improved Active Multi-Task Representation Learning via Lasso.
Proceedings of the International Conference on Machine Learning, 2023

Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing.
Proceedings of the International Conference on Machine Learning, 2023

Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Offline Congestion Games: How Feedback Type Affects Data Coverage Requirement.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Variance-Aware Sparse Linear Bandits.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Blessing of Class Diversity in Pre-training.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Understanding the acceleration phenomenon via high-resolution differential equations.
Math. Program., 2022

Horizon-Free Reinforcement Learning for Latent Markov Decision Processes.
CoRR, 2022

Optimal Extragradient-Based Bilinearly-Coupled Saddle-Point Optimization.
CoRR, 2022

Provable General Function Class Representation Learning in Multitask Bandits and MDPs.
CoRR, 2022

Nearly Minimax Algorithms for Linear Bandits with Shared Representation.
CoRR, 2022

Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems.
CoRR, 2022

TransFollower: Long-Sequence Car-Following Trajectory Prediction through Transformer.
CoRR, 2022

When is Offline Two-Player Zero-Sum Markov Game Solvable?
CoRR, 2022

Near-Optimal Randomized Exploration for Tabular Markov Decision Processes.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Gap-dependent Bounds for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Provable General Function Class Representation Learning in Multitask Bandits and MDP.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning in Congestion Games with Bandit Feedback.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

When are Offline Two-Player Zero-Sum Markov Games Solvable?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Nearly Optimal Policy Optimization with Stable at Any Time Guarantee.
Proceedings of the International Conference on Machine Learning, 2022

Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes.
Proceedings of the International Conference on Machine Learning, 2022

First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach.
Proceedings of the International Conference on Machine Learning, 2022

Active Multi-Task Representation Learning.
Proceedings of the International Conference on Machine Learning, 2022

Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path.
Proceedings of the International Conference on Machine Learning, 2022

Denoised MDPs: Learning World Models Better Than the World Itself.
Proceedings of the International Conference on Machine Learning, 2022

A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Provable Adaptation across Multiway Domains via Representation Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies.
Proceedings of the Conference on Learning Theory, 2-5 July 2022, London, UK., 2022

Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

Gap-Dependent Bounds for Two-Player Markov Games.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

AdaLoss: A Computationally-Efficient and Provably Convergent Adaptive Gradient Method.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Near-Linear Time Local Polynomial Nonparametric Estimation with Box Kernels.
INFORMS J. Comput., 2021

A Benchmark for Low-Switching-Cost Reinforcement Learning.
CoRR, 2021

Towards Demystifying Representation Learning with Non-contrastive Self-supervision.
CoRR, 2021

A Unified Framework for Conservative Exploration.
CoRR, 2021

On the Power of Multitask Representation Learning in Linear MDP.
CoRR, 2021

Randomized Exploration is Near-Optimal for Tabular MDP.
CoRR, 2021

Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games.
CoRR, 2021

Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP.
CoRR, 2021

A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost.
CoRR, 2021

When is particle filtering efficient for planning in partially observed linear dynamical systems?
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Nearly Horizon-Free Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Corruption Robust Active Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Near Optimal Reward-Free Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP.
Proceedings of the 38th International Conference on Machine Learning, 2021

Bilinear Classes: A Structural Framework for Provable Generalization in RL.
Proceedings of the 38th International Conference on Machine Learning, 2021

Improved Corruption Robust Algorithms for Episodic Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Impact of Representation Learning in Linear Bandits.
Proceedings of the 9th International Conference on Learning Representations, 2021

How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization.
Proceedings of the 9th International Conference on Learning Representations, 2021

Few-Shot Learning via Learning the Representation, Provably.
Proceedings of the 9th International Conference on Learning Representations, 2021

Optimism in Reinforcement Learning with Generalized Linear Function Approximation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon.
Proceedings of the Conference on Learning Theory, 2021

Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap.
Proceedings of the Conference on Learning Theory, 2021

Q-learning with Logarithmic Regret.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics.
J. Mach. Learn. Res., 2020

Provable Benefits of Representation Learning in Linear Bandits.
CoRR, 2020

Nearly Minimax Optimal Reward-free Reinforcement Learning.
CoRR, 2020

When is Particle Filtering Efficient for POMDP Sequential Planning?
CoRR, 2020

Is Long Horizon Reinforcement Learning More Difficult Than Short Horizon Reinforcement Learning?
CoRR, 2020

Provably Efficient Exploration for RL with Unsupervised Learning.
CoRR, 2020

Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity.
CoRR, 2020

Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Planning with General Objective Functions: Going Beyond Total Rewards.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On Reward-Free Reinforcement Learning with Linear Function Approximation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Is Long Horizon RL More Difficult Than Short Horizon RL?
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Agnostic $Q$-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Provable Representation Learning for Imitation Learning via Bi-level Optimization.
Proceedings of the 37th International Conference on Machine Learning, 2020

What Can Neural Networks Reason About?
Proceedings of the 8th International Conference on Learning Representations, 2020

Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
Proceedings of the 8th International Conference on Learning Representations, 2020

Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Gradient Descent for Non-convex Problems in Modern Machine Learning.
PhD thesis, 2019

Enhanced Convolutional Neural Tangent Kernels.
CoRR, 2019

Continuous Control with Contexts, Provably.
CoRR, 2019

Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs.
CoRR, 2019

Hitting Time of Stochastic Gradient Langevin Dynamics to Stationary Points: A Direct Analysis.
CoRR, 2019

Global Convergence of Adaptive Gradient Methods for An Over-parameterized Neural Network.
CoRR, 2019

Acceleration via Symplectic Discretization of High-Resolution Differential Equations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Towards Understanding the Importance of Shortcut Connections in Residual Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On Exact Computation with an Infinitely Wide Neural Net.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Gradient Descent Finds Global Minima of Deep Neural Networks.
Proceedings of the 36th International Conference on Machine Learning, 2019

Provably efficient RL with Rich Observations via Latent State Decoding.
Proceedings of the 36th International Conference on Machine Learning, 2019

Width Provably Matters in Optimization for Deep Linear Neural Networks.
Proceedings of the 36th International Conference on Machine Learning, 2019

Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks.
Proceedings of the 36th International Conference on Machine Learning, 2019

Gradient Descent Provably Optimizes Over-parameterized Neural Networks.
Proceedings of the 7th International Conference on Learning Representations, 2019

Linear Convergence of the Primal-Dual Gradient Method for Convex-Concave Saddle Point Problems without Strong Convexity.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Robust Nonparametric Regression under Huber's ε-contamination Model.
CoRR, 2018

How Many Samples are Needed to Learn a Convolutional Neural Network?
CoRR, 2018

Improved Learning of One-hidden-layer Convolutional Neural Networks with Overlaps.
CoRR, 2018

Near-Linear Time Local Polynomial Nonparametric Estimation.
CoRR, 2018

How Many Samples are Needed to Estimate a Convolutional Neural Network?
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Fast and Sample Efficient Inductive Matrix Completion via Multi-Phase Procrustes Flow.
Proceedings of the 35th International Conference on Machine Learning, 2018

Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms.
Proceedings of the 35th International Conference on Machine Learning, 2018

Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima.
Proceedings of the 35th International Conference on Machine Learning, 2018

On the Power of Over-parametrization in Neural Networks with Quadratic Activation.
Proceedings of the 35th International Conference on Machine Learning, 2018

When is a Convolutional Filter Easy to Learn?
Proceedings of the 6th International Conference on Learning Representations, 2018

Stochastic Zeroth-order Optimization in High Dimensions.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Computationally Efficient Robust Estimation of Sparse Functionals.
CoRR, 2017

On the Power of Truncated SVD for General High-rank Matrix Estimation Problems.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Hypothesis Transfer Learning via Transformation Functions.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Gradient Descent Can Take Exponential Time to Escape Saddle Points.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Stochastic Variance Reduction Methods for Policy Evaluation.
Proceedings of the 34th International Conference on Machine Learning, 2017

High-Throughput Robotic Phenotyping of Energy Sorghum Crops.
Proceedings of the Field and Service Robotics, 2017

Computationally Efficient Robust Sparse Estimation in High Dimensions.
Proceedings of the 30th Conference on Learning Theory, 2017

2016
Transformation Function Based Methods for Model Shift.
CoRR, 2016

Efficient Nonparametric Smoothness Estimation.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

An Improved Gap-Dependency Analysis of the Noisy Power Method.
Proceedings of the 29th Conference on Learning Theory, 2016


  Loading...