2025

Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics.

Runzhe Wu

Ayush Sekhari

Akshay Krishnamurthy

Wen Sun

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Diffusing States and Matching Scores: A New Framework for Imitation Learning.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Making RL with Preference-based Feedback Efficient via Randomization.

Runzhe Wu

Wen Sun

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.

J. Mach. Learn. Res., 2023

Contextual Bandits and Imitation Learning via Preference-Based Active Queries.

CoRR, 2023

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning.

Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023

Selective Sampling and Imitation Learning via Online Regression.

Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023

Contextual Bandits and Imitation Learning with Preference-Based Active Queries.

Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023

Distributional Offline Policy Evaluation with Predictive Error Guarantees.

Runzhe Wu

Masatoshi Uehara

Wen Sun

Proceedings of the International Conference on Machine Learning, 2023

2021

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.

CoRR, 2021

Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration.

Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021