2024

False Correlation Reduction for Offline Reinforcement Learning.

[DOI]

Zhihong Deng

Zuyue Fu

IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

2023

Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics.

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information.

[DOI]

CoRR, 2022

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes.

[DOI]

CoRR, 2022

Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation.

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

2021

SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning.

[DOI]

CoRR, 2021

Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation.

[DOI]

CoRR, 2021

Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning.

[DOI]

CoRR, 2021

Decentralized Single-Timescale Actor-Critic on Zero-Sum Two-Player Stochastic Games.

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy.

[DOI]

Zuyue Fu

Zhuoran Yang

Zhaoran Wang

Proceedings of the 9th International Conference on Learning Representations, 2021

Sample Elicitation.

[DOI]

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020

Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games.

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

2019

Credible Sample Elicitation by Deep Learning, for Deep Learning.

[DOI]

CoRR, 2019