False Correlation Reduction for Offline Reinforcement Learning.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024
Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics.
Proceedings of the 11th International Conference on Learning Representations, 2023
Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information.
CoRR, 2022
Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes.
CoRR, 2022
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation.
Proceedings of the 39th International Conference on Machine Learning, 2022
SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning.
CoRR, 2021
Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation.
CoRR, 2021
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning.
CoRR, 2021
Decentralized Single-Timescale Actor-Critic on Zero-Sum Two-Player Stochastic Games.
Proceedings of the 38th International Conference on Machine Learning, 2021
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy.
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games.
Proceedings of the 8th International Conference on Learning Representations, 2020
Credible Sample Elicitation by Deep Learning, for Deep Learning.
CoRR, 2019