Yao Liu

Affiliations:

Stanford University, CA, USA
Peking University, Beijing, China

According to our database¹, Yao Liu authored at least 23 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Reinforcement learning tutor better supported lower performers in a math task.

[BibT_eX]

[DOI]

Mach. Learn., May, 2024

Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens.

[BibT_eX]

[DOI]

CoRR, 2024

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents.

[BibT_eX]

[DOI]

CoRR, 2024

EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data.

[BibT_eX]

[DOI]

CoRR, 2024

Learning the Target Network in Function Space.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task.

[BibT_eX]

[DOI]

CoRR, 2023

TD Convergence: An Optimization Perspective.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Budgeting Counterfactual for Offline RL.

[BibT_eX]

[DOI]

Yao Liu

Pratik Chaudhari

Rasool Fakoor

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

Offline policy optimization with eligible actions.

[BibT_eX]

[DOI]

Yao Liu

Yannis Flet-Berliac

Emma Brunskill

Proceedings of the Uncertainty in Artificial Intelligence, 2022

2021

Adaptive and efficient batch reinforcement learning algorithms.

[BibT_eX]

[DOI]

Yao Liu

PhD thesis, 2021

2020

Provably Good Batch Reinforcement Learning Without Great Exploration.

[BibT_eX]

[DOI]

CoRR, 2020

Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling.

[BibT_eX]

[DOI]

Yao Liu

Pierre-Luc Bacon

Emma Brunskill

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

All-Action Policy Gradient Methods: A Numerical Integration Approach.

[BibT_eX]

[DOI]

Benjamin Petit

Loren Amdahl-Culleton

Yao Liu

Jimmy Smith

Pierre-Luc Bacon

CoRR, 2019

Off-Policy Policy Gradient with State Distribution Correction.

[BibT_eX]

[DOI]

CoRR, 2019

Off-Policy Policy Gradient with Stationary Distribution Correction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Combining parametric and nonparametric models for off-policy evaluation.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters.

[BibT_eX]

[DOI]

CoRR, 2018

When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms.

[BibT_eX]

[DOI]

Yao Liu

Emma Brunskill

CoRR, 2018

Representation Balancing MDPs for Off-policy Policy Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2016

PAC Continuous State Online Multitask Reinforcement Learning with Identification.

[BibT_eX]

[DOI]

Yao Liu

Zhaohan Guo

Emma Brunskill

Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

Yao Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...