Yao Liu

Affiliations:
  • Stanford University, CA, USA
  • Peking University, Beijing, China


According to our database1, Yao Liu authored at least 23 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Reinforcement learning tutor better supported lower performers in a math task.
Mach. Learn., May, 2024

Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens.
CoRR, 2024

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents.
CoRR, 2024

EXTRACT: Efficient Policy Learning by Extracting Transferrable Robot Skills from Offline Data.
CoRR, 2024

Learning the Target Network in Function Space.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task.
CoRR, 2023

TD Convergence: An Optimization Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Budgeting Counterfactual for Offline RL.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Offline policy optimization with eligible actions.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

2021
Adaptive and efficient batch reinforcement learning algorithms.
PhD thesis, 2021

2020
Provably Good Batch Reinforcement Learning Without Great Exploration.
CoRR, 2020

Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions.
Proceedings of the 37th International Conference on Machine Learning, 2020

Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
All-Action Policy Gradient Methods: A Numerical Integration Approach.
CoRR, 2019

Off-Policy Policy Gradient with State Distribution Correction.
CoRR, 2019

Off-Policy Policy Gradient with Stationary Distribution Correction.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Combining parametric and nonparametric models for off-policy evaluation.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters.
CoRR, 2018

When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms.
CoRR, 2018

Representation Balancing MDPs for Off-policy Policy Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2016
PAC Continuous State Online Multitask Reinforcement Learning with Identification.
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016


  Loading...