Runzhe Wan

Orcid: 0009-0000-7820-4271

According to our database1, Runzhe Wan authored at least 18 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Robust Offline Reinforcement Learning with Heavy-Tailed Rewards.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Zero-Inflated Bandits.
CoRR, 2023

Robust Offline Policy Evaluation and Optimization with Heavy-Tailed Rewards.
CoRR, 2023

STEEL: Singularity-aware Reinforcement Learning.
CoRR, 2023

Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Multiplier Bootstrap-based Exploration.
Proceedings of the International Conference on Machine Learning, 2023

Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Heterogeneous Synthetic Learner for Panel Data.
CoRR, 2022

Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies.
CoRR, 2022

A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets.
CoRR, 2022

Safe Exploration for Efficient Policy Evaluation and Comparison.
Proceedings of the International Conference on Machine Learning, 2022

2021
Pattern Transfer Learning for Reinforcement Learning in Order Dispatching.
CoRR, 2021

Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Deeply-Debiased Off-Policy Interval Estimation.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Multi-Objective Reinforcement Learning for Infectious Disease Control with Application to COVID-19 Spread.
CoRR, 2020

Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making.
Proceedings of the 37th International Conference on Machine Learning, 2020


  Loading...