Li Zhao

Orcid: 0000-0001-5095-3377

Affiliations:
  • Microsoft Research Asia, Beijing, China
  • Tsinghua University, Beijing, China (PhD 2016)


According to our database1, Li Zhao authored at least 55 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Video In-context Learning.
CoRR, 2024

DPO Meets PPO: Reinforced Token Optimization for RLHF.
CoRR, 2024

Empowering Large Language Models on Robotic Manipulation with Affordance Prompting.
CoRR, 2024

AgentIR: 1st Workshop on Agent-based Information Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Diversification of Adaptive Policy for Effective Offline Reinforcement Learning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Pre-Trained Large Language Models for Industrial Control.
CoRR, 2023

A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management.
CoRR, 2023

Asking Before Action: Gather Information in Embodied Decision Making with Language Models.
CoRR, 2023

H-TSP: Hierarchically Solving the Large-Scale Travelling Salesman Problem.
CoRR, 2023

Distributional Pareto-Optimal Multi-Objective Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Multi-Agent Intention-Aware Communication for Optimal Multi-Order Execution in Finance.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Towards Generalizable Reinforcement Learning for Trade Execution.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Robust Situational Reinforcement Learning in Face of Context Disturbances.
Proceedings of the International Conference on Machine Learning, 2023

DRL4IR: 4th Workshop on Deep Reinforcement Learning for Information Retrieval.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Curriculum Offline Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

H-TSP: Hierarchically Solving the Large-Scale Traveling Salesman Problem.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Pointerformer: Deep Reinforced Multi-Pointer Transformer for the Traveling Salesman Problem.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management.
CoRR, 2022

DRL4IR: 3rd Workshop on Deep Reinforcement Learning for Information Retrieval.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality.
Proceedings of the Tenth International Conference on Learning Representations, 2022

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.
Proceedings of the IEEE International Conference on Data Mining, 2022

Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation.
Proceedings of the IEEE Conference on Games, CoG 2022, Beijing, 2022

2021
Demonstration actor critic.
Neurocomputing, 2021

Curriculum Offline Imitation Learning.
CoRR, 2021

DRL4IR: 2nd Workshop on Deep Reinforcement Learning for Information Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Distributional Reinforcement Learning for Multi-Dimensional Reward Functions.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Curriculum Offline Imitating Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Independence-aware Advantage Estimation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Return-Based Contrastive Representation Learning for Reinforcement Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
Suphx: Mastering Mahjong with Deep Reinforcement Learning.
CoRR, 2020

Leveraging Demonstrations for Reinforcement Recommendation Reasoning over Knowledge Graphs.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Deep Reinforcement Learning for Information Retrieval: Fundamentals and Advances.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

RD$^2$: Reward Decomposition with Representation Decomposition.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Semi-Supervised Neural Machine Translation via Marginal Distribution Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Fully Parameterized Quantile Function for Distributional Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Distributional Reward Decomposition for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Individualized Indicator for All: Stock-wise Technical Indicator Optimization with Stock Embedding.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Investment Behaviors Can Tell What Inside: Exploring Stock Intrinsic Properties for Stock Trend Prediction.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Unified Policy Optimization for Robust Reinforcement Learning.
Proceedings of The 11th Asian Conference on Machine Learning, 2019

Trust Region Evolution Strategies.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Efficient Sequence Learning with Group Recurrent Networks.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Adversarial Neural Machine Translation.
Proceedings of The 10th Asian Conference on Machine Learning, 2018

Learning Structured Representation for Text Classification via Reinforcement Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Word Attention for Sequence to Sequence Text Understanding.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Dual Transfer Learning for Neural Machine Translation with Marginal Distribution Regularization.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Reinforcement Learning for Relation Classification From Noisy Data.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Sequence Prediction with Unlabeled Data by Reward Function Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016
Attention-based LSTM for Aspect-level Sentiment Classification.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Semi-Supervised Multinomial Naive Bayes for Text Classification by Leveraging Word-Level Statistical Constraint.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Sentiment Extraction by Leveraging Aspect-Opinion Association Structure.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
Clustering Aspect-related Phrases by Leveraging Sentiment Distribution Consistency.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014


  Loading...