Wanpeng Zhang

Orcid: 0000-0001-5351-3449

Affiliations:
  • Peking University, School of Computer Science, Beijing, China
  • Tsinghua University, Shenzhen, China (former)


According to our database1, Wanpeng Zhang authored at least 15 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
RLAdapter: Bridging Large Language Models to Reinforcement Learning in Open Worlds.
CoRR, 2023

Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

2022
Model-Based Opponent Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Efficient and Stable Information Directed Exploration for Continuous Reinforcement Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

iGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
MBDP: A Model-based Approach to Achieve both Robustness and Sample Efficiency via Double Dropout Planning.
CoRR, 2021

IGrow: A Smart Agriculture Solution to Autonomous Greenhouse Control.
CoRR, 2021

Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation.
CoRR, 2021

Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

A Simulator-based Planning Framework for Optimizing Autonomous Greenhouse Control Strategy.
Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, 2021

Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control.
Proceedings of the Asian Conference on Machine Learning, 2021

2020
Self-Paced Probabilistic Principal Component Analysis For Data With Outliers.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Self-Paced Probabilistic Principal Component Analysis for Data with Outliers.
CoRR, 2019


  Loading...