Chuheng Zhang

According to our database, Chuheng Zhang authored at least 27 papers between 2020 and 2024.

Bibliography

2024
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation.
CoRR, 2024

Empowering Large Language Models on Robotic Manipulation with Affordance Prompting.
CoRR, 2024

ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning.
CoRR, 2024

Diversification of Adaptive Policy for Effective Offline Reinforcement Learning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Whittle Index with Multiple Actions and State Constraint for Inventory Management.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Pre-Trained Large Language Models for Industrial Control.
CoRR, 2023

A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management.
CoRR, 2023

Towards Generalizable Reinforcement Learning for Trade Execution.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Robust Situational Reinforcement Learning in Face of Context Disturbances.
Proceedings of the International Conference on Machine Learning, 2023

Curriculum Offline Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

RePreM: Representation Pre-training with Masked Model for Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management.
CoRR, 2022

Cross DQN: Cross Deep Q Network for Ads Allocation in Feed.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.
Proceedings of the IEEE International Conference on Data Mining, 2022

Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Return-Based Contrastive Representation Learning for Reinforcement Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Inductive Matrix Completion Using Graph Autoencoder.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Exploration by Maximizing Rényi Entropy for Zero-Shot Meta RL.
CoRR, 2020

DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Policy Search by Target Distribution Learning for Continuous Control.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

