We stand with Ukraine

We stand with Ukraine

Chuheng Zhang

According to our database¹, Chuheng Zhang authored at least 28 papers between 2020 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

2020

2021

2022

2023

2024

0

1

2

3

4

5

6

7

8

9

4

2

1

1

3

4

7

3

3

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

IGOR: Image-GOal Representations are the Atomic Control Units for Foundation Models in Embodied AI.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Policy Filtration in RLHF to Fine-Tune LLM for Code Generation.

[BibT_eX]

[DOI]

,

CoRR, 2024

Empowering Large Language Models on Robotic Manipulation with Affordance Prompting.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Diversification of Adaptive Policy for Effective Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Whittle Index with Multiple Actions and State Constraint for Inventory Management.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Pre-Trained Large Language Models for Industrial Control.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2023

A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

Towards Generalizable Reinforcement Learning for Trade Execution.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Robust Situational Reinforcement Learning in Face of Context Disturbances.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

Curriculum Offline Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

RePreM: Representation Pre-training with Masked Model for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2022

Cross DQN: Cross Deep Q Network for Ads Allocation in Feed.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Data Mining, 2022

Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021

Return-Based Contrastive Representation Learning for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Inductive Matrix Completion Using Graph Autoencoder.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Exploration by Maximizing Renyi Entropy for Reward-Free RL Framework.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Exploration by Maximizing Rényi Entropy for Zero-Shot Meta RL.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Policy Search by Target Distribution Learning for Continuous Control.

[BibT_eX]

[DOI]

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Loading...