2022
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.
Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu
Proceedings of the IEEE International Conference on Data Mining, 2022