×
2024
Making Harmful Behaviors Unlearnable for Large Language Models.
[DOI]
Xin Zhou
,
Yi Lu
,
Ruotian Ma
,
Yujian Wei
,
Tao Gui
,
Qi Zhang
,
Xuanjing Huang
Proceedings of the Findings of the Association for Computational Linguistics, 2024