CRDA: Content Risk Drift Assessment of Large Language Models through Adversarial Multi-Agent Interaction.

[DOI]

Zongzhen Liu

,

Guoyi Li

,

,

,

,

,

Proceedings of the International Joint Conference on Neural Networks, 2024

General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token Level.

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

Adversarial Text Generation by Search and Learning.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023