2024
CRDA: Content Risk Drift Assessment of Large Language Models through Adversarial Multi-Agent Interaction.
Proceedings of the International Joint Conference on Neural Networks, 2024

General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token Level.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Adversarial Text Generation by Search and Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023