2024
Defending Text-to-image Diffusion Models: Surprising Efficacy of Textual Perturbations Against Backdoor Attacks.
CoRR, 2024

Understanding and Mitigating Spurious Correlations in Text Classification with Neighborhood Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
Understanding and Mitigating Spurious Correlations in Text Classification.
CoRR, 2023