2025

R1dacted: Investigating Local Censorship in DeepSeek's R1 Language Model.

[DOI]

Ali Naseh

Harsh Chaudhari

CoRR, May, 2025

Multilingual and Multi-Accent Jailbreaking of Audio LLMs.

[DOI]

Jaechul Roh

Virat Shejwalkar

Amir Houmansadr

CoRR, April, 2025

OverThink: Slowdown Attacks on Reasoning LLMs.

[DOI]

CoRR, February, 2025

2024

FameBias: Embedding Manipulation Bias Attack in Text-to-Image Models.

[DOI]

Jaechul Roh

Andrew Yuan

Jinsong Mao

CoRR, 2024

Injecting Bias in Text-To-Image Models via Composite-Trigger Backdoors.

[DOI]

CoRR, 2024

OSLO: One-Shot Label-Only Membership Inference Attacks.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

Understanding (Un)Intended Memorization in Text-to-Image Generative Models.

[DOI]

Ali Naseh

Jaechul Roh

Amir Houmansadr

CoRR, 2023

Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication.

[DOI]

Ali Naseh

Jaechul Roh

Amir Houmansadr

CoRR, 2023

2022

Robust Smart Home Face Recognition Under Starving Federated Data.

[DOI]

Jaechul Roh

Yajun Fang

Proceedings of the 6th International Conference on Universal Village, 2022

MSDT: Masked Language Model Scoring Defense in Text Domain.

[DOI]

Jaechul Roh

Minhao Cheng

Yajun Fang

Proceedings of the 6th International Conference on Universal Village, 2022

Impact of Adversarial Training on the Robustness of Deep Neural Networks.

[DOI]

Jaechul Roh

Proceedings of the 5th IEEE International Conference on Information Systems and Computer Aided Education, 2022