2025

LLM Misalignment via Adversarial RLHF Platforms.

[DOI]

Erfan Entezami

Ali Naseh

CoRR, March, 2025

OverThink: Slowdown Attacks on Reasoning LLMs.

[DOI]

CoRR, February, 2025

Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation.

[DOI]

CoRR, February, 2025

Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection.

[DOI]

Ali Naseh

Niloofar Mireshghallah

CoRR, January, 2025

Diffence: Fencing Membership Privacy With Diffusion Models.

[DOI]

Yuefeng Peng

Ali Naseh

Amir Houmansadr

Proceedings of the 32nd Annual Network and Distributed System Security Symposium, 2025

2024

Injecting Bias in Text-To-Image Models via Composite-Trigger Backdoors.

[DOI]

CoRR, 2024

Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images.

[DOI]

CoRR, 2024

2023

Understanding (Un)Intended Memorization in Text-to-Image Generative Models.

[DOI]

Ali Naseh

Jaechul Roh

Amir Houmansadr

CoRR, 2023

Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication.

[DOI]

Ali Naseh

Jaechul Roh

Amir Houmansadr

CoRR, 2023

On the Risks of Stealing the Decoding Algorithms of Language Models.

[DOI]

CoRR, 2023

Stealing the Decoding Algorithms of Language Models.

[DOI]

Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security, 2023