Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Mitigating Fine-tuning Jailbreak Attack with Backdoor Enhanced Alignment.
CoRR, 2024
BackdoorAlign: Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
ChatGPT as an Attack Tool: Stealthy Textual Backdoor Attack via Blackbox Generative Model Trigger.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Defending against Insertion-based Textual Backdoor Attacks via Attribution.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Author Correction: Performance evaluation of a prescription medication image classification model: an observational cohort.
npj Digit. Medicine, 2022
Performance evaluation of a prescription medication image classification model: an observational cohort.
npj Digit. Medicine, 2021
Re-ranking Biomedical Literature for Precision Medicine with Pre-trained Neural Models.
Proceedings of the 8th IEEE International Conference on Healthcare Informatics, 2020
PharmMT: A Neural Machine Translation Approach to Simplify Prescription Directions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Exploring the Prediction of Variety-seeking Behavior.
Proceedings of the 2nd International Conference on Data Science and Information Technology, 2019