Tinghao Xie

According to our database¹, Tinghao Xie authored at least 11 papers between 2022 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors.

[BibT_eX]

[DOI]

Udari Madhushani Sehwag

CoRR, 2024

Fantastic Copyrighted Beasts and How (Not) to Generate Them.

[BibT_eX]

[DOI]

CoRR, 2024

AI Risk Management Should Incorporate Both Safety and Security.

[BibT_eX]

[DOI]

CoRR, 2024

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications.

[BibT_eX]

[DOI]

Proceedings of the Forty-first International Conference on Machine Learning, 2024

BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Towards A Proactive ML Approach for Detecting Backdoor Poison Samples.

[BibT_eX]

[DOI]

Proceedings of the 32nd USENIX Security Symposium, 2023

Revisiting the Assumption of Latent Separability for Backdoor Defenses.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Fight Poison with Poison: Detecting Backdoor Poison Samples via Decoupling Benign Correlations.

[BibT_eX]

[DOI]

CoRR, 2022

Circumventing Backdoor Defenses That Are Based on Latent Separability.

[BibT_eX]

[DOI]

CoRR, 2022

Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Tinghao Xie

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...