Bochuan Cao

According to our database, Bochuan Cao authored at least 13 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models.
CoRR, 2024

Watch the Watcher! Backdoor Attacks on Security-Enhancing Diffusion Models.
CoRR, 2024

On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept.
CoRR, 2024

Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization.
CoRR, 2024

XPrompt: Explaining Large Language Model's Generation via Joint Prompt Attribution.
CoRR, 2024

WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response.
CoRR, 2024

On the Difficulty of Defending Contrastive Learning against Backdoor Attacks.
Proceedings of the 33rd USENIX Security Symposium, 2024

Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Jailbreak Open-Sourced Large Language Models via Enforced Decoding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
On the Safety of Open-Sourced Large Language Models: Does Alignment Really Prevent Them From Being Misused?
CoRR, 2023

IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Generative AI.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Wild-Time: A Benchmark of in-the-Wild Distribution Shift over Time.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

