Jiongxiao Wang
According to our database1,
Jiongxiao Wang
authored at least 17 papers
between 2022 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks.
CoRR, 2024
Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness.
CoRR, 2024
CoRR, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations.
CoRR, 2023
On the Exploitability of Reinforcement Learning with Human Feedback for Large Language Models.
CoRR, 2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification.
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
Proceedings of the International Conference on Machine Learning, 2022