Zhenhong Zhou
ORCID: 0000-0003-4065-1740
According to our database, Zhenhong Zhou authored at least 11 papers between 2021 and 2024.
Bibliography
2024
Future Gener. Comput. Syst., 2024
Alignment-Enhanced Decoding: Defending via Token-Level Adaptive Refining of Probability Distributions.
CoRR, 2024
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
CoRR, 2024
Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue.
CoRR, 2024
How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024
Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of Probability Distributions.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2021
Three-Dimensional Reconstruction of Huizhou Landscape Combined with Multimedia Technology and Geographic Information System.
Mob. Inf. Syst., 2021