Zhenhong Zhou

Orcid: 0000-0003-4065-1740

According to our database1, Zhenhong Zhou authored at least 11 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Enforcing group fairness in privacy-preserving Federated Learning.
Future Gener. Comput. Syst., 2024

On the Role of Attention Heads in Large Language Model Safety.
CoRR, 2024

Alignment-Enhanced Decoding:Defending via Token-Level Adaptive Refining of Probability Distributions.
CoRR, 2024

Course-Correction: Safety Alignment Using Synthetic Preferences.
CoRR, 2024

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
CoRR, 2024

Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue.
CoRR, 2024

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Course-Correction: Safety Alignment Using Synthetic Preferences.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of Probability Distributions.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Quantifying and Analyzing Entity-Level Memorization in Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2021
Three-Dimensional Reconstruction of Huizhou Landscape Combined with Multimedia Technology and Geographic Information System.
Mob. Inf. Syst., 2021


  Loading...