Zhexin Zhang
Orcid: 0000-0003-1767-8865
According to our database1,
Zhexin Zhang
authored at least 26 papers
between 2019 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks.
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
A Design of Interface for Visual-Impaired People to Access Visual Information from Images Featuring Large Language Models and Visual Language Models.
Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2024
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Multim. Syst., 2023
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization.
CoRR, 2023
SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions.
CoRR, 2023
CoRR, 2023
InstructSafety: A Unified Framework for Building Multidimensional and Explainable Safety Detector through Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
ETHICIST: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Constructing Moral Discussions.
CoRR, 2022
Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the Universal Access in Human-Computer Interaction. Novel Design Approaches and Technologies, 2022
Visualizing the Electroencephalography Signal Discrepancy When Maintaining Social Distancing: EEG-Based Interactive Moiré Patterns.
Proceedings of the Design, User Experience, and Usability: Design for Emotion, Well-being and Health, Learning, and Culture, 2022
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
A Customized VR Rendering with Neural-Network Generated Frames for Reducing VR Dizziness.
Proceedings of the HCI International 2021 - Posters - 23rd HCI International Conference, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2019
Neural Machine Translation Inspired Binary Code Similarity Comparison beyond Function Pairs.
Proceedings of the 26th Annual Network and Distributed System Security Symposium, 2019