Zhexin Zhang

Orcid: 0000-0003-1767-8865

According to our database, Zhexin Zhang authored at least 26 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks.
CoRR, 2024

Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack.
CoRR, 2024

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

A Design of Interface for Visual-Impaired People to Access Visual Information from Images Featuring Large Language Models and Visual Language Models.
Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2024

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SafetyBench: Evaluating the Safety of Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Import vertical characteristic of rain streak for single image deraining.
Multim. Syst., 2023

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization.
CoRR, 2023

SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions.
CoRR, 2023

Safety Assessment of Chinese Large Language Models.
CoRR, 2023

Recent Advances towards Safe, Responsible, and Moral Dialogue Systems: A Survey.
CoRR, 2023

InstructSafety: A Unified Framework for Building Multidimensional and Explainable Safety Detector through Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Unveiling the Implicit Toxicity in Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ETHICIST: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Self-Supervised Sentence Polishing by Adding Engaging Modifiers.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Constructing Moral Discussions.
CoRR, 2022

Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Indoor Auto-Navigate System for Electric Wheelchairs in a Nursing Home.
Proceedings of the Universal Access in Human-Computer Interaction. Novel Design Approaches and Technologies, 2022

Visualizing the Electroencephalography Signal Discrepancy When Maintaining Social Distancing: EEG-Based Interactive Moiré Patterns.
Proceedings of the Design, User Experience, and Usability: Design for Emotion, Well-being and Health, Learning, and Culture, 2022

Automatic Comment Generation for Chinese Student Narrative Essays.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Selecting Stickers in Open-Domain Dialogue through Multitask Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
A Customized VR Rendering with Neural-Network Generated Frames for Reducing VR Dizziness.
Proceedings of the HCI International 2021 - Posters - 23rd HCI International Conference, 2021

OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2019
Neural Machine Translation Inspired Binary Code Similarity Comparison beyond Function Pairs.
Proceedings of the 26th Annual Network and Distributed System Security Symposium, 2019

