Zhexin Zhang

Orcid: 0000-0003-1767-8865

According to our database, Zhexin Zhang authored at least 29 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Timeline: chart of publications per year by type, 2019 to 2024 (not reproduced here).

Bibliography

2024

Agent-SafetyBench: Evaluating the Safety of LLM Agents.

CoRR, 2024

Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework.

CoRR, 2024

Global Challenge for Safe and Secure LLMs Track 1.

Rick Siow Mong Goh

CoRR, 2024

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks.

CoRR, 2024

Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack.

CoRR, 2024

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

A Design of Interface for Visual-Impaired People to Access Visual Information from Images Featuring Large Language Models and Visual Language Models.

Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2024

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SafetyBench: Evaluating the Safety of Large Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Import vertical characteristic of rain streak for single image deraining.

Multim. Syst., 2023

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization.

CoRR, 2023

SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions.

CoRR, 2023

Safety Assessment of Chinese Large Language Models.

CoRR, 2023

Recent Advances towards Safe, Responsible, and Moral Dialogue Systems: A Survey.

CoRR, 2023

InstructSafety: A Unified Framework for Building Multidimensional and Explainable Safety Detector through Instruction Tuning.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Unveiling the Implicit Toxicity in Large Language Models.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ETHICIST: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Self-Supervised Sentence Polishing by Adding Engaging Modifiers.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Constructing Moral Discussions.

CoRR, 2022

Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Indoor Auto-Navigate System for Electric Wheelchairs in a Nursing Home.

Proceedings of the Universal Access in Human-Computer Interaction. Novel Design Approaches and Technologies, 2022

Visualizing the Electroencephalography Signal Discrepancy When Maintaining Social Distancing: EEG-Based Interactive Moiré Patterns.

Xanat Vargas Meza

Proceedings of the Design, User Experience, and Usability: Design for Emotion, Well-being and Health, Learning, and Culture, 2022

Automatic Comment Generation for Chinese Student Narrative Essays.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Selecting Stickers in Open-Domain Dialogue through Multitask Learning.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

A Customized VR Rendering with Neural-Network Generated Frames for Reducing VR Dizziness.

Proceedings of the HCI International 2021 - Posters - 23rd HCI International Conference, 2021

OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2019

Neural Machine Translation Inspired Binary Code Similarity Comparison beyond Function Pairs.

Proceedings of the 26th Annual Network and Distributed System Security Symposium, 2019
