2024

Large Language Model Safety: A Holistic Survey.

[DOI]

Dan Shi

Tianhao Shen

CoRR, 2024

Self-Pluralising Culture Alignment for Large Language Models.

[DOI]

CoRR, 2024

CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models.

[DOI]

CoRR, 2024

OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety.

[DOI]

CoRR, 2024

Identifying Multiple Personalities in Large Language Models with External Evaluation.

[DOI]

Gopala Anumanchipalli

Simerjot Kaur

CoRR, 2024

LFED: A Literary Fiction Evaluation Dataset for Large Language Models.

[DOI]

Linhao Yu

Qun Liu

Deyi Xiong

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Evaluating Large Language Models: A Comprehensive Survey.

[DOI]

CoRR, 2023

M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models.

[DOI]

CoRR, 2023

CS2W: A Chinese Spoken-to-Written Style Conversion Dataset with Multiple Conversion Types.

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023