2025
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values.
CoRR, April, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines.
CoRR, February, 2025

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MuPT: A Generative Symbolic Music Pretrained Transformer.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale.
CoRR, 2024

Teach Multimodal LLMs to Comprehend Electrocardiographic Images.
CoRR, 2024

Can MLLMs Understand the Deep Implication Behind Chinese Images?
CoRR, 2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.
CoRR, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series.
CoRR, 2024

MuPT: A Generative Symbolic Music Pretrained Transformer.
CoRR, 2024

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning.
CoRR, 2024

MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property.
CoRR, 2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DEFT: Distribution-guided Efficient Fine-Tuning for Human Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024