2025

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values.

M.-A-P. Team

Siwei Wu

CoRR, April, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines.

CoRR, February, 2025

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MuPT: A Generative Symbolic Music Pretrained Transformer.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Can MLLMs Understand the Deep Implication Behind Chinese Images?

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale.

CoRR, 2024

Teach Multimodal LLMs to Comprehend Electrocardiographic Images.

CoRR, 2024

Can MLLMs Understand the Deep Implication Behind Chinese Images?

CoRR, 2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.

CoRR, 2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series.

CoRR, 2024

MuPT: A Generative Symbolic Music Pretrained Transformer.

CoRR, 2024

COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning.

CoRR, 2024

MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property.

CoRR, 2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DEFT: Distribution-guided Efficient Fine-Tuning for Human Alignment.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model.

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024