2025

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values.

M.-A-P. Team

Siwei Wu

CoRR, April, 2025

A Comprehensive Survey on Long Context Language Modeling.

CoRR, March, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines.

CoRR, February, 2025

PIC: Unlocking Long-Form Text Generation Capabilities of Large Language Models via Position ID Compression.

Haoran Que

Wenge Rong

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Enhancing LLMs via High-Knowledge Data Selection.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

MIO: A Foundation Model on Multimodal Tokens.

CoRR, 2024

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models.

CoRR, 2024

E2-LLM: Efficient and Extreme Length Extension of Large Language Models.

CoRR, 2024

D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DDK: Distilling Domain Knowledge for Efficient Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

E2-LLM: Efficient and Extreme Length Extension of Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.

CoRR, 2023