2025
DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
CoRR, April, 2025

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs.
CoRR, April, 2025

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values.
CoRR, April, 2025

YuE: Scaling Open Foundation Models for Long-Form Music Generation.
CoRR, March, 2025

LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based Paradigm.
CoRR, February, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines.
CoRR, February, 2025

Aligning Instruction Tuning with Pre-training.
CoRR, January, 2025

UA-LIO: An Uncertainty-Aware LiDAR-Inertial Odometry for Autonomous Driving in Urban Environments.
IEEE Trans. Instrum. Meas., 2025

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

MuPT: A Generative Symbolic Music Pretrained Transformer.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Observing Micromotives and Macrobehavior of Large Language Models.
CoRR, 2024

Can MLLMs Understand the Deep Implication Behind Chinese Images?
CoRR, 2024

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks.
CoRR, 2024

OmniBench: Towards The Future of Universal Omni-Language Models.
CoRR, 2024

LIME: Less Is More for MLLM Evaluation.
CoRR, 2024

Foundation Models for Music: A Survey.
CoRR, 2024

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm.
CoRR, 2024

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association.
CoRR, 2024

MuPT: A Generative Symbolic Music Pretrained Transformer.
CoRR, 2024

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model.
CoRR, 2024

CodeEditorBench: Evaluating Code Editing Capability of Large Language Models.
CoRR, 2024

The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis.
CoRR, 2024

DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning.
CoRR, 2024

CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.
CoRR, 2024

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.
CoRR, 2024

MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces.
CoRR, 2024

CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark.
CoRR, 2024

Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation.
CoRR, 2024

Overview of the NLPCC 2024 Shared Task on Chinese Metaphor Generation.
Proceedings of the Natural Language Processing and Chinese Computing, 2024

D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
An extended ITL-VIKOR model using triangular fuzzy numbers for applications to water-richness evaluation.
Expert Syst. Appl., July, 2023

Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators.
CoRR, 2023

2022
PilotAttnNet: Multi-modal Attention Network for End-to-End Steering Control.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

2019
Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals.
Sensors, 2019