2025
WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages.
CoRR, January, 2025

2024
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations.
CoRR, 2024

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling.
CoRR, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024

InternLM2 Technical Report.
CoRR, 2024

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset.
CoRR, 2024

2013
Quadrotor Flight Control Parameters Optimization Based on Chaotic Estimation of Distribution Algorithm.
Proceedings of the Advances in Neural Networks - ISNN 2013, 2013