WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Quadrotor Flight Control Parameters Optimization Based on Chaotic Estimation of Distribution Algorithm.
Proceedings of the Advances in Neural Networks - ISNN 2013, 2013