olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models.
CoRR, February 2025
PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023