LiLTv2: Language-substitutable Layout-image Transformer for Visual Information Extraction.
ACM Trans. Multim. Comput. Commun. Appl., March, 2025
PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024