Peter W. J. Staar

According to our database, Peter W. J. Staar authored at least 35 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.







Docling Technical Report.
CoRR, 2024

Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs.
CoRR, 2024

INDUS: Effective and Efficient Language Models for Scientific Applications.
CoRR, 2024

KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents.
CoRR, 2024

KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Clustering Items From Adaptively Collected Inconsistent Feedback.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

ESG Accountability Made Easy: DocQA at Your Service.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Skin Tone Analysis for Representation in Educational Materials (STAR-ED) using machine learning.
npj Digit. Medicine, 2023

Optimized Table Tokenization for Table Structure Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

MolGrapher: Graph-based Visual Recognition of Chemical Structures.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

pNLP-Mixer: an Efficient all-MLP Architecture for Language.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

FETA: Towards Specializing Foundation Models for Expert Task Applications.
CoRR, 2022

BusiNet - a Light and Fast Text Detection Network for Business Documents.
CoRR, 2022

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis.
CoRR, 2022

pNLP-Mixer: an Efficient all-MLP Architecture for Language.
CoRR, 2022

FETA: Towards Specializing Foundational Models for Expert Task Applications.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Segmentation.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Unsupervised Term Extraction for Highly Technical Domains.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

TableFormer: Table Structure Understanding with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Domain Generalization by Learning a Bridge Across Domains.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness.
Proceedings of the IEEE 15th International Conference on Cloud Computing, 2022

Racial Representation Analysis in Dermatology Academic Materials.
Proceedings of the AMIA 2021, American Medical Informatics Association Annual Symposium, San Diego, CA, USA, October 30, 2021

Robust PDF Document Conversion using Recurrent Neural Networks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

DCA++: A software framework to solve correlated electron problems with modern quantum cluster methods.
Comput. Phys. Commun., 2020

An Information Extraction and Knowledge Graph Platform for Accelerating Biochemical Discoveries.
CoRR, 2019

Corpus Conversion Service: A Machine Learning Platform to Ingest Documents at Scale.
ERCIM News, 2018

Corpus Conversion Service: A machine learning platform to ingest documents at scale [Poster abstract].
CoRR, 2018

Exploring Graph Analytics with the PCJ Toolbox.
Proceedings of the Parallel Processing and Applied Mathematics, 2017


Analyzing the energy-efficiency of sparse matrix multiplication on heterogeneous systems: A comparative study of GPU, Xeon Phi and FPGA.
Proceedings of the 2016 IEEE International Symposium on Performance Analysis of Systems and Software, 2016

Stochastic Matrix-Function Estimators: Scalable Big-Data Kernels with High Performance.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Energy-efficient stochastic matrix function estimator for graph analytics on FPGA.
Proceedings of the 26th International Conference on Field Programmable Logic and Applications, 2016

An extreme-scale implicit solver for complex PDEs: highly heterogeneous flow in earth's mantle.
Proceedings of the International Conference for High Performance Computing, 2015

Taking a quantum leap in time to solution for simulations of high-Tc superconductors.
Proceedings of the International Conference for High Performance Computing, 2013
