Peter A. Boncz

Orcid: 0000-0001-6256-0140

Affiliations:
  • National Research Institute for Mathematics and Computer Science, Amsterdam, Netherlands


According to our database, Peter A. Boncz authored at least 175 papers between 1996 and 2024.

Awards

ACM Fellow

ACM Fellow 2022, "For contributions to the design of columnar, main-memory, and vectorized database systems".

Bibliography

2024
Making Data Management Better with Vectorized Query Processing.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

OpenIVM: a SQL-to-SQL Compiler for Incremental Computations.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Incremental Fusion: Unifying Compiled and Vectorized Query Execution.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Robust External Hash Aggregation in the Solid State Age.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

DuckDB-SGX2: The Good, The Bad and The Ugly within Confidential Analytical Query Processing.
Proceedings of the 20th International Workshop on Data Management on New Hardware, 2024

Accelerating GPU Data Processing using FastLanes Compression.
Proceedings of the 20th International Workshop on Data Management on New Hardware, 2024


2023
ALP: Adaptive Lossless floating-Point Compression.
Proc. ACM Manag. Data, December, 2023

DuckPGQ: Bringing SQL/PGQ to DuckDB.
Proc. VLDB Endow., 2023

The FastLanes Compression Layout: Decoding >100 Billion Integers per Second with Scalar Code.
Proc. VLDB Endow., 2023

Improving Data Minimization through Decentralized Data Architectures.
CoRR, 2023

The Linked Data Benchmark Council (LDBC): Driving Competition and Collaboration in the Graph Data Management Space.
Proceedings of the Performance Evaluation and Benchmarking: 15th TPC Technology Conference, 2023

The LDBC Social Network Benchmark Interactive Workload v2: A Transactional Graph Query Benchmark with Deep Delete Operations.
Proceedings of the Performance Evaluation and Benchmarking: 15th TPC Technology Conference, 2023

SPA: Economical and Workload-Driven Indexing for Data Analytics in the Cloud.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

DuckPGQ: Efficient Property Graph Queries in an analytical RDBMS.
Proceedings of the 13th Conference on Innovative Data Systems Research, 2023

SQL/PGQ: A Systems Perspective (Abstract).
Proceedings of the 15th Alberto Mendelzon International Workshop on Foundations of Data Management (AMW 2023), 2023

2022
The LDBC Social Network Benchmark: Business Intelligence Workload.
Proc. VLDB Endow., 2022

Excalibur: A Virtual Machine for Adaptive Fine-grained JIT-Compiled Query Execution based on VOILA.
Proc. VLDB Endow., 2022

Technical perspective: The 'art' of automatic benchmark extraction.
Commun. ACM, 2022

In-Memory Indexed Caching for Distributed Data Processing.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

2021
Optimistically Compressed Hash Tables & Strings in the USSR.
SIGMOD Rec., 2021

Technical Perspective DIAMetrics: Benchmarking Query Engines at Scale.
SIGMOD Rec., 2021

Errata for "Teseo and the Analysis of Structural Dynamic Graph".
Proc. VLDB Endow., 2021

Teseo and the Analysis of Structural Dynamic Graphs.
Proc. VLDB Endow., 2021

Charting the Design Space of Query Execution using VOILA.
Proc. VLDB Endow., 2021

An Epidemiological Model for contact tracing with the Dutch CoronaMelder App.
CoRR, 2021

The future is big graphs: a community view on graph processing systems.
Commun. ACM, 2021

TAKCO: A Platform for Extracting Novel Facts from Tables.
Proceedings of the Companion of The Web Conference 2021, 2021

Highlighting the Performance Diversity of Analytical Queries using VOILA.
Proceedings of the International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, 2021

2020
Make the most out of your SIMD investments: counter control flow divergence in compiled query pipelines.
VLDB J., 2020

Special issue on best papers of VLDB 2017.
VLDB J., 2020

Cuckoo Index: A Lightweight Secondary Index Structure.
Proc. VLDB Endow., 2020

FSST: Fast Random Access String Compression.
Proc. VLDB Endow., 2020

Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores.
Proc. VLDB Endow., 2020

The LDBC Graphalytics Benchmark.
CoRR, 2020

The LDBC Social Network Benchmark.
CoRR, 2020

Towards Testing ACID Compliance in the LDBC Social Network Benchmark.
Proceedings of the Performance Evaluation and Benchmarking, 2020

Tree-Encoded Bitmaps.
Proceedings of the 2020 International Conference on Management of Data, 2020

SparkFuzz: searching correctness regressions in modern query engines.
Proceedings of the 8th International Workshop on Testing Database Systems, 2020

EdgeFrame: Worst-Case Optimal Joins for Graph-Pattern Matching in Spark.
Proceedings of the GRADES-NDA'20: Proceedings of the 3rd Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), 2020

Efficient Query Processing with Optimistically Compressed Hash Tables & Strings in the USSR.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Adaptive Main-Memory Indexing for High-Performance Point-Polygon Joins.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

Extracting N-ary Facts from Wikipedia Table Clusters.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

White-box Compression: Learning and Exploiting Compact Table Representations.
Proceedings of the 10th Conference on Innovative Data Systems Research, 2020

2019
The Seattle Report on Database Research.
SIGMOD Rec., 2019

Performance-Optimal Filtering: Bloom overtakes Cuckoo at High-Throughput.
Proc. VLDB Endow., 2019

Extracting Novel Facts from Tables for Knowledge Graph Completion (Extended version).
CoRR, 2019

Role of the TPC in the Cloud Age.
Proceedings of the Performance Evaluation and Benchmarking for the Era of Cloud(s), 2019

[Demo] Low-latency Spark Queries on Updatable Data.
Proceedings of the 2019 International Conference on Management of Data, 2019

Estimating Cardinalities with Deep Sketches.
Proceedings of the 2019 International Conference on Management of Data, 2019

Extracting Novel Facts from Tables for Knowledge Graph Completion.
Proceedings of the Semantic Web - ISWC 2019, 2019

Packed Memory Arrays - Rewired.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Fast Concurrent Reads and Updates with PMAs.
Proceedings of the 2nd Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), 2019

Fluid Co-processing: GPU Bloom-filters for CPU Joins.
Proceedings of the 15th International Workshop on Data Management on New Hardware, 2019

Learned Cardinalities: Estimating Correlated Joins with Deep Learning.
Proceedings of the 9th Biennial Conference on Innovative Data Systems Research, 2019

2018
Processor Cache.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Main Memory DBMS.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Main Memory.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Query optimization through the looking glass, and what we found running the Join Order Benchmark.
VLDB J., 2018

Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask.
Proc. VLDB Endow., 2018

Database Architectures for Modern Hardware (Dagstuhl Seminar 18251).
Dagstuhl Reports, 2018

Adaptive Geospatial Joins for Modern Hardware.
CoRR, 2018

Optimizing Group-By and Aggregation using GPU-CPU Co-Processing.
Proceedings of the International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, 2018

G-CORE: A Core for Future Graph Query Languages.
Proceedings of the 2018 International Conference on Management of Data, 2018

Approximate Geospatial Joins with Precision Guarantees.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

An early look at the LDBC social network benchmark's business intelligence workload.
Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA), 2018

2017
Exploring Query Compilation Strategies for JIT, Vectorization and SIMD.
Proceedings of the International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, 2017

JCC-H: Adding Join Crossing Correlations with Skew to TPC-H.
Proceedings of the Performance Evaluation and Benchmarking for the Analytics Era, 2017

Multi-Hypothesis CSV Parsing.
Proceedings of the 29th International Conference on Scientific and Statistical Database Management, 2017

Extending SQL for Computing Shortest Paths.
Proceedings of the Fifth International Workshop on Graph Data-management Experiences & Systems, 2017

Faster across the PCIe bus: a GPU library for lightweight decompression: including support for patched compression schemes.
Proceedings of the 13th International Workshop on Data Management on New Hardware, 2017

2016
Special Issue: Modern Hardware.
VLDB J., 2016

Bitwise dimensional co-clustering for analytical workloads.
VLDB J., 2016

LDBC Graphalytics: A Benchmark for Large-Scale Graph Analysis on Parallel and Distributed Platforms.
Proc. VLDB Endow., 2016

Front Matter.
Proc. VLDB Endow., 2016

Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation.
Proceedings of the 2016 International Conference on Management of Data, 2016

VectorH: Taking SQL-on-Hadoop to the Next Level.
Proceedings of the 2016 International Conference on Management of Data, 2016

Exploiting Emergent Schemas to Make RDF Systems More Efficient.
Proceedings of the Semantic Web - ISWC 2016, 2016

Powerful and efficient bulk shortest-path queries: cypher language extension & Giraph implementation.
Proceedings of the Fourth International Workshop on Graph Data Management Experiences and Systems, Redwood Shores, CA, USA, June 24, 2016

2015
How Good Are Query Optimizers, Really?
Proc. VLDB Endow., 2015

Deriving an Emergent Relational Schema from RDF Data.
Proceedings of the 24th International Conference on World Wide Web, 2015

The LDBC Social Network Benchmark: Interactive Workload.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Graphalytics: A Big Data Benchmark for Graph-Processing Platforms.
Proceedings of the Third International Workshop on Graph Data Management Experiences and Systems, 2015

2014
Advances in Large-Scale RDF Data Management.
Proceedings of the Linked Open Data - Creating Knowledge Out of Interlinked Data, 2014

The linked data benchmark council: a graph and RDF industry benchmarking effort.
SIGMOD Rec., 2014

MonetDB/RDF: Discovering and Exploiting the Emergent Schema of RDF Data.
ERCIM News, 2014

Benchmarking Linked Open Data Management Systems.
ERCIM News, 2014

Parameter Curation for Benchmark Queries.
Proceedings of the Performance Characterization and Benchmarking. Traditional to Big Data, 2014

Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age.
Proceedings of the International Conference on Management of Data, 2014

How to generate query parameters in RDF benchmarks?
Proceedings of the Workshops Proceedings of the 30th International Conference on Data Engineering Workshops, 2014

Experiences with Virtuoso Cluster RDF Column Store.
Proceedings of the Linked Data Management., 2014

2013
The Design and Implementation of Modern Column-Oriented Database Systems.
Found. Trends Databases, 2013

The Linked Data Benchmark Council Project.
Datenbank-Spektrum, 2013

TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark.
Proceedings of the Performance Characterization and Benchmarking, 2013

Micro adaptivity in Vectorwise.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

LDBC: benchmarks for graph and RDF data management.
Proceedings of the 17th International Database Engineering & Applications Symposium, 2013

Recycling in pipelined query evaluation.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Automatic schema design for co-clustered tables.
Proceedings of the Workshops Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Optimizing database architecture for machine architecture: is there still hope?
Proceedings of the 25th GI-Workshop "Grundlagen von Datenbanken 2013", Ilmenau, 2013

Scientific discovery through weighted sampling.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012
From Cooperative Scans to Predictive Buffer Management.
Proc. VLDB Endow., 2012

Vectorwise: Beyond Column Stores.
IEEE Data Eng. Bull., 2012

Letter from the Special Issue Editor.
IEEE Data Eng. Bull., 2012

S3G2: A Scalable Structure-Correlated Social Graph Generator.
Proceedings of the Selected Topics in Performance Evaluation and Benchmarking, 2012

From x100 to vectorwise: opportunities, challenges and things most researchers do not think about.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Linked Stream Data Processing Engines: Facts and Figures.
Proceedings of the Semantic Web - ISWC 2012, 2012

Robust Runtime Optimization and Skew-Resistant Execution of Analytical SPARQL Queries on Pig.
Proceedings of the Semantic Web - ISWC 2012, 2012

Vectorwise: A Vectorized Analytical DBMS.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Heuristics-based query optimisation for SPARQL.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

Query Processing of Pre-partitioned Data Using Sandwich Operators.
Proceedings of the Enabling Real-Time Business Intelligence - 6th International Workshop, 2012

2011
Integration of vectorwise with ingres.
SIGMOD Rec., 2011

The meaningful use of big data: four perspectives - four challenges.
SIGMOD Rec., 2011

Vectorization vs. compilation in query execution.
Proceedings of the Seventh International Workshop on Data Management on New Hardware, 2011

SciBORQ: Scientific data management with Bounds On Runtime and Quality.
Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, 2011

2010
Projective Distribution of XQuery with Updates.
IEEE Trans. Knowl. Data Eng., 2010

Positional update handling in column stores.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

ROX: The robustness of a run-time XQuery optimizer against correlated data.
Proceedings of the 26th International Conference on Data Engineering, 2010

Database architecture (R)evolution: New hardware vs. new software.
Proceedings of the 26th International Conference on Data Engineering, 2010

Run-time Optimization for Pipelined Systems.
Proceedings of the 4th Alberto Mendelzon International Workshop on Foundations of Data Management, 2010

2009
Processor Cache.
Proceedings of the Encyclopedia of Database Systems, 2009

Main Memory DBMS.
Proceedings of the Encyclopedia of Database Systems, 2009

Main Memory.
Proceedings of the Encyclopedia of Database Systems, 2009

Proceedings of the Encyclopedia of Database Systems, 2009

Database Architecture Evolution: Mammals Flourished long before Dinosaurs became Extinct.
Proc. VLDB Endow., 2009

Column oriented Database Systems.
Proc. VLDB Endow., 2009

StreetTiVo: Using a P2P XML Database System to Manage Multimedia Data in Your Living Room.
Proceedings of the Advances in Data and Web Management, Joint International Conferences, 2009

ROX: run-time optimization of XQueries.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

Efficient Distribution of Full-Fledged XQuery.
Proceedings of the 25th International Conference on Data Engineering, 2009

Generic and updatable XML value indices covering equality and range lookups.
Proceedings of the 2009 EDBT/ICDT Workshops, Saint-Petersburg, Russia, March 22, 2009, 2009

Materialized View Selection in XML Databases.
Proceedings of the Database Systems for Advanced Applications, 2009

Space-economical partial gram indices for exact substring matching.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Emerging Database Systems in Support of Scientific Data.
Proceedings of the Scientific Data Management - Challenges, Technology, and Deployment., 2009

2008
Flexible and efficient IR using array databases.
VLDB J., 2008

Breaking the memory wall in MonetDB.
Commun. ACM, 2008

XRPC: distributed XQuery and update processing with heterogeneous XQuery engines.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

DSM vs. NSM: CPU performance tradeoffs in block-oriented query processing.
Proceedings of the 4th Workshop on Data Management on New Hardware, 2008

2007
Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

XRPC: Interoperable and Efficient Distributed XQuery.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Integrating XQuery and P2P in MonetDB/XQuery*.
Proceedings of the 1st Workshop on Emerging Research Opportunities for Web Data Management (EROW 2007) Collocated with the 11th International Conference on Database Theory (ICDT 2007), 2007

Vectorized data processing on the cell broadband engine.
Proceedings of the Workshop on Data Management on New Hardware, 2007

Efficient and Flexible Information Retrieval using MonetDB/X100.
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

2006
Report on the Second International Workshop on Data Management on Modern Hardware (DaMoN'06).
SIGMOD Rec., 2006

XIRAF - XML-based indexing and querying for digital forensics.
Digit. Investig., 2006

Efficient XQuery Support for Stand-Off Annotation.
Proceedings of the 3rd International Workshop on XQuery Implementation, 2006

MonetDB/X100 at the 2006 TREC Terabyte Track.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

MonetDB/XQuery: a fast XQuery processor powered by a relational engine.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

Super-Scalar RAM-CPU Cache Compression.
Proceedings of the 22nd International Conference on Data Engineering, 2006

MonetDB/XQuery-Consistent and Efficient Updates on the Pre/Post Plane.
Proceedings of the Advances in Database Technology, 2006

Architecture-conscious hashing.
Proceedings of the Workshop on Data Management on New Hardware, 2006

P2P XQuery and the StreetTiVo application.
Proceedings of the Scalable Data Management in Evolving Networks, 22.10. - 27.10.2006, 2006

06472 Abstracts Collection -- XQuery Implementation Paradigms.
Proceedings of the XQuery Implementation Paradigms, 19.11. - 22.11.2006, 2006

06472 Executive Summary -- XQuery Implementation Paradigms.
Proceedings of the XQuery Implementation Paradigms, 19.11. - 22.11.2006, 2006

06431 Working Group Report on Managing and Integrating Data in P2P Databases.
Proceedings of the Scalable Data Management in Evolving Networks, 22.10. - 27.10.2006, 2006

06431 Working Group Summary: P2P, Ad Hoc and Sensor Networks - All the Different or All the Same?.
Proceedings of the Scalable Data Management in Evolving Networks, 22.10. - 27.10.2006, 2006

Representing and Querying Multi-dimensional Markup for Question Answering.
Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing, 2006

2005
MonetDB/X100 - A DBMS In The CPU Cache.
IEEE Data Eng. Bull., 2005

Updating the Pre/Post Plane in MonetDB/XQuery.
Proceedings of the Second International Workshop on XQuery Implementation, 2005

Pathfinder: XQuery - The Relational Way.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

MonetDB/X100: Hyper-Pipelining Query Execution.
Proceedings of the Second Biennial Conference on Innovative Data Systems Research, 2005

2004
Reminiscences on Influential Papers.
SIGMOD Rec., 2004

Cache-Conscious Radix-Decluster Projections.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

AmbientDB: P2P Data Management Middleware for Ambient Intelligence.
Proceedings of the 2nd IEEE Conference on Pervasive Computing and Communications Workshops (PerCom 2004 Workshops), 2004

2003
AmbientDB: Relational Query Processing in a P2P Network.
Proceedings of the Databases, 2003

2002
Optimizing Main-Memory Join on Modern Hardware.
IEEE Trans. Knowl. Data Eng., 2002

Generic Database Cost Models for Hierarchical Memory Systems.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

2001
Macro- and Micro-parallelism in a DBMS.
Proceedings of the Euro-Par 2001: Parallel Processing, 2001

2000
Optimizing database architecture for the new bottleneck: memory access.
VLDB J., 2000

What Happens During a Join? Dissecting CPU and Memory Optimization Effects.
Proceedings of the VLDB 2000, 2000

1999
MIL Primitives for Querying a Fragmented World.
VLDB J., 1999

Database Architecture Optimized for the New Bottleneck: Memory Access.
Proceedings of the VLDB'99, 1999

1998
The Drill Down Benchmark.
Proceedings of the VLDB'98, 1998

Flattening an Object Algebra to Provide Performance.
Proceedings of the Fourteenth International Conference on Data Engineering, 1998

1996
Monet And Its Geographic Extensions: A Novel Approach to High Performance GIS Processing.
Proceedings of the Advances in Database Technology, 1996

High Performance Support for OO Traversals in Monet.
Proceedings of the Advances in Databases, 14th British National Conference on Databases, 1996

