Carsten Binnig

Orcid: 0000-0002-2744-7836

Affiliations:
  • TU Darmstadt, Germany
  • Brown University (former)


According to our database, Carsten Binnig authored at least 178 papers between 2002 and 2025.

Bibliography

2025
Benchmarking Analytical Query Processing in Intel SGXv2.
Proceedings of the 28th International Conference on Extending Database Technology, 2025

2024
Zero-sided RDMA: Network-driven Data Shuffling for Disaggregated Heterogeneous Cloud DBMSs.
Proc. ACM Manag. Data, February, 2024

Hardware Support for Cloud Database Systems in the Post-Moore's Law Era (Dagstuhl Seminar 24162).
Dagstuhl Reports, 2024

CardBench: A Benchmark for Learned Cardinality Estimation in Relational Databases.
CoRR, 2024

Box2Go: Collaborative Interactive Infobox Filling.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

LLMs for Data Engineering on Enterprise Data.
Proceedings of Workshops at the 50th International Conference on Very Large Data Bases, 2024

Demonstrating CAESURA: Language Models as Multi-Modal Query Planners.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

SmartNICs in the Cloud: The Why, What and How of In-network Processing for Data-Intensive Applications.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

International Workshop on Data Management on New Hardware (DaMoN).
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Costream: Learned Cost Models for Operator Placement in Edge-Cloud Environments.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

ZERoTuNE: Learned Zero-Shot Cost Models for Parallelism Tuning in Stream Processing.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

More of that, please: Domain Adaptation of Information Extraction through Examples & Feedback.
Proceedings of the 2024 Workshop on Human-In-the-Loop Data Analytics, 2024

Towards Extending XAI for Full Data Science Pipelines.
Proceedings of the 2024 Workshop on Human-In-the-Loop Data Analytics, 2024

Pythagoras: Semantic Type Detection of Numerical Data in Enterprise Data Lakes.
Proceedings of the 27th International Conference on Extending Database Technology, 2024

TrustDDL: A Privacy-Preserving Byzantine-Robust Distributed Deep Learning Framework.
Proceedings of the 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2024

Surprise Benchmarking: The Why, What, and How.
Proceedings of the Tenth International Workshop on Testing Database Systems, 2024

Parallel Processing of Temporal Anti-Joins in Memory.
Proceedings of the Database Systems for Advanced Applications, 2024

CAESURA: Language Models as Multi-Modal Query Planners.
Proceedings of the 14th Conference on Innovative Data Systems Research, 2024

Rethinking Table Retrieval from Data Lakes.
Proceedings of the Seventh International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, 2024

2023
SportsTables: A New Corpus for Semantic Type Detection (Extended Version).
Datenbank-Spektrum, November, 2023

WikiDBs 10k - A Corpus Of Relational Databases From Wikidata.
Dataset, August, 2023

Erratum to: Reviving the Workshop Series on Testing Database Systems - DBTest.
Datenbank-Spektrum, March, 2023

Technical Perspective for Skeena: Efficient and Consistent Cross-Engine Transactions.
SIGMOD Rec., 2023

Databases on Modern Networks: A Decade of Research that now comes into Practice.
Proc. VLDB Endow., 2023

Front Matter.
Proc. VLDB Endow., 2023

Distributed GPU Joins on Fast RDMA-capable Networks.
Proc. ACM Manag. Data, 2023

Steered Training Data Generation for Learned Semantic Type Detection.
Proc. ACM Manag. Data, 2023

Design Guidelines for Correct, Efficient, and Scalable Synchronization using One-Sided RDMA.
Proc. ACM Manag. Data, 2023

SPARE: A Single-Pass Neural Model for Relational Databases.
CoRR, 2023

Towards Foundation Models for Relational Databases [Vision Paper].
CoRR, 2023

Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables.
CoRR, 2023

WikiDBs: A Corpus of Relational Databases From Wikidata.
Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023

QFilter: Towards a Fine-Grained Access Control for Aggregation Query Processing over Secret Shared Data.
Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023

Towards Merkle Trees for High-Performance Data Systems.
Proceedings of the 1st Workshop on Verifiable Database Systems, 2023

Benchmarking Generative AI Performance Requires a Holistic Approach.
Proceedings of the Performance Evaluation and Benchmarking: 15th TPC Technology Conference, 2023

A New Primitive for Processing Temporal Joins.
Proceedings of the 18th International Symposium on Spatial and Temporal Data, 2023

Pythagoras: Semantic Type Detection of Numerical Data Using Graph Neural Networks.
Proceedings of the Lernen, 2023

SafeML: A Privacy-Preserving Byzantine-Robust Framework for Distributed Machine Learning Training.
Proceedings of the IEEE International Conference on Data Mining, 2023

DiffML: End-to-end Differentiable ML Pipelines.
Proceedings of the Seventh Workshop on Data Management for End-to-End Machine Learning, 2023

Zero-sided RDMA: Network-driven Data Shuffling.
Proceedings of the 19th International Workshop on Data Management on New Hardware, 2023

Is Scalable OLTP in the Cloud a Solved Problem?
Proceedings of the 13th Conference on Innovative Data Systems Research, 2023

Benchmarking the Second Generation of Intel SGX for Machine Learning Workloads.
Proceedings of the Datenbanksysteme für Business, 2023

SportsTables: A new Corpus for Semantic Type Detection.
Proceedings of the Datenbanksysteme für Business, 2023

WannaDB: Ad-hoc SQL Queries over Text Collections.
Proceedings of the Datenbanksysteme für Business, 2023

OmniscientDB: A Large Language Model-Augmented DBMS That Knows What Other DBMSs Do Not Know.
Proceedings of the Sixth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, 2023

Zero-Shot Cost Models for Parallel Stream Processing.
Proceedings of the Sixth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, 2023

2022
Towards a Benchmark for Shared Databases [Vision Paper].
Datenbank-Spektrum, November, 2022

Editorial.
Datenbank-Spektrum, November, 2022

Reviving the Workshop Series on Testing Database Systems - DBTest.
Datenbank-Spektrum, November, 2022

The full story of 1000 cores.
VLDB J., 2022

DFI: The Data Flow Interface for High-Speed Networks.
SIGMOD Rec., 2022

DBMS Annihilator: A High-Performance Database Workload Generator in Action.
Proc. VLDB Endow., 2022

Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction.
Proc. VLDB Endow., 2022

Demonstrating Quest: A Query-Driven Framework to Explain Classification Models on Tabular Data.
Proc. VLDB Endow., 2022

Demonstrating CAT: Synthesizing Data-Aware Conversational Agents for Transactional Databases.
Proc. VLDB Endow., 2022

P4DB - The Case for In-Network OLTP (Extended Technical Report).
CoRR, 2022

ASET: Ad-hoc Structured Exploration of Text Collections [Extended Abstract].
CoRR, 2022

P4DB - The Case for In-Network OLTP.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Demonstrating ASET: Ad-hoc Structured Exploration of Text Collections.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Introducing quest: a query-driven framework to explain classification models on tabular data.
Proceedings of the HILDA@SIGMOD 2022: Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2022

GaccO - A GPU-accelerated OLTP DBMS.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

ScaleStore: A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Hybrid Database Operations: Learned Operations for Seamless Querying of Textual and Tabular Data.
Proceedings of the LWDA 2022 Workshops: FGWM, 2022

Know Better - A Clickbait Resolving Challenge.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Zero-shot cost models for distributed stream processing.
Proceedings of the 16th ACM International Conference on Distributed and Event-based Systems, 2022

PANDA: performance prediction for parallel and dynamic stream processing.
Proceedings of the 16th ACM International Conference on Distributed and Event-based Systems, 2022

Towards Decentralized Parameter Servers for Secure Federated Learning.
Proceedings of the 11th International Conference on Data Science, 2022

Benchmarking the Second Generation of Intel SGX Hardware.
Proceedings of the International Conference on Management of Data, 2022

EFA: A Viable Alternative to RDMA over InfiniBand for DBMSs?
Proceedings of the International Conference on Management of Data, 2022

One Model to Rule them All: Towards Zero-Shot Learning for Databases.
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022

A DBMS-centric Evaluation of BlueField DPUs on Fast Networks.
Proceedings of the International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, 2022

2021
Chiller: Contention-centric Transaction Execution and Data Partitioning for Modern Networks.
SIGMOD Rec., 2021

ReStore - Neural Data Completion for Relational Databases.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Scalable and Flexible High-Performance In-Network Processing of Hash Joins in Distributed Databases.
Proceedings of the International Conference on Field-Programmable Technology, 2021

ACID-V: Towards a New Class of DBMSs for Data Sharing.
Proceedings of the Heterogeneous Data Management, Polystores, and Analytics for Healthcare, 2021

Netted?! How to Improve the Usefulness of Spider & Co.
Proceedings of the Second International Conference on Design of Experimental Search & Information REtrieval Systems, 2021

GalOP: Towards a GPU-accelerated OLTP DBMS.
Proceedings of the 17th International Workshop on Data Management on New Hardware, 2021

AnyDB: An Architecture-less DBMS for Any Workload.
Proceedings of the 11th Conference on Innovative Data Systems Research, 2021

Towards Learned Metadata Extraction for Data Lakes.
Proceedings of the Datenbanksysteme für Business, 2021

Exploiting 3D Memory for Accelerated In-Network Processing of Hash Joins in Distributed Databases.
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2021

2020
Sharing Opportunities for OLTP Workloads in Different Isolation Levels.
Proc. VLDB Endow., 2020

DeepDB: Learn from Data, not from Queries!
Proc. VLDB Endow., 2020

On the Throughput Optimization in Large-scale Batch-processing Systems.
Perform. Evaluation, 2020

RDMA Communication Patterns.
Datenbank-Spektrum, 2020

It's AI Match: A Two-Step Approach for Schema Matching Using Embeddings.
Proceedings of the AIDB@VLDB 2020, 2020

DBPal: A Fully Pluggable NL2SQL Training Pipeline.
Proceedings of the 2020 International Conference on Management of Data, 2020

Workload merging potential in SAP Hybris.
Proceedings of the 8th International Workshop on Testing Database Systems, 2020

DB4ML - An In-Memory Database Kernel with Machine Learning Support.
Proceedings of the 2020 International Conference on Management of Data, 2020

Learning a Partitioning Advisor for Cloud Databases.
Proceedings of the 2020 International Conference on Management of Data, 2020

IDEBench: A Benchmark for Interactive Data Exploration.
Proceedings of the 2020 International Conference on Management of Data, 2020

Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data.
Proceedings of the 2020 International Conference on Management of Data, 2020

Robust Performance of Main Memory Data Structures by Configuration.
Proceedings of the 2020 International Conference on Management of Data, 2020

Summarization Beyond News: The Automatically Acquired Fandom Corpora.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

The tale of 1000 Cores: an evaluation of concurrency control on real(ly) large multi-socket hardware.
Proceedings of the 16th International Workshop on Data Management on New Hardware, 2020

DBMS Fitting: Why should we learn what we already know?
Proceedings of the 10th Conference on Innovative Data Systems Research, 2020

2019
BlockchainDB - A Shared Database on Blockchains.
Proc. VLDB Endow., 2019

A Morsel-Driven Query Execution Engine for Heterogeneous Multi-Cores.
Proc. VLDB Endow., 2019

ProgressiveDB - Progressive Data Analytics as a Middleware.
Proc. VLDB Endow., 2019

DBPal: Weak Supervision for Learning a Natural Language Interface to Databases.
CoRR, 2019

Learning a Partitioning Advisor with Deep Reinforcement Learning.
CoRR, 2019

Democratizing Data Science through Interactive Curation of ML Pipelines.
Proceedings of the 2019 International Conference on Management of Data, 2019

Towards learning a partitioning advisor with deep reinforcement learning.
Proceedings of the Second International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, 2019

Interactive Summarization of Large Document Collections.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2019

FITing-Tree: A Data-aware Index Structure.
Proceedings of the 2019 International Conference on Management of Data, 2019

BlockchainDB - Towards a Shared Database on Blockchains.
Proceedings of the 2019 International Conference on Management of Data, 2019

Designing Distributed Tree-based Index Structures for Fast RDMA-capable Networks.
Proceedings of the 2019 International Conference on Management of Data, 2019

XAI: A Middleware for Scalable AI.
Proceedings of the 8th International Conference on Data Science, 2019

DPI: The Data Processing Interface for Modern Networks.
Proceedings of the 9th Biennial Conference on Innovative Data Systems Research, 2019

DPI: The Data Processing Interface for Modern Networks (Extended Abstract).
Proceedings of the Datenbanksysteme für Business, 2019

Skew-resilient Query Processing for Fast Networks.
Proceedings of the Datenbanksysteme für Business, 2019

High-Performance In-Network Data Processing.
Proceedings of the 10th International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, 2019

2018
Replication with Snapshot Isolation.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Estimating the Impact of Unknown Unknowns on Aggregate Query Results.
ACM Trans. Database Syst., 2018

RODI: Benchmarking relational-to-ontology mapping generation quality.
Semantic Web, 2018

Sherlock: A System for Interactive Summarization of Large Text Collections.
Proc. VLDB Endow., 2018

OLTPShare: The Case for Sharing in OLTP Workloads.
Proc. VLDB Endow., 2018

Scalable Data Management on Modern Networks.
Datenbank-Spektrum, 2018

Progressive Data Science: Potential and Challenges.
CoRR, 2018

Chiller: Contention-centric Transaction Execution and Data Partitioning for Fast Networks.
CoRR, 2018

Model-based Approximate Query Processing.
CoRR, 2018

An End-to-end Neural Natural Language Interface for Databases.
CoRR, 2018

A-Tree: A Bounded Approximate Index Structure.
CoRR, 2018

Towards Interactive Curation & Automatic Tuning of ML Pipelines.
Proceedings of the Second Workshop on Data Management for End-To-End Machine Learning, 2018

Evaluating Visual Data Analysis Systems: A Discussion Report.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2018

DBPal: A Learned NL-Interface for Databases.
Proceedings of the 2018 International Conference on Management of Data, 2018

Automatic Mapping of the Sum-Product Network Inference Problem to FPGA-Based Accelerators.
Proceedings of the 36th IEEE International Conference on Computer Design, 2018

Towards Interactive Data Exploration.
Proceedings of the 7th International Conference on Data Science, 2018

Boosting scalable data analytics with modern programmable networks.
Proceedings of the 14th International Workshop on Data Management on New Hardware, 2018

2017
The End of a Myth: Distributed Transactions Can Scale.
Proc. VLDB Endow., 2017

Revisiting Reuse for Approximate Query Processing.
Proc. VLDB Endow., 2017

Rethinking Distributed Query Execution on High-Speed Networks.
IEEE Data Eng. Bull., 2017

Safe Visual Data Exploration.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Controlling False Discoveries During Interactive Data Exploration.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

What you see is not what you get!: Detecting Simpson's Paradoxes during Data Exploration.
Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics, 2017

Revisiting Reuse in Main Memory Database Systems.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

SiliconDB: rethinking DBMSs for modern heterogeneous co-processor environments.
Proceedings of the 13th International Workshop on Data Management on New Hardware, 2017

Toward Sustainable Insights, or Why Polygamy is Bad for You.
Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

Interactive Data Analytics for the Humanities.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2017

IncMap: A Journey towards Ontology-based Data Integration.
Proceedings of the Datenbanksysteme für Business, 2017

Spotlytics: How to Use Cloud Market Places for Analytics?
Proceedings of the Datenbanksysteme für Business, 2017


2016
The End of Slow Networks: It's Time for a Redesign.
Proc. VLDB Endow., 2016

HILDA 2016 Workshop: A Report.
IEEE Data Eng. Bull., 2016

Towards a Benchmark for Interactive Data Exploration.
IEEE Data Eng. Bull., 2016

The End of a Myth: Distributed Transactions Can Scale.
CoRR, 2016

Making the Case for Query-by-Voice with EchoQuery.
Proceedings of the 2016 International Conference on Management of Data, 2016

VisTrees: fast indexes for interactive data exploration.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2016

The case for interactive data exploration accelerators (IDEAs).
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2016

2015
Vizdom: Interactive Analytics through Pen and Touch.
Proc. VLDB Endow., 2015

An Architecture for Compiling UDF-centric Workflows.
Proc. VLDB Endow., 2015

The End of Slow Networks: It's Time for a Redesign.
CoRR, 2015

Locality-aware Partitioning in Parallel Database Systems.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Cost-based Fault-tolerance for Parallel Data Processing.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Spotgres - parallel data analytics on Spot Instances.
Proceedings of the 31st IEEE International Conference on Data Engineering Workshops, 2015

RODI: A Benchmark for Automatic Mapping Generation in Relational-to-Ontology Data Integration.
Proceedings of the Semantic Web. Latest Advances and New Domains, 2015

2014
Distributed snapshot isolation: global transactions pay globally, local transactions pay locally.
VLDB J., 2014

DoomDB: kill the query.
Proceedings of the International Conference on Management of Data, 2014

How to Best Find a Partner? An Evaluation of Editing Approaches to Construct R2RML Mappings.
Proceedings of the Semantic Web: Trends and Challenges - 11th International Conference, 2014

XDB - A Novel Database Architecture for Data Analytics as a Service.
Proceedings of the 2014 IEEE International Congress on Big Data, Anchorage, AK, USA, June 27, 2014

2013
IncMap: pay as you go matching of relational schemata to OWL ontologies.
Proceedings of the 8th International Workshop on Ontology Matching co-located with the 12th International Semantic Web Conference (ISWC 2013), 2013

Pay as you go Matching of Relational Schemata to OWL Ontologies with IncMap.
Proceedings of the ISWC 2013 Posters & Demonstrations Track, 2013

SQLScript: Efficiently Analyzing Big Enterprise Data in SAP HANA.
Proceedings of the Datenbanksysteme für Business, 2013

2012
FunSQL: it is time to make SQL functional.
Proceedings of the 2012 Joint EDBT/ICDT Workshops, Berlin, Germany, March 30, 2012

2010
A framework for testing DBMS features.
VLDB J., 2010

2009
How is the weather tomorrow?: towards a benchmark for the cloud.
Proceedings of the 2nd International Workshop on Testing Database Systems, 2009

Dictionary-based order-preserving string compression for main memory column stores.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

2008
The ETH Zurich systems group and enterprise computing center.
SIGMOD Rec., 2008

Towards Automatic Test Database Generation.
IEEE Data Eng. Bull., 2008

Multi-RQP: generating test databases for the functional testing of OLTP applications.
Proceedings of the 1st International Workshop on Testing Database Systems, 2008

Automatic Result Verification for the Functional Testing of a Query Language.
Proceedings of the 24th International Conference on Data Engineering, 2008

Generierung Relevanter Testdatenbanken [Generating Meaningful Test Databases].
Proceedings of the Ausgezeichnete Informatikdissertationen 2008, 2008

Generating meaningful test databases.
PhD thesis, 2008

2007
Die SIKOSA-Methodik.
Wirtschaftsinf., 2007

QAGen: generating query-aware test databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Reverse Query Processing.
Proceedings of the 23rd International Conference on Data Engineering, 2007

2006
Testing database applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

2002
Experience Gained in the Development of a Route Planner for Mobile End Devices.
Proceedings of the IASTED International Conference on Communications, 2002

Development of an UIML Renderer for Different Target Languages.
Proceedings of the Computer-Aided Design of User Interfaces III, 2002

