Volker Markl

Orcid: 0009-0009-0964-026X

Affiliations:
  • German Research Center for Artificial Intelligence (DFKI), Berlin, Germany
  • TU Berlin, Institute of Software Engineering and Theoretical Computer Science, Germany


According to our database1, Volker Markl authored at least 314 papers between 1998 and 2024.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2020, "For contributions to query optimization, scalable data processing, and data programmability".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Looking Deeply into the Magic Mirror: An Interactive Analysis of Database Index Selection Approaches.
Proc. VLDB Endow., August, 2024

Assisted design of data science pipelines.
VLDB J., July, 2024

Fainder: A Fast and Accurate Index for Distribution-Aware Dataset Search.
Proc. VLDB Endow., July, 2024

POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance.
Proc. VLDB Endow., February, 2024

Efficient Placement of Decomposable Aggregation Functions for Stream Processing over Large Geo-Distributed Topologies.
Proc. VLDB Endow., February, 2024

Correction to: Survey of window types for aggregation in stream processing systems.
VLDB J., January, 2024

A survey on transactional stream processing.
VLDB J., 2024

Fault Tolerance Placement in the Internet of Things.
Proc. ACM Manag. Data, 2024

Query Compilation Without Regrets.
Proc. ACM Manag. Data, 2024

reBEN: Refined BigEarthNet Dataset for Remote Sensing Image Analysis.
CoRR, 2024

Multi-Backend Zonal Statistics Execution with Raven.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Counting Butterflies in Fully Dynamic Bipartite Graph Streams.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

Bridging the Gap: Complex Event Processing on Stream Processing Systems.
Proceedings of the Proceedings 27th International Conference on Extending Database Technology, 2024

Benchmarking Stream Join Algorithms on GPUs: A Framework and its Application to the State-of-the-art.
Proceedings of the Proceedings 27th International Conference on Extending Database Technology, 2024

Evaluation of Sampling Methods for Discovering Facts from Knowledge Graph Embeddings.
Proceedings of the Proceedings 27th International Conference on Extending Database Technology, 2024

Using and Enhancing NebulaStream - A Tutorial.
Proceedings of the 18th ACM International Conference on Distributed and Event-based Systems, 2024

NebulaStream - Data Stream Processing in Massively Distributed, Heterogeneous, Volatile Environments.
Proceedings of the 18th ACM International Conference on Distributed and Event-based Systems, 2024

2023
Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation.
Proc. VLDB Endow., November, 2023

Survey of window types for aggregation in stream processing systems.
VLDB J., September, 2023

Apache Wayang: A Unified Data Analytics Framework.
SIGMOD Rec., September, 2023

SheetReader: Efficient Specialized Spreadsheet Parsing.
Inf. Syst., May, 2023

Cracking-Like Join for Trusted Execution Environments.
Proc. VLDB Endow., 2023

Showcasing Data Management Challenges for Future IoT Applications with NebulaStream.
Proc. VLDB Endow., 2023

Optimistic Data Parallelism for FPGA-Accelerated Sketching.
Proc. VLDB Endow., 2023

XDB in Action: Decentralized Cross-Database Query Processing for Black-Box DBMSes.
Proc. VLDB Endow., 2023

MorphStream: Adaptive Scheduling for Scalable Transactional Stream Processing on Multicores.
Proc. ACM Manag. Data, 2023

Query Processing on Heterogeneous CPU/GPU Systems.
ACM Comput. Surv., 2023

Missing Value Imputation for Multi-attribute Sensor Data Streams via Message Propagation (Extended Version).
CoRR, 2023

Enhancing In-Memory Spatial Indexing with Learned Search.
CoRR, 2023

MorphStream: Scalable Processing of Transactions over Streams on Multicores.
CoRR, 2023

Artificial intelligence to advance Earth observation: a perspective.
CoRR, 2023

Towards Efficient and Secure UDF Execution with BabelfishLib (Lightning Talk).
Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023

TeeBench: Seamless Benchmarking in Trusted Execution Environments.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

In-Situ Cross-Database Query Processing.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

The Art of Losing to Win: Using Lossy Image Compression to Improve Data Loading in Deep Learning Pipelines.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

On Irregularity Localization for Scientific Data Analysis Workflows.
Proceedings of the Computational Science - ICCS 2023, 2023

Incremental Stream Query Merging.
Proceedings of the Proceedings 26th International Conference on Extending Database Technology, 2023

P2D: A Transpiler Framework for Optimizing Data Science Pipelines.
Proceedings of the Seventh Workshop on Data Management for End-to-End Machine Learning, 2023

Exploiting Access Pattern Characteristics for Join Reordering.
Proceedings of the 19th International Workshop on Data Management on New Hardware, 2023

Good Intentions: Adaptive Parameter Management via Intent Signaling.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Towards Unifying Query Interpretation and Compilation.
Proceedings of the 13th Conference on Innovative Data Systems Research, 2023

Workload Prediction for IoT Data Management Systems.
Proceedings of the Datenbanksysteme für Business, 2023

Learn What Really Matters: A Learning-to-Rank Approach for ML-based Query Optimization.
Proceedings of the Datenbanksysteme für Business, 2023

2022
Fast datalog evaluation for batch and stream graph processing.
World Wide Web, 2022

Making Learned Query Optimization Practical: A Technical Perspective.
SIGMOD Rec., 2022

Imperative or Functional Control Flow Handling: Why not the Best of Both Worlds?
SIGMOD Rec., 2022

DORIAN in action: Assisted Design of Data Science Pipelines.
Proc. VLDB Endow., 2022

Space-Efficient Random Walks on Streaming Graphs.
Proc. VLDB Endow., 2022

Satellite Image Search in AgoraEO.
Proc. VLDB Endow., 2022

IoT-PMA: Patient Health Monitoring in Medical IoT Ecosystems.
Open J. Internet Things, 2022

Navigating Compliance with Data Transfers in Federated Data Processing.
IEEE Data Eng. Bull., 2022

NebulaStream: Data Management for the Internet of Things.
Datenbank-Spektrum, 2022

Algorithms for Windowed Aggregations and Joins on Distributed Stream Processing Systems.
Datenbank-Spektrum, 2022

Editorial.
Datenbank-Spektrum, 2022

Handling Iterations in Distributed Dataflow Systems.
ACM Comput. Surv., 2022

Good Intentions: Adaptive Parameter Servers via Intent Signaling.
CoRR, 2022

Towards Loosely-Coupling Knowledge Graph Embeddings and Ontology-based Reasoning.
CoRR, 2022

Towards a Modular Data Management System Framework.
Proceedings of the 1st International Workshop on Composable Data Management Systems, 2022

In-Place Updates in Tree-Encoded Bitmaps.
Proceedings of the SSDBM 2022: 34th International Conference on Scientific and Statistical Database Management, Copenhagen, Denmark, July 6, 2022

NuPS: A Parameter Server for Machine Learning with Non-Uniform Parameter Access.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Rethinking Stateful Stream Processing with RDMA.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Materialization and Reuse Optimizations for Production Data Science Pipelines.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Rhino: Efficient Management of Very Large Distributed State for Stream Processing Engines [Abstract].
Proceedings of the LWDA 2022 Workshops: FGWM, 2022

Farming Your ML-based Query Optimizer's Food.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Efficient Specialized Spreadsheet Parsing for Data Science.
Proceedings of the 24th International Workshop on Design, 2022

DataFarm: Farm Your ML-based Query Optimizer's Food! - Human-Guided Training Data Generation -.
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022

2021
Scotty: General and Efficient Open-source Window Aggregation for Stream Processing Systems.
ACM Trans. Database Syst., 2021

Just Move It! Dynamic Parameter Allocation in Action.
Proc. VLDB Endow., 2021

What Is the Price for Joining Securely? Benchmarking Equi-Joins in Trusted Execution Environments.
Proc. VLDB Endow., 2021

In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All.
Proc. VLDB Endow., 2021

Babelfish: Efficient Execution of Polyglot Queries.
Proc. VLDB Endow., 2021

HyMAC: A Hybrid Matrix Computation System.
Proc. VLDB Endow., 2021

Compliant Geo-distributed Data Processing in Action.
Proc. VLDB Endow., 2021

Streaming Data through the IoT via Actor-Based Semantic Routing Trees.
Open J. Internet Things, 2021

Monitoring of Stream Processing Engines Beyond the Cloud: An Overview.
Open J. Internet Things, 2021

Continuous Training and Deployment of Deep Learning Models.
Datenbank-Spektrum, 2021

The Collaborative Research Center FONDA.
Datenbank-Spektrum, 2021

BigEarthNet-MM: A Large Scale Multi-Modal Multi-Label Benchmark Archive for Remote Sensing Image Classification and Retrieval.
CoRR, 2021

Replicate or Relocate? Non-Uniform Access in Parameter Servers.
CoRR, 2021

Expand your Training Limits! Generating Training Data for ML-based Data Management.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

The Power of Nested Parallelism in Big Data Processing - Hitting Three Flies with One Slap -.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Hybrid Evaluation for Distributed Iterative Matrix Computation.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Compliant Geo-distributed Query Processing.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021


Parallelizing Intra-Window Join on Multicores: An Experimental Study.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Efficient Control Flow in Dataflow Systems: When Ease-of-Use Meets High Performance.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Automating Data Quality Validation for Dynamic Data Ingestion.
Proceedings of the 24th International Conference on Extending Database Technology, 2021

Automatic Tuning of Read-Time Tolerances for Optimized On-Demand Data-Streaming from Sensor Nodes.
Proceedings of the 24th International Conference on Extending Database Technology, 2021

AdCom: Adaptive Combiner for Streaming Aggregations.
Proceedings of the 24th International Conference on Extending Database Technology, 2021

Database Systems and Information Management: Trends and a Vision.
Proceedings of the 10th International Conference on Data Science, 2021

An Energy-Efficient Stream Join for the Internet of Things.
Proceedings of the 17th International Workshop on Data Management on New Hardware, 2021

The Case for Distance-Bounded Spatial Approximations.
Proceedings of the 11th Conference on Innovative Data Systems Research, 2021

Towards Resilient Data Management for the Internet of Moving Things.
Proceedings of the Datenbanksysteme für Business, 2021

Fast CSV Loading Using GPUs and RDMA for In-Memory Data Processing.
Proceedings of the Datenbanksysteme für Business, 2021

2020
Agora: Bringing Together Datasets, Algorithms, Models and More in a Unified Ecosystem [Vision].
SIGMOD Rec., 2020

Dynamic Parameter Allocation in Parameter Servers.
Proc. VLDB Endow., 2020

Scotch: Generating FPGA-Accelerators for Sketching at Line Rate.
Proc. VLDB Endow., 2020

Demand-based Sensor Data Gathering with Multi-Query Optimization.
Proc. VLDB Endow., 2020

NebulaStream: Complex Analytics Beyond the Cloud.
Open J. Internet Things, 2020

A distributed data exchange engine for polystores.
it Inf. Technol., 2020

Distributed Graph Analytics with Datalog Queries in Flink.
Proceedings of the Software Foundations for Data Interoperability and Large Scale Graph Data Analytics, 2020

Rhino: Efficient Management of Very Large Distributed State for Stream Processing Engines.
Proceedings of the 2020 International Conference on Management of Data, 2020

Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects.
Proceedings of the 2020 International Conference on Management of Data, 2020

Grizzly: Efficient Stream Processing Through Adaptive Query Compilation.
Proceedings of the 2020 International Conference on Management of Data, 2020

Optimizing Machine Learning Workloads in Collaborative Environments.
Proceedings of the 2020 International Conference on Management of Data, 2020

Towards Unsupervised Data Quality Validation on Dynamic Data.
Proceedings of the Workshops of the EDBT/ICDT 2020 Joint Conference, 2020

Scaling a Public Transport Monitoring System to Internet of Things Infrastructures.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

Governor: Operator Placement for a Unified Fog-Cloud Environment.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

Disco: Efficient Distributed Window Aggregation.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

A survey of adaptive sampling and filtering algorithms for the internet of things.
Proceedings of the 14th ACM International Conference on Distributed and Event-based Systems, 2020

The NebulaStream Platform for Data and Application Management in the Internet of Things.
Proceedings of the 10th Conference on Innovative Data Systems Research, 2020

2019
Representations and Optimizations for Embedded Parallel Dataflow Languages.
ACM Trans. Database Syst., 2019

The Seattle Report on Database Research.
SIGMOD Rec., 2019

Analyzing Efficient Stream Processing on Modern Hardware.
Proc. VLDB Endow., 2019

An Intermediate Representation for Optimizing Machine Learning Pipelines.
Proc. VLDB Endow., 2019

AJoin: Ad-hoc Stream Joins at Scale.
Proc. VLDB Endow., 2019

Particulate Matter Matters - The Data Science Challenge @ BTW 2019.
Datenbank-Spektrum, 2019

SENSE: Scalable Data Acquisition from Distributed Sensors with Guaranteed Time Coherence.
CoRR, 2019

Agora: Towards An Open Ecosystem for Democratizing Data Science & Artificial Intelligence.
CoRR, 2019

End-to-End Benchmarking of Deep Learning Platforms.
Proceedings of the Performance Evaluation and Benchmarking for the Era of Cloud(s), 2019

Learning to Validate the Predictions of Black Box Machine Learning Models on Unseen Data.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2019

AStream: Ad-hoc Shared Stream Processing.
Proceedings of the 2019 International Conference on Management of Data, 2019

Extended Kalman Filter for Large Scale Vessels Trajectory Tracking in Distributed Stream Processing Systems.
Proceedings of the Advanced Analytics and Learning on Temporal Data, 2019

Bigearthnet: A Large-Scale Benchmark Archive for Remote Sensing Image Understanding.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

Muses: Distributed Data Migration System for Polystores.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Multi-Dimensional Genomic Data Management for Region-Preserving Operations.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Time Series Similarity Search for Streaming Data in Distributed Systems.
Proceedings of the Workshops of the EDBT/ICDT 2019 Joint Conference, 2019

Efficient Window Aggregation with General Stream Slicing.
Proceedings of the Advances in Database Technology, 2019

Resense: Transparent Record and Replay of Sensor Data in the Internet of Things.
Proceedings of the Advances in Database Technology, 2019

Continuous Deployment of Machine Learning Pipelines.
Proceedings of the Advances in Database Technology, 2019

Generating Reproducible Out-of-Order Data Streams.
Proceedings of the 13th ACM International Conference on Distributed and Event-based Systems, 2019

Performance Analysis and Automatic Tuning of Hash Aggregation on GPUs.
Proceedings of the 15th International Workshop on Data Management on New Hardware, 2019

A Comparison of Distributed Stream Processing Systems for Time Series Analysis.
Proceedings of the Datenbanksysteme für Business, 2019

Explanation of Air Pollution Using External Data Sources.
Proceedings of the Datenbanksysteme für Business, 2019

An Overview of Hawk: A Hardware-Tailored Code Generator for the Heterogeneous Many Core Age.
Proceedings of the Datenbanksysteme für Business, 2019

On-the-fly Reconfiguration of Query Plans for Stateful Stream Processing Engines.
Proceedings of the Datenbanksysteme für Business, 2019

2018
Big Data Platforms for Data Analytics.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Query Processing (in Relational Databases).
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

A survey of state management in big data processing systems.
VLDB J., 2018

Generating custom code for efficient query execution on heterogeneous processors.
VLDB J., 2018

Data Management Systems Research at TU Berlin.
SIGMOD Rec., 2018

Fault-Tolerance for Distributed Iterative Dataflows in Action.
Proc. VLDB Endow., 2018

Eine nationale Daten- und Analyseinfrastruktur als Grundlage digitaler Souveränität.
Inform. Spektrum, 2018

Efficient and Scalable k‑Means on GPUs.
Datenbank-Spektrum, 2018

Labyrinth: Compiling Imperative Control Flow to Parallel Dataflows.
CoRR, 2018

Benchmarking Distributed Stream Processing Engines.
CoRR, 2018

PolyBench: The First Benchmark for Polystores.
Proceedings of the Performance Evaluation and Benchmarking for the Era of Artificial Intelligence, 2018

Benchmarking Distributed Data Processing Systems for Machine Learning Workloads.
Proceedings of the Performance Evaluation and Benchmarking for the Era of Artificial Intelligence, 2018

Pipelined Query Processing in Coprocessor Environments.
Proceedings of the 2018 International Conference on Management of Data, 2018

Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Benchmarking Distributed Stream Data Processing Systems.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive Windowing.
Proceedings of the 21st International Conference on Extending Database Technology, 2018

Efficient SIMD Vectorization for Hashing in OpenCL.
Proceedings of the 21st International Conference on Extending Database Technology, 2018

Mosaics in Big Data: Stratosphere, Apache Flink, and Beyond.
Proceedings of the 12th ACM International Conference on Distributed and Event-based Systems, 2018

Efficient k-means on GPUs.
Proceedings of the 14th International Workshop on Data Management on New Hardware, 2018

ScootR: Scaling R Dataframes on Dataflow Systems.
Proceedings of the ACM Symposium on Cloud Computing, 2018

Heatflip: Temporal-Spatial Sampling for Progressive Heat Maps on Social Media Data.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Erratum to: Special issue on best papers of VLDB 2015.
VLDB J., 2017

Special issue on best papers of VLDB 2015.
VLDB J., 2017

On Fault Tolerance for Distributed Iterative Dataflow Processing.
IEEE Trans. Knowl. Data Eng., 2017

BlockJoin: Efficient Matrix Partitioning Through Joins.
Proc. VLDB Endow., 2017

Estimating Join Selectivities using Bandwidth-Optimized Kernel Density Models.
Proc. VLDB Endow., 2017

Efficiently Storing and Analyzing Genome Data in Database Systems.
Datenbank-Spektrum, 2017

Generating Custom Code for Efficient Query Execution on Heterogeneous Processors.
CoRR, 2017

PEEL: A Framework for Benchmarking Distributed Systems and Algorithms.
Proceedings of the Performance Evaluation and Benchmarking for the Analytics Era, 2017

Benchmarking Data Flow Systems for Scalable Machine Learning.
Proceedings of the 4th ACM SIGMOD Workshop on Algorithms and Systems for MapReduce and Beyond, 2017

Mosaics: Stratosphere, Flink and Beyond.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Enabling Wide Area Data Analytics with Collaborative Distributed Processing Pipelines (CDPPs).
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017

I<sup>2</sup>: Interactive Real-Time Visualization for Streaming Data.
Proceedings of the 20th International Conference on Extending Database Technology, 2017

PROTEUS: Scalable Online Machine Learning for Predictive Analytics and Real-Time Interactive Visualization.
Proceedings of the Workshops of the EDBT/ICDT 2017 Joint Conference (EDBT/ICDT 2017), 2017

STREAMLINE - Streamlined Analysis of Data at Rest and Data in Motion.
Proceedings of the Workshops of the EDBT/ICDT 2017 Joint Conference (EDBT/ICDT 2017), 2017

Optimized on-demand data streaming from sensor nodes.
Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

Gilbert: Declarative Sparse Linear Algebra on Massively Parallel Dataflow Systems.
Proceedings of the Datenbanksysteme für Business, 2017

Efficient Storage and Analysis of Genome Data in Databases.
Proceedings of the Datenbanksysteme für Business, 2017

Emma in Action: Deklarative Datenflüsse für Skalierbare Datenanalyse.
Proceedings of the Datenbanksysteme für Business, 2017

Large-Scale Data Stream Processing Systems.
Proceedings of the Handbook of Big Data Technologies, 2017

2016
VDDA: automatic visualization-driven data aggregation in relational databases.
VLDB J., 2016

Implicit Parallelism through Deep Language Embedding.
SIGMOD Rec., 2016

Apache Flink in current research.
it Inf. Technol., 2016

Towards Streamlined Big Data Analytics.
ERCIM News, 2016

Sharing Hash Codes for Multiple Purposes.
CoRR, 2016

Bridging the gap: towards optimization across linear and relational algebra.
Proceedings of the 3rd ACM SIGMOD Workshop on Algorithms and Systems for MapReduce and Beyond, 2016

Emma in Action: Declarative Dataflows for Scalable Data Analysis.
Proceedings of the 2016 International Conference on Management of Data, 2016

Semantification of Identifiers in Mathematics for Better Math Information Retrieval.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Evaluating Link-based Recommendations for Wikipedia.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016

Efficient fault-tolerance for iterative graph processing on distributed dataflow systems.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Cutty: Aggregate Sharing for User-Defined Windows.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Big data on a few pixels.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

JEDI: Joint Entity and Relation Detection using Type Inference.
Proceedings of ACL-2016 System Demonstrations, Berlin, Germany, August 7-12, 2016, 2016

2015
Front Matter.
Proc. VLDB Endow., 2015

Gesprengte Ketten - Smart Data, deklarative Datenanalyse, Apache Flink.
Inform. Spektrum, 2015

Apache Flink™: Stream and Batch Processing in a Single Engine.
IEEE Data Eng. Bull., 2015

The Operator Variant Selection Problem on Heterogeneous Hardware.
Proceedings of the International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures, 2015

Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Optimistic Recovery for Iterative Dataflows in Action.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Implicit Parallelism through Deep Language Embedding.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Challenges of Mathematical Information Retrievalin the NTCIR-11 Math Wikipedia Task.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Die Apache Flink Plattform zur parallelen Analyse von Datenströmen und Stapeldaten.
Proceedings of the LWA 2015 Workshops: KDML, 2015

Efficient sample generation for scalable meta learning.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Demonstrating Transfer-Efficient Sample Maintenance on Graphics Cards.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

Big Data-Zentren - Vorstellung und Panel.
Proceedings of the Datenbanksysteme für Business, 2015

SMART: An Application Framework for Real Time Big Data Analysis on Heterogeneous Cloud Environments.
Proceedings of the 15th IEEE International Conference on Computer and Information Technology, 2015

2014
Big Data - Eine interdisziplinäre Chance für die Wirtschaftsinformatik.
Wirtschaftsinf., 2014

The Stratosphere platform for big data analytics.
VLDB J., 2014

The Beckman Report on Database Research.
SIGMOD Rec., 2014

Breaking the Chains: On Declarative Data Analysis and Data Independence in the Big Data Era.
Proc. VLDB Endow., 2014

M4: A Visualization-Oriented Time Series Data Aggregation.
Proc. VLDB Endow., 2014

Faster Visual Analytics through Pixel-Perfect Aggregation.
Proc. VLDB Endow., 2014

Ocelot/HyPE: Optimized Data Processing on Heterogeneous Hardware.
Proc. VLDB Endow., 2014

Big Data - An Interdisciplinary Opportunity for Information Systems Research.
Bus. Inf. Syst. Eng., 2014

Composite Key Generation on a Shared-Nothing Architecture.
Proceedings of the Performance Characterization and Benchmarking. Traditional to Big Data, 2014

Demonstrating efficient query processing in heterogeneous environments.
Proceedings of the International Conference on Management of Data, 2014

Asymmetry in Large-Scale Graph Analysis, Explained.
Proceedings of the Second International Workshop on Graph Data Management Experiences and Systems, 2014

Evaluation of Similarity-Measure Factors for Formulae Based on the NTCIR-11 Math Task.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

Challenges and Opportunities in Big Data Generation.
Proceedings of the Informatiktage 2014: Big (Data) is beautiful, 2014

Breaking the Chains: On Declarative Analysis and Independence in the Big Data Era.
Proceedings of the 44. Jahrestagung der Gesellschaft für Informatik, Big Data, 2014

Demonstrating Self-Learning Algorithm Adaptivity in a Hardware-Oblivious Database Engine.
Proceedings of the 17th International Conference on Extending Database Technology, 2014

Optimization of Massively Parallel Data Flows.
Proceedings of the Large-Scale Data Analytics, 2014

2013
Hardware-Oblivious Parallelism for In-Memory Column-Stores.
Proc. VLDB Endow., 2013

Large-scale social-media analytics on stratosphere.
Proceedings of the 22nd International World Wide Web Conference, 2013

Iterative parallel data processing with stratosphere: an inside look.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Issues in big data testing and benchmarking.
Proceedings of the Sixth International Workshop on Testing Database Systems, 2013

Distributed matrix factorization with mapreduce using a series of broadcast-joins.
Proceedings of the Seventh ACM Conference on Recommender Systems, 2013

Querying Large Collections of Mathematical Publications: NTCIR10 Math Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Peeking into the optimization of data flow programs with MapReduce-style UDFs.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

"All roads lead to Rome": optimistic recovery for distributed iterative data processing.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Applying Stratosphere for Big Data Analytics.
Proceedings of the Datenbanksysteme für Business, 2013

2012
Spinning Fast Iterative Data Flows.
Proc. VLDB Endow., 2012

Myriad: Scalable and Expressive Data Generation.
Proc. VLDB Endow., 2012

A First Step Towards GPU-assisted Query Optimization.
Proceedings of the International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures, 2012

Benchmarking in the Cloud: What It Should, Can, and Cannot Be.
Proceedings of the Selected Topics in Performance Evaluation and Benchmarking, 2012

Scalable similarity-based neighborhood methods with MapReduce.
Proceedings of the Sixth ACM Conference on Recommender Systems, 2012

Correlation Support for Risk Evaluation in Databases.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Big Data Analytics on Modern Hardware Architectures: A Technology Survey.
Proceedings of the Business Intelligence - Second European Summer School, 2012

2011
Special issue: best papers of VLDB 2009.
VLDB J., 2011

Das Fachgebiet "Datenbanksysteme und Informationsmanagement" (DIMA) an der Technischen Universität Berlin stellt sich vor.
Datenbank-Spektrum, 2011

Information Management in the Cloud (Dagstuhl Seminar 11321).
Dagstuhl Reports, 2011

Tractor pulling on data warehouses.
Proceedings of the Fourth International Workshop on Testing Database Systems, 2011

Information Management in the Cloud - Parallel Dataflow Programming Beyond Map/Reduce.
Proceedings of the 17th International Conference on Management of Data, 2011

MapReduce and PACT - Comparing Data Parallel Programming Models.
Proceedings of the Datenbanksysteme für Business, 2011

Operators for Analyzing and Modifying Probabilistic Data - A Question of Efficiency.
Proceedings of the Datenbanksysteme für Business, 2011

2010
Special issue: best papers of VLDB 2008.
VLDB J., 2010

Massively Parallel Data Analysis with PACTs on Nephele.
Proc. VLDB Endow., 2010

Cloudy skies for data management.
Proceedings of the 26th International Conference on Data Engineering, 2010

10381 Summary and Abstracts Collection - Robust Query Processing.
Proceedings of the Robust Query Processing, 19.09. - 24.09.2010, 2010

Nephele/PACTs: a programming model and execution framework for web-scale analytical processing.
Proceedings of the 1st ACM Symposium on Cloud Computing, 2010

Handling of Uncertainty and Temporal Indeterminacy for What-if Analysis.
Proceedings of the Enabling Real-Time Business Intelligence - 4th International Workshop, 2010

2009
Query Processing (in Relational Databases).
Proceedings of the Encyclopedia of Database Systems, 2009

Cost-Based Predictive Spatiotemporal Join.
IEEE Trans. Knowl. Data Eng., 2009

Discovering and Exploiting Statistical Properties for Query Optimization in Relational Databases: A Survey.
Stat. Anal. Data Min., 2009

A Bayesian Approach to Estimating the Selectivity of Conjunctive Predicates.
Proceedings of the Datenbanksysteme in Business, 2009

Ad-Hoc Queries over Document Collections - A Case Study.
Proceedings of the Enabling Real-Time Business Intelligence - Third International Workshop, 2009

2008
Databases and Web 2.0 panel at VLDB 2007.
SIGMOD Rec., 2008

Parallelizing query optimization.
Proc. VLDB Endow., 2008

Informationsintegration im Web 2.0.
Datenbank-Spektrum, 2008

Damia: data mashups for intranet applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Data Mashups for Situational Applications.
Proceedings of the Model-Based Software and Data Integration, 2008

08421 Executive Summary - Uncertainty Management in Information Systems.
Proceedings of the Uncertainty Management in Information Systems, 12.10. - 17.10.2008, 2008

08421 Abstracts Collection - Uncertainty Management in Information Systems.
Proceedings of the Uncertainty Management in Information Systems, 12.10. - 17.10.2008, 2008

Situational Business Intelligence.
Proceedings of the Business Intelligence for the Real-Time Enterprise, 2008

2007
Consistent selectivity estimation via maximum entropy.
VLDB J., 2007

An Approach to Optimize Data Processing in Business Processes.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Detecting Attribute Dependencies from Query Feedback.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

DAMIA - A Data Mashup Fabric for Intranet Applications.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Progressive optimization in a shared-nothing parallel database.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Adaptively Reordering Joins during Query Execution.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Collecting and Maintaining Just-in-Time Statistics.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Impliance: A Next Generation Information Management Appliance.
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

Integrating Query-Feedback Based Statistics into Informix Dynamic Server.
Proceedings of the Datenbanksysteme in Business, 2007

2006
Autonomic features of the IBM DB2 universal database for linux, UNIX, and windows.
IEEE Trans. Syst. Man Cybern. Syst., 2006

Making DB2Products Self-Managing: Strategies and Experiences.
IEEE Data Eng. Bull., 2006

Challenges and Trends in Information Management.
Datenbank-Spektrum, 2006

Impliance: A Next Generation Information Management Appliance
CoRR, 2006

POP/FED: Progressive Query Optimization for Federated Queries in DB2.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

MAXENT: consistent cardinality estimation in action.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

An inside analysis of a genetic-programming based optimizer.
Proceedings of the Tenth International Database Engineering and Applications Symposium (IDEAS 2006), 2006

ISOMER: Consistent Histogram Construction Using Query Feedback.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Integrating a Maximum-Entropy Cardinality Estimator into DB2 UDB.
Proceedings of the Advances in Database Technology, 2006

Progressive Query Optimization for Federated Queries.
Proceedings of the Advances in Database Technology, 2006

Analyzing the Genetic Operations of an Evolutionary Query Optimizer.
Proceedings of the Flexible and Efficient Information Handling, 2006

2005
A learning optimizer for a federated database management system.
Inform. Forsch. Entwickl., 2005

Consistently Estimating the Selectivity of Conjuncts of Predicates.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Automated statistics collection in action.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

2004
Reminiscences on Influential Papers.
SIGMOD Rec., 2004

Progressive Optimization in Action.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

CORDS: Automatic Generation of Correlation Statistics in DB2.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

Automated Statistics Collection in DB2 UDB.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

Robust Query Processing through Progressive Optimization.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Automatic Relationship Discovery in Self-Managing Database Systems.
Proceedings of the 1st International Conference on Autonomic Computing (ICAC 2004), 2004

2003
LEO: An autonomic query optimizer for DB2.
IBM Syst. J., 2003

Combining Hierarchy Encoding and Pre-Grouping: Intelligent Grouping in Star Join Processing.
Proceedings of the 19th International Conference on Data Engineering, 2003

Transbase: a Leading-edge ROLAP Engine Supporting Multidimensional Indexing and Hierarchy Clustering.
Proceedings of the BTW 2003, 2003

2002
Processing Star Queries on Hierarchically-Clustered Fact Tables.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

Learning table access cardinalities with LEO.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Interval Processing with the UB-Tree.
Proceedings of the International Database Engineering & Applications Symposium, 2002

2001
LEO - DB2's LEarning Optimizer.
Proceedings of the VLDB 2001, 2001

Exploitation of Pre-sortedness for Sorting in Query Processing: The TempTris-Algorithm for UB-Trees.
Proceedings of the International Database Engineering & Applications Symposium, 2001

Interactive ROLAP on Large Datasets: A Case Study with UB-Trees.
Proceedings of the International Database Engineering & Applications Symposium, 2001

The Transbase Hypercube RDBMS: Multidimensional Indexing of Relational Tables.
Proceedings of the ICDE 2001, Demo Session Abstracts (Informal Proceedings), 2001

HINTA: A Linearization Algorithm for Physical Clustering of Complex OLAP Hierarchies.
Proceedings of the 3rd Intl. Workshop on Design and Management of Data Warehouses, 2001

2000
Mistral - Processing Relational Queries using a Multidimensional Access Technique.
Datenbank Rundbr., 2000

Integrating the UB-Tree into a Database System Kernel.
Proceedings of the VLDB 2000, 2000

A Cost Function for Uniformly Partitioned UB-Trees.
Proceedings of the 2000 International Database Engineering and Applications Symposium, 2000

Bulk Loading a Data Warehouse Built Upon a UB-Tree.
Proceedings of the 2000 International Database Engineering and Applications Symposium, 2000

Modellierung und Verwaltung hierarchisch strukturierter Informationen in relationalen Datenbanksystemen.
Proceedings of the 12. GI-Workshop Grundlagen von Datenbanken, 2000

Processing relational OLAP queries with UB-Trees and multidimensional hierarchical clustering.
Proceedings of the Second Intl. Workshop on Design and Management of Data Warehouses, 2000

1999
Improving OLAP Performance by Multidimensional Hierarchical Clustering.
Proceedings of the 1999 International Database Engineering and Applications Symposium, 1999

Processing Operations with Restrictions in RDBMS without External Sorting: The Tetris Algorithm.
Proceedings of the 15th International Conference on Data Engineering, 1999

Physical Data Modeling for Multidimensional Access Methods.
Proceedings of the 11. Workshop Grundlagen von Datenbanken, 1999

MISTRAL: Processing Relational Queries using a Multidimensional Access Technique.
Proceedings of the Ausgezeichnete Informatikdissertationen 1999, 1999

MISTRAL: Processing Relational Queries using a Multidimensional Access Technique.
DISDBIS 59, Infix Verlag, St. Augustin, Germany, ISBN: 3-89601-459-5, 1999

1998
The Tetris-Algorithm for Sorted Reading from UB-Tree.
Proceedings of the Kurzfassungen, 1998


  Loading...