Maik Thiele

Orcid: 0000-0002-1665-977X

According to our database1, Maik Thiele authored at least 87 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Active Learning with Aggregated Uncertainties from Image Augmentations.
Proceedings of the Engineering Applications of Neural Networks, 2024

HTW-DIL at Touché: Multimodal Dense Information Retrieval for Arguments.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

2023
An Empirical Study on the Robustness of Active Learning for Biomedical Image Classification Under Model Transfer Scenarios.
Proceedings of the New Trends in Database and Information Systems, 2023

Comparing and Improving Active Learning Uncertainty Measures for Transformer Models.
Proceedings of the Advances in Databases and Information Systems, 2023

2022
To Softmax, or not to Softmax: that is the question when applying Active Learning for Transformer Models.
CoRR, 2022

ALWars: Combat-Based Evaluation of Active Learning Strategies.
Proceedings of the Advances in Information Retrieval, 2022

ImitAL: Learned Active Learning Strategy on Synthetic Data.
Proceedings of the Discovery Science - 25th International Conference, 2022

Transformer-Encoder and Decoder Models for Questions on Math.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Transformer-Encoder-Based Mathematical Information Retrieval.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2022

2021
Season- and Trend-aware Symbolic Approximation for Accurate and Efficient Time Series Matching.
Datenbank-Spektrum, 2021

ImitAL: Learning Active Learning Strategies from Synthetic Data.
CoRR, 2021

Accurate and Efficient Time Series Matching by Season- and Trend-aware Symbolic Approximation - Extended Version Including Additional Evaluation and Proofs.
CoRR, 2021

Pre-Trained Web Table Embeddings for Table Discovery.
Proceedings of the aiDM '21: Fourth Workshop in Exploiting AI Techniques for Data Management, 2021

An ALBERT-based Similarity Measure for Mathematical Answer Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

RecipeGM: A Hierarchical Recipe Generation Model.
Proceedings of the 37th IEEE International Conference on Data Engineering Workshops, 2021

TU_DBS in the ARQMath Lab 2021, CLEF.
Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

2020
A cost-based storage format selector for materialized results in big data frameworks.
Distributed Parallel Databases, 2020

Configuring Parallelism for Hybrid Layouts Using Multi-Objective Optimization.
Big Data, 2020

FacetE: exploiting web tables for domain-specific word embedding evaluation.
Proceedings of the 8th International Workshop on Testing Database Systems, 2020

Active Learning for Spreadsheet Cell Classification.
Proceedings of the Workshops of the EDBT/ICDT 2020 Joint Conference, 2020

RetroLive: Analysis of Relational Retrofitted Word Embeddings.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

Retro: Relation Retrofitting For In-Database Machine Learning on Textual Data.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

WeakAL: Combining Active Learning and Weak Supervision.
Proceedings of the Discovery Science - 23rd International Conference, 2020

Learning from Textual Data in Database Systems.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019
Parallelizing user-defined functions in the ETL workflow using orchestration style sheets.
Int. J. Appl. Math. Comput. Sci., 2019

Cardinality estimation with local deep learning models.
Proceedings of the Second International Workshop on Exploiting Artificial Intelligence Techniques for Data Management, 2019

DECO: A Dataset of Annotated Spreadsheets for Layout and Table Recognition.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

A Genetic-Based Search for Adaptive Table Recognition in Spreadsheets.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

XLIndy: Interactive Recognition and Information Extraction in Spreadsheets.
Proceedings of the ACM Symposium on Document Engineering 2019, 2019

Explore FREDDY: Fast Word Embeddings in Database Systems.
Proceedings of the Datenbanksysteme für Business, 2019

Fast Approximated Nearest Neighbor Joins For Relational Database Systems.
Proceedings of the Datenbanksysteme für Business, 2019

Automatically Configuring Parallelism for Hybrid Layouts.
Proceedings of the New Trends in Databases and Information Systems, 2019

2018
Intermediate Results Materialization Selection and Format for Data-Intensive Flows.
Fundam. Informaticae, 2018

A Cost-based Storage Format Selector for Materialization in Big Data Frameworks.
CoRR, 2018

Modeling Customers and Products with Word Embeddings from Receipt Data.
Proceedings of the 22nd International Database Engineering & Applications Symposium, 2018

Table Recognition in Spreadsheets via a Graph Representation.
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, 2018

ATUN-HL: Auto Tuning of Hybrid Layouts Using Workload and Data Characteristics.
Proceedings of the Advances in Databases and Information Systems, 2018

2017
Frequent patterns in ETL workflows: An empirical approach.
Data Knowl. Eng., 2017

Table Identification and Reconstruction in Spreadsheets.
Proceedings of the Advanced Information Systems Engineering, 2017

Context Similarity for Retrieval-Based Imputation.
Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, Sydney, Australia, July 31, 2017

Exploratory Ad-Hoc Analytics for Big Data.
Proceedings of the Handbook of Big Data Technologies, 2017

2016
Answering "Why Empty?" and "Why So Many?" queries in graph databases.
J. Comput. Syst. Sci., 2016

Quality measures for ETL processes: from goals to implementation.
Concurr. Comput. Pract. Exp., 2016

ResilientStore: A Heuristic-Based Data Format Selector for Intermediate Results.
Proceedings of the Model and Data Engineering - 6th International Conference, 2016

DebEAQ - debugging empty-answer queries on large data graphs.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

A Machine Learning Approach for Layout Inference in Spreadsheets.
Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - Volume 1: KDIR, Porto - Portugal, November 9, 2016

Cell Classification for Layout Recognition in Spreadsheets.
Proceedings of the Knowledge Discovery, Knowledge Engineering and Knowledge Management, 2016

Putting Web Tables into Context.
Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - Volume 1: KDIR, Porto - Portugal, November 9, 2016

2015
Considering User Intention in Differential Graph Queries.
J. Database Manag., 2015

Relaxation of subgraph queries delivering empty results.
Proceedings of the 27th International Conference on Scientific and Statistical Database Management, 2015

DrillBeyond: processing multi-result open world SQL queries.
Proceedings of the 27th International Conference on Scientific and Statistical Database Management, 2015

Top-k entity augmentation using consistent set covering.
Proceedings of the 27th International Conference on Scientific and Statistical Database Management, 2015

Column-specific context extraction for web tables.
Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

From Web Tables to Concepts: A Semantic Normalization Approach.
Proceedings of the Conceptual Modeling - 34th International Conference, 2015

POIESIS: a Tool for Quality-aware ETL Process Redesign.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

Building the Dresden Web Table Corpus: A Classification Approach.
Proceedings of the 2nd IEEE/ACM International Symposium on Big Data Computing, 2015

Towards a Hybrid Imputation Approach Using Web Tables.
Proceedings of the 2nd IEEE/ACM International Symposium on Big Data Computing, 2015

SCIT: A Schema Change Interpretation Tool for Dynamic-Schema Data Warehouses.
Proceedings of the Databases Theory and Applications, 2015

2014
GraphMCS: Discover the Unknown in Large Data Graphs.
Proceedings of the Workshops of the EDBT/ICDT 2014 Joint Conference (EDBT/ICDT 2014), 2014

A Framework for User-Centered Declarative ETL.
Proceedings of the 17th International Workshop on Data Warehousing and OLAP, 2014

Top-k Differential Queries in Graph Databases.
Proceedings of the Advances in Databases and Information Systems, 2014

2013
On-Demand ELT Architecture for Right-Time BI: Extending the Vision.
Int. J. Data Warehous. Min., 2013

Publish-time data integration for open data platforms.
Proceedings of the 2nd International Workshop on Open Data, 2013

Leveraging flexible data management with graph databases.
Proceedings of the First International Workshop on Graph Data Management Experiences and Systems, 2013

DeExcelerator: a framework for extracting relational data from partially structured documents.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

DrillBeyond: Open-World SQL Queries Using Web Tables.
Proceedings of the Datenbanksysteme für Business, 2013

Enhancing Named Entity Extraction by Effectively Incorporating the Crowd.
Proceedings of the Datenbanksysteme für Business, Technologie und Web (BTW),, 2013

2012
DrillBeyond: Enabling Business Analysts to Explore the Web of Open Data.
Proc. VLDB Endow., 2012

Frontiers in Crowdsourced Data Integration.
it Inf. Technol., 2012

OPEN - Enabling Non-expert Users to Extract, Integrate, and Analyze Open Data.
Datenbank-Spektrum, 2012

Identifying and weighting integration hypotheses on open data platforms.
Proceedings of the International Workshop on Open Data, 2012

An On-Demand ELT Architecture for Real-Time BI.
Proceedings of the Enabling Real-Time Business Intelligence - 6th International Workshop, 2012

2011
Echtzeit-Data-Warehouse-Systeme.
Datenbank-Spektrum, 2011

Data-Warehousing 3.0 - Die Rolle von Data-Warehouse-Systemen auf Basis von In-Memory Technologie.
Proceedings of the Innovative Unternehmensanwendungen mit In-Memory Data Management, 2011

A Domain-Specific Language for Do-It-Yourself Analytical Mashups.
Proceedings of the Current Trends in Web Engineering, 2011

A Flexible Graph-Based Data Model Supporting Incremental Schema Design and Evolution.
Proceedings of the Current Trends in Web Engineering, 2011

Next Generation Database Programming and Execution Environment.
Proceedings of the Database Programming Languages, 2011

2010
Qualitätsgetriebene Datenproduktionssteuerung in Echtzeit-Data-Warehouse-Systemen.
PhD thesis, 2010

One Clustering Process Fits All - A Visually Guided Ensemble Approach.
Proceedings of the LWA 2010, 2010

2009
Partition-based workload scheduling in living data warehouse environments.
Inf. Syst., 2009

Multi-objective scheduling for real-time data warehouses.
Comput. Sci. Res. Dev., 2009

Cardinality estimation in ETL processes.
Proceedings of the DOLAP 2009, 2009

Evaluation of Load Scheduling Strategies for Real-Time Data Warehouse Environments.
Proceedings of the Enabling Real-Time Business Intelligence - Third International Workshop, 2009

2007
Supporting Gene Expression Analysis Processes by a Service-Oriented Platform.
Proceedings of the 2007 IEEE International Conference on Services Computing (SCC 2007), 2007

Orchestrierung datenintensiver Prozesse - Einsatz von BPEL in der Genexpressionsanalyse.
VDM, ISBN: 978-3-8364-0239-2, 2007

2006
Optimistic Coarse-Grained Cache Semantics for Data Marts.
Proceedings of the 18th International Conference on Scientific and Statistical Database Management, 2006

Shrinked Data Marts Enabled for Negative Caching.
Proceedings of the Tenth International Database Engineering and Applications Symposium (IDEAS 2006), 2006


  Loading...