2025
Fuzzy Integration of Data Lake Tables.
CoRR, January, 2025
SLACE: A Monotone and Balance-Sensitive Loss Function for Ordinal Regression.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Diversity, Equity and Inclusion Activities in Database Conferences: A 2023 Report.
SIGMOD Rec., June, 2024
ALT-GEN: Benchmarking Table Union Search using Large Language Models.
Proceedings of Workshops at the 50th International Conference on Very Large Data Bases, 2024
Finding Support for Tabular LLM Outputs.
Proceedings of Workshops at the 50th International Conference on Very Large Data Bases, 2024
Eighth Workshop on Human-In-the-Loop Data Analytics (HILDA).
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024
ExtractGPT: Exploring the Potential of Large Language Models for Product Attribute Value Extraction.
Proceedings of the Information Integration and Web Intelligence, 2024
Gen-T: Table Reclamation in Data Lakes.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024
SC-Block: Supervised Contrastive Blocking Within Entity Resolution Pipelines.
Proceedings of the Semantic Web - 21st International Conference, 2024
A Generative Benchmark Creation Framework for Detecting Common Data Table Versions.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024
2023
The Battleship Approach to the Low Resource Entity Matching Problem.
Proc. ACM Manag. Data, December, 2023
Conceptually-grounded mapping patterns for Virtual Knowledge Graphs.
Data Knowl. Eng., May, 2023
One Algorithm to Rule Them All: On the Changing Roles of Humans in Data Integration.
Computer, April, 2023
Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V.
Proc. VLDB Endow., 2023
SANTOS: Relationship-based Semantic Table Union Search.
Proc. ACM Manag. Data, 2023
FlexER: Flexible Entity Resolution for Multiple Intents.
Proc. ACM Manag. Data, 2023
Product Attribute Value Extraction using Large Language Models.
CoRR, 2023
Generative Benchmark Creation for Table Union Search.
CoRR, 2023
LOUC: Leave-One-Out-Calibration Measure for Analyzing Human Matcher Performance.
CoRR, 2023
Product Information Extraction using ChatGPT.
CoRR, 2023
Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V (Technical Report).
CoRR, 2023
DIALITE: Discover, Align and Integrate Open Data Tables.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023
2022
Integrating Data Lake Tables.
Proc. VLDB Endow., 2022
PoWareMatch: A Quality-aware Deep Learning Approach to Improve Human Schema Matching.
ACM J. Data Inf. Qual., 2022
Process discovery with context-aware process trees.
Inf. Syst., 2022
Human's Role in-the-Loop.
CoRR, 2022
From Limited Annotated Raw Material Data to Quality Production Data: A Case Study in the Milk Industry (Technical Report).
CoRR, 2022
HumanAL: calibrating human matching beyond a single task.
Proceedings of the HILDA@SIGMOD 2022: Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2022
2021
Learning to Rerank Schema Matches.
IEEE Trans. Knowl. Data Eng., 2021
OSOUM Framework for Trading Data Research.
CoRR, 2021
Learning to Characterize Matching Experts.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021
ADaMaP: Automatic Alignment of Relational Data Sources using Mapping Patterns (Abstract).
Proceedings of the 34th International Workshop on Description Logics (DL 2021) part of Bratislava Knowledge September (BAKS 2021), 2021
From Limited Annotated Raw Material Data to Quality Production Data: A Case Study in the Milk Industry.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021
ADaMaP: Automatic Alignment of Relational Data Sources Using Mapping Patterns.
Proceedings of the Advanced Information Systems Engineering, 2021
2020
ADnEV: Cross-Domain Schema Matching using Deep Similarity Matrix Adjustment and Evaluation.
Proc. VLDB Endow., 2020
Mapping Patterns for Virtual Knowledge Graphs.
CoRR, 2020
Projection-based Relevance Model for Table Retrieval.
Proceedings of the Companion of The 2020 Web Conference 2020, 2020
Ad Hoc Table Retrieval using Intrinsic and Extrinsic Similarities.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020
(Artificial) Mind over Matter: Humans In and Humans Out in Matching.
Proceedings of the VLDB 2020 PhD Workshop co-located with the 46th International Conference on Very Large Databases (VLDB 2020), ONLINE, August 31, 2020
InCognitoMatch: Cognitive-aware Matching via Crowdsourcing.
Proceedings of the 2020 International Conference on Management of Data, 2020
Web Table Retrieval using Multimodal Deep Learning.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020
Query Performance Prediction for Multifield Document Retrieval.
Proceedings of the ICTIR '20: The 2020 ACM SIGIR International Conference on the Theory of Information Retrieval, 2020
Queueing Inference for Process Performance Analysis with Missing Life-Cycle Data.
Proceedings of the 2nd International Conference on Process Mining, 2020
Mapping Patterns for Virtual Knowledge Graphs (A Report on Ongoing Research).
Proceedings of the 33rd International Workshop on Description Logics (DL 2020) co-located with the 17th International Conference on Principles of Knowledge Representation and Reasoning (KR 2020), 2020
2019
A Cognitive Model of Human Bias in Matching.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019
Inductive Context-aware Process Discovery.
Proceedings of the International Conference on Process Mining, 2019
The Changing Roles of Humans and Algorithms in (Process) Matching.
Proceedings of the Business Process Management Workshops, 2019
2018
What Type of a Matcher Are You?: Coordination of Human and Algorithmic Matchers.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2018
Community Detection in Financial Entities: An Extended Abstract.
Proceedings of the Fourth International Workshop on Data Science for Macro-Modeling with Financial and Economic Datasets, 2018
(Artificial) Mind over Matter: Integrating Humans and Algorithms in Solving Matching Problems.
Proceedings of the 2018 International Conference on Management of Data, 2018
Heterogeneous Data Integration by Learning to Rerank Schema Matches.
Proceedings of the IEEE International Conference on Data Mining, 2018
2017
Instance-Based Process Matching Using Event-Log Information.
Proceedings of the Advanced Information Systems Engineering, 2017