OmniMatch: Effective Self-Supervised Any-Join Discovery in Tabular Data Repositories.
CoRR, 2024
Data Lakes: A Survey of Functions and Systems (Extended abstract).
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024
Data Lakes: A Survey of Functions and Systems.
IEEE Trans. Knowl. Data Eng., December, 2023
Amalur: Data Integration Meets Machine Learning.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023
SiMa: Effective and Efficient Data Silo Federation Using Graph Neural Networks.
CoRR, 2022
Bridging the Gap between Data Integration and ML Systems.
CoRR, 2022
Amalur: Next-generation Data Integration in Data Lakes.
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022
Valentine in Action: Matching Tabular Data at Scale.
Proc. VLDB Endow., 2021
A study of the sensitivity of biomechanical models of the spine for scoliosis brace design.
Comput. Methods Programs Biomed., 2021
Valentine: Evaluating Matching Techniques for Dataset Discovery.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021
REMA: Graph Embeddings-based Relational Schema Matching.
Proceedings of the Workshops of the EDBT/ICDT 2020 Joint Conference, 2020
Data as a Language: A Novel Approach to Data Integration.
Proceedings of the VLDB 2019 PhD Workshop, 2019
ChronicOnline: Implementing a mHealth solution for monitoring and early alerting in chronic obstructive pulmonary disease.
Health Informatics J., 2017
Probabilistic k-Nearest Neighbor Monitoring of Moving Gaussians.
Proceedings of the 29th International Conference on Scientific and Statistical Database Management, 2017