Giovanni Simonini

Orcid: 0000-0002-3466-509X

According to our database, Giovanni Simonini authored at least 53 papers between 2013 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Evaluation of Dataframe Libraries for Data Preparation on a Single Machine.
Proceedings of the 28th International Conference on Extending Database Technology, 2025

SPO-Join: Efficient Stream Inequality Join.
Proceedings of the 28th International Conference on Extending Database Technology, 2025

2024
Determining the Largest Overlap between Tables.
Proc. ACM Manag. Data, February, 2024

GSM: A generalized approach to Supervised Meta-blocking for scalable entity resolution.
Inf. Syst., February, 2024

Stream-aware indexing for distributed inequality join processing.
Inf. Syst., 2024

Overlap-Based Duplicate Table Detection.
Proceedings of the 32nd Symposium of Advanced Database Systems, 2024

Efficient Stream Join Processing: Novel Approaches and Challenges.
Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing, 2024

2023
Experiences and Lessons Learned from the SIGMOD Entity Resolution Programming Contests.
SIGMOD Rec., June, 2023

A big data platform exploiting auditable tokenization to promote good practices inside local energy communities.
Future Gener. Comput. Syst., April, 2023

BrewER: Entity Resolution On-Demand.
Proc. VLDB Endow., 2023

DXP: Billing Data Preparation for Big Data Analytics.
CoRR, 2023

Entity Resolution On-Demand for Querying Dirty Datasets.
Proceedings of the 31st Symposium of Advanced Database Systems, 2023

A general approach to Supervised Meta-Blocking.
Proceedings of the 31st Symposium of Advanced Database Systems, 2023

Bridging the Gap between Buyers and Sellers in Data Marketplaces with Personalized Datasets.
Proceedings of the 31st Symposium of Advanced Database Systems, 2023

HKS: Efficient Data Partitioning for Stateful Streaming.
Proceedings of Big Data Analytics and Knowledge Discovery, 2023

2022
Entity Resolution On-Demand.
Proc. VLDB Endow., 2022

Generalized Supervised Meta-blocking.
Proc. VLDB Endow., 2022

Generalized Supervised Meta-blocking (technical report).
CoRR, 2022

Progressive Entity Resolution with Node Embeddings.
Proceedings of the 30th Italian Symposium on Advanced Database Systems, 2022

ECDP: A Big Data Platform for the Smart Monitoring of Local Energy Communities.
Proceedings of the Workshops of the EDBT/ICDT 2022 Joint Conference, 2022

2021
Reproducible experiments on Three-Dimensional Entity Resolution with JedAI.
Inf. Syst., 2021

The Case for Multi-task Active Learning Entity Resolution (Discussion Paper).
Proceedings of the 29th Italian Symposium on Advanced Database Systems, 2021

2020
BLAST2: An Efficient Technique for Loose Schema Information Extraction from Heterogeneous Big Data Sources.
ACM J. Data Inf. Qual., 2020

Three-dimensional Entity Resolution with JedAI.
Inf. Syst., 2020

Entity Resolution on Camera Records Without Machine Learning.
Proceedings of the 2nd International Workshop on Challenges and Experiences from Data Integration to Knowledge Graphs co-located with 46th International Conference on Very Large Data Bases, 2020

Scaling Up Record-level Matching Rules.
Proceedings of the 28th Italian Symposium on Advanced Database Systems, 2020

RulER: Scaling Up Record-level Matching Rules.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

JedAI³: beyond batch, blocking-based Entity Resolution.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

Dagger: A Data (not code) Debugger.
Proceedings of the 10th Conference on Innovative Data Systems Research, 2020

2019
Schema-Agnostic Progressive Entity Resolution.
IEEE Trans. Knowl. Data Eng., 2019

Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics.
Proc. VLDB Endow., 2019

Scaling entity resolution: A loosely schema-aware approach.
Inf. Syst., 2019

Computing inter-document similarity with Context Semantic Analysis.
Inf. Syst., 2019

Schema-agnostic Progressive Entity Resolution (extended version).
CoRR, 2019

Entity Resolution and Data Fusion: An Integrated Approach.
Proceedings of the 27th Italian Symposium on Advanced Database Systems, 2019

SparkER: Scaling Entity Resolution in Spark.
Proceedings of Advances in Database Technology, 2019

2018
Towards Progressive Search-driven Entity Resolution.
Proceedings of the 26th Italian Symposium on Advanced Database Systems, 2018

MOMIS Dashboard: A Powerful Data Analytics Tool for Industry 4.0.
Proceedings of Transdisciplinary Engineering Methods for Social Innovation of Industry 4.0, 2018

BigDedup: A Big Data Integration Toolkit for Duplicate Detection in Industrial Scenarios.
Proceedings of Transdisciplinary Engineering Methods for Social Innovation of Industry 4.0, 2018

How Improve Set Similarity Join Based on Prefix Approach in Distributed Environment.
Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

Enhancing Loosely Schema-aware Entity Resolution with User Interaction.
Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

From Data Integration to Big Data Integration.
A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years, 2018

2017
SOPJ: A Scalable Online Provenance Join for Data Integration.
Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017

2016
BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution.
Proc. VLDB Endow., 2016

Providing Insight into Data Source Topics.
J. Data Semant., 2016

Enhancing Entity Resolution Efficiency with Loosely Schema-aware Techniques.
Proceedings of the 24th Italian Symposium on Advanced Database Systems, 2016

2015
Supporting Image Search with Tag Clouds: A Preliminary Approach.
Adv. Multim., 2015

Big data exploration with faceted browsing.
Proceedings of the 2015 International Conference on High Performance Computing & Simulation, 2015

2014
Discovering the Topics of a Data Source: a Statistical Approach.
Proceedings of the Workshop on Surfacing the Deep and the Social Web co-located with the 13th International Semantic Web Conference (ISWC 2014), 2014

Towards Declarative Imperative Data-parallel Systems.
Proceedings of the 22nd Italian Symposium on Advanced Database Systems, 2014

Using big data to support automatic Word Sense Disambiguation.
Proceedings of the International Conference on High Performance Computing & Simulation, 2014

2013
Keyword Search over Relational Databases: Issues, Approaches and Open Challenges.
Bridging Between Information Retrieval and Databases, 2013

