Ziawasch Abedjan

Orcid: 0000-0002-2846-1373

Affiliations:
  • Leibniz Universität Hannover, Germany
  • TU Berlin, Germany (former)


According to our database1, Ziawasch Abedjan authored at least 77 papers between 2010 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
How Green is AutoML for Tabular Data?
Proceedings of the Proceedings 28th International Conference on Extending Database Technology, 2025

MaTElDa: Multi-Table Error Detection.
Proceedings of the Proceedings 28th International Conference on Extending Database Technology, 2025

2024
AutoML in heavily constrained applications.
VLDB J., July, 2024

Accelerating the Data Cleaning Systems Raha and Baran through Task and Data Parallelism.
Proceedings of Workshops at the 50th International Conference on Very Large Data Bases, 2024

Guiding Catalogue Enrichment with User Queries.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, 2024

Data Augmentation for Supervised Code Translation Learning.
Proceedings of the 21st IEEE/ACM International Conference on Mining Software Repositories, 2024

Towards Data Augmentation for Supervised Code Translation.
Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings, 2024

Learning Action Embeddings for Off-Policy Evaluation.
Proceedings of the Advances in Information Retrieval, 2024

2023
Blend: A Unified Data Discovery System.
CoRR, 2023

Demonstrating MATE and COCOA for Data Discovery.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

Duetcs: Code Style Transfer through Generation and Retrieval.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023


Duplicate Table Discovery with Xash.
Proceedings of the Datenbanksysteme für Business, 2023

2022
MATE: Multi-Attribute Table Extraction.
Proc. VLDB Endow., 2022

Enabling data-centric AI through data quality management and data literacy.
it Inf. Technol., 2022

Data Cleaning and AutoML: Would an Optimizer Choose to Clean?
Datenbank-Spektrum, 2022

Search Filter Ranking with Language-Aware Label Embeddings.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Measuring Fairness of Rankings under Noisy Sensitive Information.
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

2021
Automated Feature Engineering for Algorithmic Fairness.
Proc. VLDB Endow., 2021

Big Data - Ergebnisse und Herausforderungen im Jahr 2020.
LOG IN, 2021

From Cleaning before ML to Cleaning for ML.
IEEE Data Eng. Bull., 2021

Enforcing Constraints for Machine Learning Systems via Declarative Feature Selection: An Experimental Study.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Interactive Cross-language Code Retrieval with Auto-Encoders.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

RPT: Effective and Efficient Retrieval of Program Translations from Big Code.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering: Companion Proceedings, 2021

COCOA: COrrelation COefficient-Aware Data Augmentation.
Proceedings of the 24th International Conference on Extending Database Technology, 2021

Semi-Supervised Data Cleaning with Raha and Baran.
Proceedings of the 11th Conference on Innovative Data Systems Research, 2021

Combining Programming-by-Example with Transformation Discovery from large Databases.
Proceedings of the Datenbanksysteme für Business, 2021

2020
Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning.
Proc. VLDB Endow., 2020

Anatomy of Metadata for Data Curation.
ACM J. Data Inf. Qual., 2020

Algorithms for Big Data.
it Inf. Technol., 2020

Data Science für alle: Grundlagen der Datenprogrammierung.
Inform. Spektrum, 2020

Optimizing Machine Learning Workloads in Collaborative Environments.
Proceedings of the 2020 International Conference on Management of Data, 2020

CAFE: Constraint-Aware Feature Extraction from Large Databases.
Proceedings of the 10th Conference on Innovative Data Systems Research, 2020

2019
Data Profiling.
Proceedings of the Encyclopedia of Big Data Technologies., 2019

Particulate Matter Matters - The Data Science Challenge @ BTW 2019.
Datenbank-Spektrum, 2019

ED2: Two-stage Active Learning for Error Detection - Technical Report.
CoRR, 2019

REDS: Estimating the Performance of Error Detection Strategies Based on Dirtiness Profiles.
Proceedings of the 31st International Conference on Scientific and Statistical Database Management, 2019

Raha: A Configuration-Free Error Detection System.
Proceedings of the 2019 International Conference on Management of Data, 2019

Towards Automated Data Cleaning Workflows.
Proceedings of the Conference on "Lernen, Wissen, Daten, Analysen", Berlin, Germany, September 30, 2019

Unsupervised String Transformation Learning for Entity Consolidation.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

CLRL: Feature Engineering for Cross-Language Record Linkage.
Proceedings of the Advances in Database Technology, 2019

ED2: A Case for Active Learning in Error Detection.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Explanation of Air Pollution Using External Data Sources.
Proceedings of the Datenbanksysteme für Business, 2019


2018
Data Profiling
Synthesis Lectures on Data Management, Morgan & Claypool Publishers, ISBN: 978-3-031-01865-7, 2018

Data Management Systems Research at TU Berlin.
SIGMOD Rec., 2018

Metadata-driven error detection.
Proceedings of the 30th International Conference on Scientific and Statistical Database Management, 2018

Building Data Civilizer Pipelines with an Advanced Workflow Engine.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Aurum: A Data Discovery System.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

2017
Entity Consolidation: The Golden Record Problem.
CoRR, 2017

A Demo of the Data Civilizer System.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Data Profiling: A Tutorial.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

An Introduction to Data Profiling.
Proceedings of the Business Intelligence and Big Data - 7th European Summer School, 2017

The Data Civilizer System.
Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

2016
Detecting Data Errors: Where are we and what needs to be done?
Proc. VLDB Endow., 2016

Towards large-scale data discovery: position paper.
Proceedings of the Third International Workshop on Exploratory Search in Databases and the Web, 2016

DataXFormer: A robust transformation discovery system.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Data profiling.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

2015
Profiling relational data: a survey.
VLDB J., 2015

Temporal Rules Discovery for Web Data Cleaning.
Proc. VLDB Endow., 2015

DataXFormer: An Interactive Data Transformation Tool.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Dataxformer: Leveraging the Web for Semantic Transformations.
Proceedings of the Seventh Biennial Conference on Innovative Data Systems Research, 2015

2014
Improving RDF data with data mining.
PhD thesis, 2014

Assigning global relevance scores to DBpedia facts.
Proceedings of the Workshops Proceedings of the 30th International Conference on Data Engineering Workshops, 2014

Detecting unique column combinations on dynamic data.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Profiling and mining RDF data with ProLOD++.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Amending RDF Entities with New Facts.
Proceedings of the 3rd Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data co-located with 11th Extended Semantic Web Conference (ESWC 2014), 2014

Improving RDF Data with Data Mining.
Proceedings of the 8th Joint Workshop of the German Research Training Groups in Computer Science, 2014

DFD: Efficient Functional Dependency Discovery.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

2013
Scalable Discovery of Unique Column Combinations.
Proc. VLDB Endow., 2013

Improving RDF Data Through Association Rule Mining.
Datenbank-Spektrum, 2013

Synonym Analysis for Predicate Expansion.
Proceedings of the Semantic Web: Semantics and Big Data, 10th International Conference, 2013

2012
Discovering conditional inclusion dependencies.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Reconciling ontologies and the web of data.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
SPRINT: ranking search results by paths.
Proceedings of the EDBT 2011, 2011

Advancing the discovery of unique column combinations.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
Profiling linked open data with ProLOD.
Proceedings of the Workshops Proceedings of the 26th International Conference on Data Engineering, 2010


  Loading...