Stefan Grafberger

Orcid: 0000-0002-9884-9517

According to our database1, Stefan Grafberger authored at least 21 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Automated Provenance-Based Screening of ML Data Preparation Pipelines.
Datenbank-Spektrum, November, 2024

Snapcase - Regain Control over Your Predictions with Low-Latency Machine Unlearning.
Proc. VLDB Endow., August, 2024

Red Onions, Soft Cheese and Data: From Food Safety to Data Traceability for Responsible AI.
IEEE Data Eng. Bull., 2024

Messy Code Makes Managing ML Pipelines Difficult? Just Let LLMs Rewrite the Code!
CoRR, 2024

Instrumentation and Analysis of Native ML Pipelines via Logical Query Plans.
CoRR, 2024

Towards Interactively Improving ML Data Preparation Code via "Shadow Pipelines".
Proceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning, 2024

2023
shubhaguha/mlwhatif-demo: Demo for VLDB 2023.
Dataset, July, 2023

MLWHATIF: What If You Could Stop Re-Implementing Your Machine Learning Pipeline Analyses Over and Over?
Proc. VLDB Endow., 2023

Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines.
Proc. ACM Manag. Data, 2023

Improving Retrieval-Augmented Large Language Models via Data Importance Learning.
CoRR, 2023

How to Compliment a Human - Designing Affective and Well-being Promoting Conversational Things.
CoRR, 2023

Provenance Tracking for End-to-End Machine Learning Pipelines.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Proactively Screening Machine Learning Pipelines with ARGUSEYES.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

2022

Data distribution debugging in machine learning pipelines.
VLDB J., 2022

Towards data-centric what-if analysis for native machine learning pipelines.
Proceedings of the DEEM '22: Proceedings of the Sixth Workshop on Data Management for End-To-End Machine Learning Philadelphia, 2022

Screening Native Machine Learning Pipelines with ArgusEyes.
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022

2021
HedgeCut: Maintaining Randomised Trees for Low-Latency Machine Unlearning.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Lightweight Inspection of Data Preprocessing in Native Machine Learning Pipelines.
Proceedings of the 11th Conference on Innovative Data Systems Research, 2021

2019
Differential Data Quality Verification on Partitioned Data.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019


  Loading...