Matteo Interlandi

Orcid: 0000-0002-5756-8321

Affiliations:
  • Microsoft, USA


According to our database1, Matteo Interlandi authored at least 78 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Eighth Workshop on Data Management for End-to-End Machine Learning (DEEM).
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

Data Debugging with Shapley Importance over Machine Learning Pipelines.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Pushing ML Predictions into DBMSs (Extended Abstract).
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023
GPU Database Systems Characterization and Optimization.
Proc. VLDB Endow., November, 2023

Pushing ML Predictions Into DBMSs.
IEEE Trans. Knowl. Data Eng., October, 2023

A Deep Dive into Common Open Formats for Analytical DBMSs.
Proc. VLDB Endow., 2023

Optimizing Data Pipelines for Machine Learning in Feature Stores.
Proc. VLDB Endow., 2023

Revisiting Query Performance in GPU Database Systems.
CoRR, 2023


Optimizing Tensor Computations: From Applications to Compilation and Runtime Techniques.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

Unshackling Database Benchmarking from Synthetic Workloads.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Query Processing on Gaming Consoles.
Proceedings of the 19th International Workshop on Data Management on New Hardware, 2023

The Tensor Data Platform: Towards an AI-centric Database System.
Proceedings of the 13th Conference on Innovative Data Systems Research, 2023

2022
Data Science Through the Looking Glass: Analysis of Millions of GitHub Notebooks and ML.NET Pipelines.
SIGMOD Rec., 2022

Query Processing on Tensor Computation Runtimes.
Proc. VLDB Endow., 2022

Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem.
Proc. VLDB Endow., 2022

Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem.
CoRR, 2022

Data Debugging with Shapley Importance over End-to-End Machine Learning Pipelines.
CoRR, 2022

Deploying a Steered Query Optimizer in Production at Microsoft.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

End-to-end Optimization of Machine Learning Prediction Queries.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

2021
SEIZE: Runtime Inspection for Parallel Dataflow Systems.
IEEE Trans. Parallel Distributed Syst., 2021

Phoebe: A Learning-based Checkpoint Optimizer.
Proc. VLDB Endow., 2021

WindTunnel: Towards Differentiable ML Pipelines Beyond a Single Modele.
Proc. VLDB Endow., 2021

Tensors: An abstraction for general data processing.
Proc. VLDB Endow., 2021

Machine Learning for Cloud Data Systems: the Promise, the Progress, and the Path Forward.
Proc. VLDB Endow., 2021

Steering Query Optimizers: A Practical Take on Big Data Workloads.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Transforming ML Predictive Pipelines into SQL with MASQ.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Using Descriptions for Explaining Entity Matches (Discussion Paper).
Proceedings of the 29th Italian Symposium on Advanced Database Systems, 2021

2020
Explaining data with descriptions.
Inf. Syst., 2020

A Tensor Compiler for Unified Machine Learning Prediction Serving.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Building Continuous Integration Services for Machine Learning.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

SEIZE User Desired Moments: Runtime Inspection for Parallel Dataflow Systems.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

Extending Relational Query Processing with ML Inference.
Proceedings of the 10th Conference on Innovative Data Systems Research, 2020


2019
Data Science through the looking glass and what we found there.
CoRR, 2019

Making Classical Machine Learning Pipelines Differentiable: A Neural Translation Approach.
CoRR, 2019

Machine Learning at Microsoft with ML .NET.
CoRR, 2019


Coded Elastic Computing.
Proceedings of the IEEE International Symposium on Information Theory, 2019

Acorn: Aggressive Result Caching in Distributed Data Processing Frameworks.
Proceedings of the ACM Symposium on Cloud Computing, SoCC 2019, 2019

Understanding Data in the Blink of an Eye.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Adding data provenance support to Apache Spark.
VLDB J., 2018

A datalog-based computational model for coordination-free, data-parallel systems.
Theory Pract. Log. Program., 2018

Scaling-up reasoning and advanced analytics on BigData.
Theory Pract. Log. Program., 2018

From the Edge to the Cloud: Model Serving in ML.NET.
IEEE Data Eng. Bull., 2018

Supporting Data Provenance in Data-Intensive Scalable Computing Systems.
IEEE Data Eng. Bull., 2018

PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

Declarative Algorithms in Datalog with Extrema: Their Formal Semantics Simplified.
Proceedings of the Technical Communications of the 34th International Conference on Logic Programming, 2018

RIOS: Runtime Integrated Optimizer for Spark.
Proceedings of the ACM Symposium on Cloud Computing, 2018

Declarative BigData Algorithms via Aggregates and Relational Database Dependencies.
Proceedings of the 12th Alberto Mendelzon International Workshop on Foundations of Data Management, 2018

2017
Fixpoint semantics and optimization of recursive Datalog programs with aggregates.
Theory Pract. Log. Program., 2017

Apache REEF: Retainable Evaluator Execution Framework.
ACM Trans. Comput. Syst., 2017

Debugging Big Data Analytics in Spark with <i>BigDebug</i>.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Cleaning MapReduce Workflows.
Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017

Towards Accelerating Generic Machine Learning Prediction Pipelines.
Proceedings of the 2017 IEEE International Conference on Computer Design, 2017

Automated debugging in data-intensive scalable computing.
Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

2016
Combining user and database perspective for solving keyword queries over relational databases.
Inf. Syst., 2016

BigDebug: interactive debugger for big data analytics in Apache Spark.
Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2016

Big Data Analytics with Datalog Queries on Spark.
Proceedings of the 2016 International Conference on Management of Data, 2016

BigDebug: debugging primitives for interactive big data processing in spark.
Proceedings of the 38th International Conference on Software Engineering, 2016

Interactive Debugging for Big Data Analytics.
Proceedings of the 8th USENIX Workshop on Hot Topics in Cloud Computing, 2016

Optimizing Interactive Development of Data-Intensive Applications.
Proceedings of the Seventh ACM Symposium on Cloud Computing, 2016

Programming and Runtime Support to Blaze FPGA Accelerator Deployment at Datacenter Scale.
Proceedings of the Seventh ACM Symposium on Cloud Computing, 2016

The Magic of Pushing Extrema into Recursion: Simple, Powerful Datalog Programs.
Proceedings of the 10th Alberto Mendelzon International Workshop on Foundations of Data Management, 2016

2015
Titian: Data Provenance Support in Spark.
Proc. VLDB Endow., 2015

Proof positive and negative in data cleaning.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

On the CALM Principle for BSP Computation.
Proceedings of the 9th Alberto Mendelzon International Workshop on Foundations of Data Management, Lima, Peru, May 6, 2015

2014
On the CALM Principle for Bulk Synchronous Parallel Computation.
CoRR, 2014

Towards Declarative Imperative Data-parallel Systems.
Proceedings of the 22nd Italian Symposium on Advanced Database Systems, 2014

2013
QUEST: A Keyword Search System for Relational Data based on Semantic and Machine Learning Techniques.
Proc. VLDB Endow., 2013

Using a HMM based approach for mapping keyword queries into database terms.
Proceedings of the 21st Italian Symposium on Advanced Database Systems, 2013

Datalog in Time and Space, Synchronously.
Proceedings of the 7th Alberto Mendelzon International Workshop on Foundations of Data Management, 2013

2012
Enhancing Datalog with Epistemic Operators to Reason About Knowledge in Distributed Systems.
Proceedings of the Twentieth Italian Symposium on Advanced Database Systems, 2012

A Meta-language for MDX Queries in eLog Business Solution.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Knowlog: A Declarative Language for Reasoning about Knowledge in Distributed Systems.
Proceedings of the Conceptual Modeling, 2012

Reasoning about Knowledge in Distributed Systems Using Datalog.
Proceedings of the Datalog in Academia and Industry - Second International Workshop, 2012

2011
MediaPresenter, a Web Platform for Multimedia Content Management.
Proceedings of the Sistemi Evoluti per Basi di Dati, 2011

2010
Automatic Fault Behavior Detection and Modeling by a State-Based Specification Method.
Proceedings of the 12th IEEE High Assurance Systems Engineering Symposium, 2010


  Loading...