Theodoros Rekatsinas

Orcid: 0000-0001-6148-1854

According to our database1, Theodoros Rekatsinas authored at least 58 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
TSDS: Data Selection for Task-Specific Model Finetuning.
CoRR, 2024

Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Simple Adaptive Query Processing vs. Learned Query Optimizers: Observations and Analysis.
Proc. VLDB Endow., 2023

High-Throughput Vector Similarity Search in Knowledge Graphs.
Proc. ACM Manag. Data, 2023

Fact Ranking over Large-Scale Knowledge Graphs with Reasoning Embedding Models.
IEEE Data Eng. Bull., 2023

Open Domain Knowledge Extraction for Knowledge Graphs.
CoRR, 2023

Growing and Serving Large Open-domain Knowledge Graphs.
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

Co-design Hardware and Algorithm for Vector Search.
Proceedings of the International Conference for High Performance Computing, 2023

EnergAt: Fine-Grained Energy Attribution for Multi-Tenancy.
Proceedings of the 2nd Workshop on Sustainable Computer Systems, 2023

MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022
Picket: guarding against corrupted data in tabular data during learning and inference.
VLDB J., 2022

Machine Learning and Data Cleaning: Which Serves the Other?
ACM J. Data Inf. Qual., 2022

Marius++: Large-Scale Training of Graph Neural Networks on a Single Machine.
CoRR, 2022

Saga: A Platform for Continuous Construction and Serving of Knowledge at Scale.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Can Transfer Learning be used to build a Query Optimizer?
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022

2021
Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins.
Proc. VLDB Endow., 2021

Demonstration of Marius: Graph Embeddings with a Single Machine.
Proc. VLDB Endow., 2021

Learning Massive Graph Embeddings on a Single Machine.
CoRR, 2021

Marius: Learning Massive Graph Embeddings on a Single Machine.
Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

On Robust Mean Estimation under Coordinate-level Corruption.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Principal Component Networks: Parameter Reduction Early in Training.
CoRR, 2020

Record fusion: A learning approach.
CoRR, 2020

Picket: Self-supervised Data Diagnostics for ML Pipelines.
CoRR, 2020

An Empirical Analysis of the Impact of Data Augmentation on Knowledge Distillation.
CoRR, 2020

Robust Mean Estimation under Coordinate-level Corruption.
CoRR, 2020

A Statistical Perspective on Discovering Functional Dependencies in Noisy Data.
Proceedings of the 2020 International Conference on Management of Data, 2020

Attention-based Learning for Missing Data Imputation in HoloClean.
Proceedings of the Third Conference on Machine Learning and Systems, 2020

Data-Dependent Differentially Private Parameter Learning for Directed Graphical Models.
Proceedings of the 37th International Conference on Machine Learning, 2020

Unsupervised Relation Extraction from Language Models using Constrained Cloze Completion.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
Opportunities for Data Management Research in the Era of Horizontal AI/ML.
Proc. VLDB Endow., 2019

Fine-Grained Object Detection over Scientific Document Images with Region Embeddings.
CoRR, 2019

Learning Functional Dependencies with Sparse Regression.
CoRR, 2019

SysML: The New Frontier of Machine Learning Systems.
CoRR, 2019

Approximate Inference in Structured Instances with Noisy Categorical Observations.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

HoloDetect: Few-Shot Learning for Error Detection.
Proceedings of the 2019 International Conference on Management of Data, 2019

A Formal Framework for Probabilistic Unclean Databases.
Proceedings of the 22nd International Conference on Database Theory, 2019

CRUX: Adaptive Querying for Efficient Crowdsourced Data Extraction.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Data Integration and Machine Learning: A Natural Synergy.
Proc. VLDB Endow., 2018

Deep Learning for Entity Matching: A Design Space Exploration.
Proceedings of the 2018 International Conference on Management of Data, 2018

Fonduer: Knowledge Base Construction from Richly Formatted Data.
Proceedings of the 2018 International Conference on Management of Data, 2018

2017
Forecasting rare disease outbreaks from open source indicators.
Stat. Anal. Data Min., 2017

HoloClean: Holistic Data Repairs with Probabilistic Inference.
Proc. VLDB Endow., 2017

SLiMFast: Guaranteed Results for Data Fusion and Source Reliability.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

2016
SourceSight: Enabling Effective Source Selection.
Proceedings of the 2016 International Conference on Management of Data, 2016

2015
Quality-Aware Data Source Management.
PhD thesis, 2015

CrowdGather: Entity Extraction over Structured Domains.
CoRR, 2015

Exploiting Features for Data Source Quality Estimation.
CoRR, 2015

StoryPivot: Comparing and Contrasting Story Evolution.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

SourceSeer: Forecasting Rare Disease Outbreaks Using Multiple Data Sources.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

HawkesTopic: A Joint Model for Network Inference and Topic Modeling from Text-Based Cascades.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Finding Quality in Quantity: The Challenge of Discovering Valuable Sources for Integration.
Proceedings of the Seventh Biennial Conference on Innovative Data Systems Research, 2015

2014
Characterizing and selecting fresh data sources.
Proceedings of the International Conference on Management of Data, 2014

2013
A SPARSI: Partitioning Sensitive Data amongst Multiple Adversaries.
Proc. VLDB Endow., 2013

Multi-relational Learning Using Weighted Tensor Decomposition with Modular Loss
CoRR, 2013

On Sharing Private Data with Multiple Non-Colluding Adversaries
CoRR, 2013

2012
Local structure and determinism in probabilistic databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

2010
Fuzzy rule based neuro-dynamic programming for mobile robot skill acquisition on the basis of a nested multi-agent architecture.
Proceedings of the 2010 IEEE International Conference on Robotics and Biomimetics, 2010


  Loading...