Saravanan Thirumuruganathan

Orcid: 0000-0002-1517-480X

According to our database1, Saravanan Thirumuruganathan authored at least 95 papers between 2011 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Pythia: A Neural Model for Data Prefetching.
Proceedings of the Proceedings 28th International Conference on Extending Database Technology, 2025

2024
Detecting and Mitigating Sampling Bias in Cybersecurity with Unlabeled Data.
Proceedings of the 33rd USENIX Security Symposium, 2024

Relational Data Imputation with Graph Neural Networks.
Proceedings of the Proceedings 27th International Conference on Extending Database Technology, 2024

2023
Will they take this offer? A machine learning price elasticity model for predicting upselling acceptance of premium airline seating.
Inf. Manag., April, 2023

Fairness and Bias in Truth Discovery Algorithms: An Experimental Analysis.
CoRR, 2023

The Art of Losing to Win: Using Lossy Image Compression to Improve Data Loading in Deep Learning Pipelines.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

2022
Automated Annotations for AI Data and Model Transparency.
ACM J. Data Inf. Qual., 2022

Editorial: Special Issue on Deep Learning for Data Quality.
ACM J. Data Inf. Qual., 2022

Fair active learning.
Expert Syst. Appl., 2022

Big Data, Small Personas: How Algorithms Shape the Demographic Representation of Data-Driven User Segments.
Big Data, 2022

SIRAJ: A Unified Framework for Aggregation of Malicious Entity Detectors.
Proceedings of the 43rd IEEE Symposium on Security and Privacy, 2022

Prediction Intervals for Learned Cardinality Estimation: An Experimental Evaluation.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Accelerating Entity Lookups in Knowledge Graphs Through Embeddings.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

2021
Deep Learning for Blocking in Entity Matching: A Design Space Exploration.
Proc. VLDB Endow., 2021

Forecasting the nearly unforecastable: why aren't airline bookings adhering to the prediction algorithm?
Electron. Commer. Res., 2021

Scalable signal reconstruction for a broad range of applications.
Commun. ACM, 2021

Random Walks on Huge Graphs at Cache Efficiency.
Proceedings of the SOSP '21: ACM SIGOPS 28th Symposium on Operating Systems Principles, 2021

To Intervene or Not To Intervene: Cost based Intervention for Combating Fake News.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Shahin: Faster Algorithms for Generating Explanations for Multiple Predictions.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

EmbDI: Generating Embeddings for Relational Data Integration (Discussion Paper).
Proceedings of the 29th Italian Symposium on Advanced Database Systems, 2021

2020
Scalable algorithms for signal reconstruction by leveraging similarity joins.
VLDB J., 2020

Making AI Machines Work for Humans in FoW.
SIGMOD Rec., 2020

Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning.
Proc. VLDB Endow., 2020

Orca-SR: A Real-Time Traffic Engineering Framework leveraging Similarity Joins.
Proc. VLDB Endow., 2020

ZeroER: Entity Resolution using Zero Labeled Examples.
Proceedings of the 2020 International Conference on Management of Data, 2020

Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries.
Proceedings of the 2020 International Conference on Management of Data, 2020

Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks.
Proceedings of the 2020 International Conference on Management of Data, 2020

Approximate Query Processing for Data Exploration using Deep Generative Models.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Data Curation with Deep Learning.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

2019
Optimized group formation for solving collaborative tasks.
VLDB J., 2019

Efficient Signal Reconstruction for a Broad Range of Applications.
SIGMOD Rec., 2019

Combating Fake News: A Data Management and Mining Perspective.
Proc. VLDB Endow., 2019

ApproxML: Efficient Approximate Ad-Hoc ML Models Through Materialization and Reuse.
Proc. VLDB Endow., 2019

Platform Design for Crowdsourcing and Future of Work.
IEEE Data Eng. Bull., 2019

On Benchmarking for Crowdsourcing and Future of Work Platforms.
IEEE Data Eng. Bull., 2019

Local Embeddings for Relational Data Integration.
CoRR, 2019

AutoER: Automated Entity Resolution using Generative Modelling.
CoRR, 2019

Are Outlier Detection Methods Resilient to Sampling?
CoRR, 2019

Approximate Query Processing using Deep Generative Models.
CoRR, 2019

Multi-Attribute Selectivity Estimation Using Deep Learning.
CoRR, 2019

A Human-in-the-loop Attribute Design Framework for Classification.
Proceedings of the World Wide Web Conference, 2019

Explaining Entity Resolution Predictions: Where are we and What needs to be done?
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2019

EXPLAINER: Entity Resolution Explanations.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

TagSniff: Simplified Big Data Debugging for Dataflow Jobs.
Proceedings of the ACM Symposium on Cloud Computing, SoCC 2019, 2019

2018
Human Factors Modeling in Crowdsourcing.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Structure Analytics in Social Media.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Efficient Construction of Approximate Ad-Hoc ML models Through Materialization and Reuse.
Proc. VLDB Endow., 2018

Distributed Representations of Tuples for Entity Resolution.
Proc. VLDB Endow., 2018

Discovery of Genuine Functional Dependencies from Relational Data with Missing Values.
Proc. VLDB Endow., 2018

Leveraging Similarity Joins for Signal Reconstruction.
Proc. VLDB Endow., 2018

RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! -.
Proc. VLDB Endow., 2018

Malware in the future? Forecasting of analyst detection of cyber events.
J. Cybersecur., 2018

Reuse and Adaptation for Entity Resolution through Transfer Learning.
CoRR, 2018

Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation.
CoRR, 2018

Robust Road Map Inference through Network Alignment of Trajectories.
Proceedings of the 2018 SIAM International Conference on Data Mining, 2018

Road Network Fusion for Incremental Map Updates.
Proceedings of the Progress in Location Based Services 2018, 2018

An Empirical Study of Questionnaires for the Diagnosis of Pediatric Obstructive Sleep Apnea.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

2017
DeepER - Deep Entity Resolution.
CoRR, 2017

Kharita: Robust Map Inference using Graph Spanners.
CoRR, 2017

Malware in the Future? Forecasting Analyst Detection of Cyber Events.
CoRR, 2017

UGuide: User-Guided Discovery of FD-Detectable Errors.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

A Cost-based Optimizer for Gradient Descent Optimization.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Density Based Clustering over Location Based Services.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

2016
Collaborative Crowdsourcing with Crowd4U.
Proc. VLDB Endow., 2016

Discovering the Skyline of Web Databases.
Proc. VLDB Endow., 2016

HDBSCAN: Density based Clustering over Location Based Services.
CoRR, 2016

ANALOC: Efficient analytics over Location Based Services.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

2015
Task assignment optimization in knowledge-intensive crowdsourcing.
VLDB J., 2015

Worker Skill Estimation in Team-Based Tasks.
Proc. VLDB Endow., 2015

Privacy Implications of Database Ranking.
Proc. VLDB Endow., 2015

Walk, Not Wait: Faster Sampling Over Online Social Networks.
Proc. VLDB Endow., 2015

Aggregate Estimations over Location Based Services.
Proc. VLDB Endow., 2015

Hidden Database Research and Analytics (HYDRA) System .
IEEE Data Eng. Bull., 2015

"The Whole Is Greater Than the Sum of Its Parts": Optimization in Collaborative Crowdsourcing.
CoRR, 2015

Querying Hidden Attributes in an Online Community Network.
Proceedings of the 12th IEEE International Conference on Mobile Ad Hoc and Sensor Systems, 2015

Answering Complex Queries in an Online Community Network.
Proceedings of the Ninth International Conference on Web and Social Media, 2015

Task Assignment Optimization in Collaborative Crowdsourcing.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

2014
An expressive framework and efficient algorithms for the analysis of collaborative tagging.
VLDB J., 2014

Beyond Itemsets: Mining Frequent Featuresets over Structured Items.
Proc. VLDB Endow., 2014

Aggregate Estimation Over Dynamic Hidden Web Databases.
Proc. VLDB Endow., 2014

HDBTracker: Monitoring the Aggregates On Dynamic Hidden Web Databases.
Proc. VLDB Endow., 2014

Optimization in Knowledge-Intensive Crowdsourcing.
CoRR, 2014

Rank-Based Inference over Web Databases.
CoRR, 2014

Aggregate estimation over a microblog platform.
Proceedings of the International Conference on Management of Data, 2014

Query Hidden Attributes in Social Networks.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Exploiting group recommendation functions for flexible preferences.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Ranking item features by mining online user-item interactions.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Price trade-offs in social media advertising.
Proceedings of the second ACM conference on Online social networks, 2014

2013
Rank Discovery From Web Databases.
Proc. VLDB Endow., 2013

Breaking the top-k barrier of hidden web databases?
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Crowds, not Drones: Modeling Human Factors in Interactive Crowdsourcing.
Proceedings of the First VLDB Workshop on Databases and Crowdsourcing, 2013

2012
MapRat: Meaningful Explanation, Interactive Exploration and Geo-Visualization of Collaborative Ratings.
Proc. VLDB Endow., 2012

Who Tags What? An Analysis Framework.
Proc. VLDB Endow., 2012

Digging Deeper into Deep Web Databases by Breaking Through the Top-k Barrier
CoRR, 2012

2011
Building Bayesian Network based expert systems from rules.
Proceedings of the IEEE International Conference on Systems, 2011


  Loading...