Sherif Sakr

Orcid: 0000-0002-2503-523X

Affiliations:
  • University of Tartu, Tartu, Estonia
  • University of New South Wales, Sydney, Australia


According to our database1, Sherif Sakr authored at least 170 papers between 2004 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
To tune or not to tune? An approach for recommending important hyperparameters for classification and clustering algorithms.
Future Gener. Comput. Syst., 2025

2024
AutoMLBench: A comprehensive experimental evaluation of automated machine learning frameworks.
Expert Syst. Appl., 2024

2022
D<sup>2</sup>IA: User-defined interval analytics on distributed streams.
Inf. Syst., 2022

AutoMLBench: A Comprehensive Experimental Evaluation of Automated Machine Learning Frameworks.
CoRR, 2022

TPE-AutoClust: A Tree-based Pipline Ensemble Framework for Automated Clustering.
Proceedings of the IEEE International Conference on Data Mining Workshops, 2022

cSmartML-Glassbox: Increasing Transparency and Controllability in Automated Clustering.
Proceedings of the IEEE International Conference on Data Mining Workshops, 2022

2021
SDDM: an interpretable statistical concept drift detection method for data streams.
J. Intell. Inf. Syst., 2021

Graph Generators: State of the Art and Open Challenges.
ACM Comput. Surv., 2021

To tune or not to tune? An Approach for Recommending Important Hyperparameters.
CoRR, 2021

DLBench: a comprehensive experimental evaluation of deep learning frameworks.
Clust. Comput., 2021

Interpretability in healthcare: A comparative study of local machine learning interpretability techniques.
Comput. Intell., 2021

The future is big graphs: a community view on graph processing systems.
Commun. ACM, 2021

Towards Automated Concept-based Decision TreeExplanations for CNNs.
Proceedings of the 24th International Conference on Extending Database Technology, 2021

The impact of Auto-Sklearn's Learning Settings: Meta-learning, Ensembling, Time Budget, and Search Space Size.
Proceedings of the Workshops of the EDBT/ICDT 2021 Joint Conference, 2021

cSmartML: A Meta Learning-Based Framework for Automated Selection and Hyperparameter Tuning for Clustering.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
The views, measurements and challenges of elasticity in the cloud: A review.
Comput. Commun., 2020

Benchmarking big data systems: A survey.
Comput. Commun., 2020

Towards making sense of Spark-SQL performance for processing vast distributed RDF datasets.
Proceedings of The International Workshop on Semantic Big Data, 2020

A First Step Towards a Streaming Linked Data Life-Cycle.
Proceedings of the Semantic Web - ISWC 2020, 2020

Comparing Schema Advancements for Distributed RDF Querying Using SparkSQL.
Proceedings of the ISWC 2020 Demos and Industry Tracks: From Novel Ideas to Industrial Practice co-located with 19th International Semantic Web Conference (ISWC 2020), 2020

On Teaching Web Stream Processing - Lessons Learned.
Proceedings of the Web Engineering - 20th International Conference, 2020

Process Mining over Unordered Event Streams.
Proceedings of the 2nd International Conference on Process Mining, 2020

D-SmartML: A Distributed Automated Machine Learning Framework.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

DISGD: A Distributed Shared-nothing Matrix Factorization for Large Scale Online Recommender Systems.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

Declarative Languages for Big Streaming Data.
Proceedings of the 23rd International Conference on Extending Database Technology, 2020

The Kaiju project: enabling event-driven observability.
Proceedings of the 14th ACM International Conference on Distributed and Event-based Systems, 2020

AutoDLCon: An Approach for Controlling the Automated Tuning for Deep Learning Networks.
Proceedings of the 6th IEEE International Conference on Big Data Computing Service and Applications, 2020

Big Data 2.0 Processing Systems - A Systems Overview, Second Edition
Springer, ISBN: 978-3-030-44186-9, 2020

2019
Native Distributed RDF Systems.
Proceedings of the Encyclopedia of Big Data Technologies., 2019

Framework-Based Scale-Out RDF Systems.
Proceedings of the Encyclopedia of Big Data Technologies., 2019

NoSQL Database Systems.
Proceedings of the Encyclopedia of Big Data Technologies., 2019

On the interpretability of machine learning-based model for predicting hypertension.
BMC Medical Informatics Decis. Mak., 2019

Editorial for Special issue of FGCS special issue on "Benchmarking big data systems".
Future Gener. Comput. Syst., 2019

Big Graph Processing Systems (Dagstuhl Seminar 19491).
Dagstuhl Reports, 2019

Automated Machine Learning: State-of-The-Art and Open Challenges.
CoRR, 2019

Early prediction of diabetes by applying data mining techniques.
Comput. Methods Programs Biomed., 2019

Correction to: Runtime self-monitoring approach of business process compliance in cloud environments.
Clust. Comput., 2019

Big SQL systems: an experimental evaluation.
Clust. Comput., 2019

Benchmarking Spark-SQL under Alliterative RDF Relational Storage Backends.
Proceedings of the QuWeDa 2019: 3rd Workshop on Querying and Benchmarking the Web of Data co-located with 18th International Semantic Web Conference (ISWC 2019), 2019

Bootstrapping the Publication of Linked Data Streams.
Proceedings of the ISWC 2019 Satellite Tracks (Posters & Demonstrations, 2019

A Decision Support Framework for AutoML Systems: A Meta-Learning Approach.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

Big Stream Processing Systems: An Experimental Evaluation.
Proceedings of the 35th IEEE International Conference on Data Engineering Workshops, 2019

MINARET: A Recommendation Framework for Scientific Reviewers.
Proceedings of the Advances in Database Technology, 2019

SmartML: A Meta Learning-Based Framework for Automated Selection and Hyperparameter Tuning for Machine Learning Algorithms.
Proceedings of the Advances in Database Technology, 2019

Adaptive Watermarks: A Concept Drift-based Approach for Predicting Event-Time Progress in Data Streams.
Proceedings of the Advances in Database Technology, 2019

Automated Machine Learning: Techniques and Frameworks.
Proceedings of the Big Data Management and Analytics - 9th European Summer School, 2019

An Outlook to Declarative Languages for Big Steaming Data.
Proceedings of the 13th ACM International Conference on Distributed and Event-based Systems, 2019

LDLCT An Instance-Based Framework for Lesion Detection on Lung CT Scans.
Proceedings of the 32nd IEEE International Symposium on Computer-Based Medical Systems, 2019

D ^2 2 IA: Stream Analytics on User-Defined Event Intervals.
Proceedings of the Advanced Information Systems Engineering, 2019

DLBench: An Experimental Evaluation of Deep Learning Frameworks.
Proceedings of the 2019 IEEE International Congress on Big Data, 2019

Calculation of Average Road Speed Based on Car-to-Car Messaging.
Proceedings of the IEEE International Conference on Big Data and Smart Computing, 2019

ILIME: Local and Global Interpretable Model-Agnostic Explainer of Black-Box Decision.
Proceedings of the Advances in Databases and Information Systems, 2019

2018
A Differentiated Caching Mechanism to Enable Primary Storage Deduplication in Clouds.
IEEE Trans. Parallel Distributed Syst., 2018

HDM: A Composable Framework for Big Data Processing.
IEEE Trans. Big Data, 2018

Dagstuhl Seminar on Big Stream Processing.
SIGMOD Rec., 2018

Stream Processing Languages in the Big Data Era.
SIGMOD Rec., 2018

RDF Data Storage and Query Processing Schemes: A Survey.
ACM Comput. Surv., 2018

Correction To: Large scale graph processing systems: survey and an experimental evaluation.
Clust. Comput., 2018

Big Data Systems Meet Machine Learning Challenges: Towards Big Data Science as a Service.
Big Data Res., 2018

Editorial for Special Issue of Journal of Big Data Research on "Big Medical/Healthcare Data Analytics".
Big Data Res., 2018

Business Process Analytics and Big Data Systems: A Roadmap to Bridge the Gap.
IEEE Access, 2018

Predictive Model for the Incidence of Hyperkalemia for Congestive Heart Failure Patients on Spironolactone.
Proceedings of the IEEE International Conference on Healthcare Informatics, 2018

HDM-MC in-Action: A Framework for Big Data Analytics across Multiple Clusters.
Proceedings of the 38th IEEE International Conference on Distributed Computing Systems, 2018

Linked Data - Storing, Querying, and Reasoning
Springer, ISBN: 978-3-319-73514-6, 2018

2017
On business process monitoring using cross-flow coordination.
Serv. Oriented Comput. Appl., 2017

Comparison of machine learning techniques to predict all-cause mortality using fitness data: the Henry ford exercIse testing (FIT) project.
BMC Medical Informatics Decis. Mak., 2017

Big Data Processing Stacks.
IT Prof., 2017

Big Stream Processing Systems (Dagstuhl Seminar 17441).
Dagstuhl Reports, 2017

Structural XML Query Processing.
ACM Comput. Surv., 2017

Big Data Systems Meet Machine Learning Challenges: Towards Big Data Science as a Service.
CoRR, 2017

HPDedup: A Hybrid Prioritized Data Deduplication Mechanism for Primary Storage in the Cloud.
CoRR, 2017

HDM: Optimized Big Data Processing with Data Provenance.
Proceedings of the 20th International Conference on Extending Database Technology, 2017

Towards Big Data Analytics across Multiple Clusters.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

Big Data Programming Models.
Proceedings of the Handbook of Big Data Technologies, 2017

Big Data Storage and Data Models.
Proceedings of the Handbook of Big Data Technologies, 2017

Non-native RDF Storage Engines.
Proceedings of the Handbook of Big Data Technologies, 2017

2016
Big Data 2.0 Processing Systems - A Survey
Springer Briefs in Computer Science, Springer, ISBN: 978-3-319-38776-5, 2016

Building Pipelines for Heterogeneous Execution Environments for Big Data Processing.
IEEE Softw., 2016

Network-based social coordination of business processes.
Inf. Syst., 2016

Big Data 2.0 Processing Systems: Taxonomy and Open Challenges.
J. Grid Comput., 2016

Towards a Comprehensive Data Analytics Framework for Smart Healthcare Services.
Big Data Res., 2016

DREAM in Action: A Distributed and Adaptive RDF System on the Cloud.
Proceedings of the 25th International Conference on World Wide Web, 2016

Large-Scale Graph Processing Using Apache Giraph
Springer, ISBN: 978-3-319-47431-1, 2016

Process Analytics - Concepts and Techniques for Querying and Analyzing Process Data
Springer, ISBN: 978-3-319-25037-3, 2016

2015
A Framework for Consumer-Centric SLA Management of Cloud-Hosted Databases.
IEEE Trans. Serv. Comput., 2015

DREAM: Distributed RDF Engine with Adaptive Query Planner and Minimal Communication.
Proc. VLDB Endow., 2015

How to Make Business Processes "Socialize"?
EAI Endorsed Trans. Ind. Networks Intell. Syst., 2015

Large scale graph processing systems: survey and an experimental evaluation.
Clust. Comput., 2015

Runtime self-monitoring approach of business process compliance in cloud environments.
Clust. Comput., 2015

Runtime detection of business process compliance violations: an approach based on anti patterns.
Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

Liquid Benchmarking: A Platform for Democratizing the Performance Evaluation Process.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

BP-MaaS: A Runtime Compliance-Monitoring System for Business Processes.
Proceedings of the BPM Demo Session 2015 Co-located with the 13th International Conference on Business Process Management (BPM 2015), 2015

Big Graph Processing Systems: State-of-the-Art and Open Challenges.
Proceedings of the First IEEE International Conference on Big Data Computing Service and Applications, 2015

Composable and efficient functional big data processing framework.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

A Framework of Enriching Business Processes Life-Cycle with Tagging Information.
Proceedings of the Databases Theory and Applications, 2015

2014
Hybrid query execution engine for large attributed graphs.
Inf. Syst., 2014

Cloud-hosted databases: technologies, challenges and opportunities.
Clust. Comput., 2014

Enterprise 2.0: Research Challenges and Opportunities.
Proceedings of the Web Information Systems and Technologies, 2014

Towards an Extensible Middleware for Database Benchmarking.
Proceedings of the Performance Characterization and Benchmarking. Traditional to Big Data, 2014

On Characterizing the Performance of Distributed Graph Computation Platforms.
Proceedings of the Performance Characterization and Benchmarking. Traditional to Big Data, 2014

CDPort: A Framework of Data Portability in Cloud Platforms.
Proceedings of the 16th International Conference on Information Integration and Web-based Applications & Services, 2014

SUPER: Social-Based Business Process Management Framework.
Proceedings of the Service-Oriented Computing - ICSOC 2014 Workshops, 2014

Cloud Data Management
Springer, ISBN: 978-3-319-04765-2, 2014

The Family of Map-Reduce.
Proceedings of the Large-Scale Data Analytics, 2014

An Overview of the NoSQL World.
Proceedings of the Large Scale and Big Data - Processing and Management., 2014

An Overview of Large-Scale Stream Processing Engines.
Proceedings of the Large Scale and Big Data - Processing and Management., 2014

CloudDB AutoAdmin: A Consumer-Centric Framework for SLA Management of Virtualized Database Servers.
Proceedings of the Large Scale and Big Data - Processing and Management., 2014

MapReduce Family of Large-Scale Data-Processing Systems.
Proceedings of the Large Scale and Big Data - Processing and Management., 2014

2013
The family of mapreduce and large-scale data processing systems.
ACM Comput. Surv., 2013

Modeling performance of a parallel streaming engine: bridging theory and costs.
Proceedings of the ACM/SPEC International Conference on Performance Engineering, 2013

Towards Comprehensive Measurement of Consistency Guarantees for Cloud-Hosted Data Storage Services.
Proceedings of the Performance Characterization and Benchmarking, 2013

Is Your Cloud-Hosted Database Truly Elastic?
Proceedings of the IEEE Ninth World Congress on Services, 2013

D-SPARQ: Distributed, Scalable and Efficient RDF Query Engine.
Proceedings of the ISWC 2013 Posters & Demonstrations Track, 2013

Availability analysis for deployment of in-cloud applications.
Proceedings of the 4th international ACM Sigsoft symposium on Architecting critical systems, 2013

Consumer-centric SLA manager for cloud-hosted databases.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Incorporating Uncertainty into In-Cloud Application Deployment Decisions for Availability.
Proceedings of the 2013 IEEE Sixth International Conference on Cloud Computing, Santa Clara, CA, USA, June 28, 2013

Improving Availability of Cloud-Based Applications through Deployment Choices.
Proceedings of the 2013 IEEE Sixth International Conference on Cloud Computing, Santa Clara, CA, USA, June 28, 2013

2012
A decade of database conferences: a look inside the program committees.
Scientometrics, 2012

On understanding the economics and elasticity challenges of deploying business applications on public cloud infrastructure.
J. Internet Serv. Appl., 2012

AdaptRDF: adaptive storage management for RDF databases.
Int. J. Web Inf. Syst., 2012

Extending SPARQL to Support Entity Grouping and Path Queries
CoRR, 2012

On efficient processing of BPMN-Q queries.
Comput. Ind., 2012

Trade-Off Analysis of Elasticity Approaches for Cloud-Based Business Applications.
Proceedings of the Web Information Systems Engineering - WISE 2012, 2012

An architecture framework for application-managed scaling of cloud-hosted relational databases.
Proceedings of the 2012 Joint Working IEEE/IFIP Conference on Software Architecture and European Conference on Software Architecture, 2012

Application-Managed Database Replication on Virtualized Cloud Environments.
Proceedings of the Workshops Proceedings of the IEEE 28th International Conference on Data Engineering, 2012

G-SPARQL: a hybrid engine for querying large attributed graphs.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Querying Process Models Repositories by Aggregated Graph Search.
Proceedings of the Business Process Management Workshops, 2012

Application-Managed Replication Controller for Cloud-Hosted Databases.
Proceedings of the 2012 IEEE Fifth International Conference on Cloud Computing, 2012

SLA-Based and Consumer-centric Dynamic Provisioning for Cloud Databases.
Proceedings of the 2012 IEEE Fifth International Conference on Cloud Computing, 2012

2011
A decade of database research publications: a look inside.
Scientometrics, 2011

Partial process models to manage business process variants.
Int. J. Bus. Process. Integr. Manag., 2011

A Decade of Database Research Publications
CoRR, 2011

A Survey of Large Scale Data Management Approaches in Cloud Environments.
IEEE Commun. Surv. Tutorials, 2011

Liquid benchmarks: benchmarking-as-a-service.
Proceedings of the 2011 Joint International Conference on Digital Libraries, 2011

CloudDB AutoAdmin: Towards a Truly Elastic Cloud-Based Data Store.
Proceedings of the IEEE International Conference on Web Services, 2011

Preface.
Proceedings of the Workshops Proceedings of the 27th International Conference on Data Engineering, 2011

Design by Selection: A Reuse-Based Approach for Business Process Modeling.
Proceedings of the Conceptual Modeling - ER 2011, 30th International Conference, 2011

BPModelMasher: Manage Your Process Variants Effectively.
Proceedings of the Demo Track of the Nineth Conference on Business Process Management 2011, 2011

A Query Language for Analyzing Business Processes Execution.
Proceedings of the Business Process Management - 9th International Conference, 2011

One Size Does Not Fit All: A Group-Based Service Selection for Web-Based Business Processes.
Proceedings of the 25th IEEE International Conference on Advanced Information Networking and Applications Workshops, 2011

MyDeepWeb: An Integration Service for Your OWN Deep Web Data.
Proceedings of the IEEE International Conference on Services Computing, 2011

An Overview of Graph Indexing and Querying Techniques.
Proceedings of the Graph Data Management: Techniques and Applications., 2011

2010
Efficient Relational Techniques for Processing Graph Queries.
J. Comput. Sci. Technol., 2010

Graph indexing and querying: a review.
Int. J. Web Inf. Syst., 2010

Towards a comprehensive assessment for selectivity estimation approaches of XML queries.
Int. J. Web Eng. Technol., 2010

A framework for querying graph-based business process models.
Proceedings of the 19th International Conference on World Wide Web, 2010

Efficient and Adaptable Query Workload-Aware Management for RDF Data.
Proceedings of the Web Information Systems Engineering - WISE 2010, 2010

Liquid Benchmarks: Towards an Online Platform for Collaborative Assessment of Computer Science Research Results.
Proceedings of the Performance Evaluation, Measurement and Characterization of Complex Systems, 2010

An efficient features-based processing technique for supergraph queries.
Proceedings of the Fourteenth International Database Engineering and Applications Symposium (IDEAS 2010), 2010

GDM2010 Workshop Organizers' Message.
Proceedings of the Database Systems for Advanced Applications, 2010

An Experimental Evaluation of Relational RDF Storage and Querying Techniques.
Proceedings of the Database Systems for Advanced Applications, 2010

Querying Graph-Based Repositories of Business Process Models.
Proceedings of the Database Systems for Advanced Applications, 2010

On Maintaining Consistency of Process Model Variants.
Proceedings of the Business Process Management Workshops, 2010

2009
Relational processing of RDF queries: a survey.
SIGMOD Rec., 2009

Cardinality-Aware Purely Relational XQuery Processor.
J. Database Manag., 2009

XML compression techniques: A survey and comparison.
J. Comput. Syst. Sci., 2009

Storing and Querying Graph Data Using Efficient Relational Processing Techniques.
Proceedings of the Information Systems: Modeling, 2009

FeedRank: A Semantic-Based Management System of Web Feeds.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2009

An Empirical Evaluation of XML Compression Tools.
Proceedings of the Database Systems for Advanced Applications, 2009

GraphREL: A Decomposition-Based and Selectivity-Aware Relational Framework for Processing Sub-graph Queries.
Proceedings of the Database Systems for Advanced Applications, 2009

2008
Dependable cardinality forecasts for XQuery.
Proc. VLDB Endow., 2008

Algebra-based XQuery cardinality estimation.
Int. J. Web Inf. Syst., 2008

An Experimental Investigation of XML Compression Tools
CoRR, 2008

Improving the Relational Evaluation of XML Queries by Means of Path Summaries.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2008

XML Tree Structure Compression.
Proceedings of the 19th International Workshop on Database and Expert Systems Applications (DEXA 2008), 2008

XSelMark: A Micro-benchmark for Selectivity Estimation Approaches of XML Queries.
Proceedings of the Database and Expert Systems Applications, 19th International Conference, 2008

2007
Cardinality-aware and purely relational implementation of an XQuery processor.
PhD thesis, 2007

A SQL: 1999 code generator for the pathfinder xquery compiler.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

2004
XQuery on SQL Hosts.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

Towards a Framework for Mapping Between UML/OCL and XML/XQuery.
Proceedings of the «UML» 2004, 2004


  Loading...