2024
ASA - The Adaptive Scheduling Algorithm.
CoRR, 2024
2021
Programming Abstractions for Managing Workflows on Tiered Storage Systems.
ACM Trans. Storage, 2021
Science Capsule - Capturing the Data Life Cycle.
J. Open Source Softw., 2021
Science Capsule: Towards Sharing and Reproducibility of Scientific Workflows.
Proceedings of the 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS), 2021
2020
Performance characterization of scientific workflows for the optimal use of Burst Buffers.
Future Gener. Comput. Syst., 2020
Characterizing Scientific Workflows on HPC Systems using Logs.
Proceedings of the IEEE/ACM Workflows in Support of Large-Scale Science, 2020
Towards Interactive, Reproducible Analytics at Scale on HPC Systems.
Proceedings of the IEEE/ACM HPC for Urgent Decision Making, UrgentHPC@SC 2020, Atlanta, GA, 2020
ASA - The Adaptive Scheduling Architecture.
Proceedings of the HPDC '20: The 29th International Symposium on High-Performance Parallel and Distributed Computing, 2020
2019
Data Jockey: Automatic Data Management for HPC Multi-tiered Storage Systems.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019
Understanding Data Similarity in Large-Scale Scientific Datasets.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019
Analysis and Prediction of Data Transfer Throughput for Data-Intensive Workloads.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019
2018
MaDaTS: Managing Data on Tiered Storage for Scientific Workflows.
J. Open Source Softw., 2018
Dac-Man: data change management for scientific datasets on HPC systems.
Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, 2018
Bringing Data Science to Qualitative Analysis.
Proceedings of the 14th IEEE International Conference on e-Science, 2018
2017
FRIEDA: Flexible Robust Intelligent Elastic Data Management Framework.
J. Open Source Softw., 2017
E-HPC: a library for elastic resource management in HPC environments.
Proceedings of the 12th Workshop on Workflows in Support of Large-Scale Science, 2017
Usability Heuristic Evaluation of Scientific Data Analysis and Visualization Tools.
Proceedings of the Advances in Usability and User Experience, 2017
2016
Tigres Workflow Library: Supporting Scientific Pipelines on HPC Systems.
Proceedings of the IEEE/ACM 16th International Symposium on Cluster, Cloud and Grid Computing, 2016
2014
Regenerating and Quantifying Quality of Benchmarking Data Using Static and Dynamic Provenance.
Proceedings of the Provenance and Annotation of Data and Processes, 2014
Provisioning, Placement and Pipelining Strategies for Data-Intensive Applications in Cloud Environments.
Proceedings of the 2014 IEEE International Conference on Cloud Engineering, 2014
Study in Usefulness of Middleware-Only Provenance.
Proceedings of the 10th IEEE International Conference on e-Science, 2014
Storage and Data Life Cycle Management in Cloud Environments with FRIEDA.
Proceedings of the Cloud Computing for Data-Intensive Applications, 2014
2013
Static compiler analysis for workflow provenance.
Proceedings of WORKS 2013: 8th Workshop On Workflows in Support of Large-Scale Science, 2013
Provenance from log files: a BigData problem.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013
2012
FRIEDA: Flexible Robust Intelligent Elastic Data Management in Cloud Environments.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012
Visualization of network data provenance.
Proceedings of the 19th International Conference on High Performance Computing, 2012
2011
Distributed Speculative Parallelization using Checkpoint Restart.
Proceedings of the International Conference on Computational Science, 2011