2024
QoS-pro: A QoS-enhanced Transaction Processing Framework for Shared SSDs.
ACM Trans. Archit. Code Optim., March, 2024
I/O Access Patterns in HPC Applications: A 360-Degree Survey.
ACM Comput. Surv., February, 2024
IEEE Internet Comput., 2024
StreamBox: A Lightweight GPU SandBox for Serverless Inference Workflow.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
Toward Stream Processing Elasticity in Realistic Geo-Distributed Environments.
Proceedings of the IEEE International Conference on Cloud Engineering, 2024
2023
Guest Editorial: Interplay Between Machine Learning and Networking Systems.
IEEE Netw., 2023
QoS-Aware and Cost-Efficient Dynamic Resource Allocation for Serverless ML Workflows.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Quantifying the Performance of Conflict-free Replicated Data Types in InterPlanetary File System.
Proceedings of the 4th International Workshop on Distributed Infrastructure for the Common Good, 2023
2022
Taming System Dynamics on Resource Optimization for Data Processing Workflows: A Probabilistic Approach.
IEEE Trans. Parallel Distributed Syst., 2022
Shadow: Exploiting the Power of Choice for Efficient Shuffling in MapReduce.
IEEE Trans. Big Data, 2022
Container-aware I/O stack: bridging the gap between container storage drivers and solid state devices.
Proceedings of the VEE '22: 18th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2022
Understanding the performance of erasure codes in hadoop distributed file system.
Proceedings of the CHEOPS@EuroSys 2022: Proceedings of the Workshop on Challenges and Opportunities of Efficient and Performant Storage Systems, 2022
PGPregel: an end-to-end system for privacy-preserving graph processing in geo-distributed data centers.
Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022
Stragglers' Detection in Big Data Analytic Systems: The Impact of Heartbeat Arrival.
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022
2021
Gear: Enable Efficient Container Storage and Deployment with a New Image Format.
Proceedings of the 41st IEEE International Conference on Distributed Computing Systems, 2021
2020
Cost-Aware Partitioning for Efficient Large Graph Processing in Geo-Distributed Datacenters.
IEEE Trans. Parallel Distributed Syst., 2020
Rethinking Operators Placement of Stream Data Application in the Edge.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020
2019
A New Framework for Evaluating Straggler Detection Mechanisms in MapReduce.
ACM Trans. Model. Perform. Evaluation Comput. Syst., 2019
Is it Time to Revisit Erasure Coding in Data-Intensive Clusters?
Proceedings of the 27th IEEE International Symposium on Modeling, 2019
NCQ-Aware I/O Scheduling for Conventional Solid State Drives.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019
Incorporating Probabilistic Optimizations for Resource Provisioning of Data Processing Workflows.
Proceedings of the 48th International Conference on Parallel Processing, 2019
When FPGA-Accelerator Meets Stream Data Processing in the Edge.
Proceedings of the 39th IEEE International Conference on Distributed Computing Systems, 2019
On the Importance of Container Image Placement for Service Provisioning in the Edge.
Proceedings of the 28th International Conference on Computer Communication and Networks, 2019
2018
Improving the Effectiveness of Burst Buffers for Big Data Processing in HPC Systems with Eley.
Future Gener. Comput. Syst., 2018
On the Performance of Spark on HPC Systems: Towards a Complete Picture.
Proceedings of the Supercomputing Frontiers - 4th Asian Conference, 2018
Introduction to CEBDA 2018.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018
Energy-Efficient Speculative Execution using Advanced Reservation for Heterogeneous Clusters.
Proceedings of the 47th International Conference on Parallel Processing, 2018
Dual-Paradigm Stream Processing.
Proceedings of the 47th International Conference on Parallel Processing, 2018
TurboStream: Towards Low-Latency Data Stream Processing.
Proceedings of the 38th IEEE International Conference on Distributed Computing Systems, 2018
Nitro: Network-Aware Virtual Machine Image Management in Geo-Distributed Clouds.
Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018
2017
Enabling fast failure recovery in shared Hadoop clusters: Towards failure-aware scheduling.
Future Gener. Comput. Syst., 2017
On Achieving Efficient Data Transfer for Graph Processing in Geo-Distributed Datacenters.
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017
Characterizing Performance and Energy-Efficiency of the RAMCloud Storage System.
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017
Energy-Driven Straggler Mitigation in MapReduce.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017
Eley: On the Effectiveness of Burst Buffers for Big Data Processing in HPC Systems.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
An Empirical Evaluation of How The Network Impacts The Performance and Energy Efficiency in RAMCloud.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017
2016
Fault Tolerance in MapReduce: A Survey.
Proceedings of the Resource Management for Big Data Platforms, 2016
Using Formal Grammars to Predict I/O Behaviors in HPC: The Omnisc'IO Approach.
IEEE Trans. Parallel Distributed Syst., 2016
Damaris: Addressing Performance Variability in Data Management for Post-Petascale Simulations.
ACM Trans. Parallel Comput., 2016
Governing energy consumption in Hadoop through CPU frequency scaling: An analysis.
Future Gener. Comput. Syst., 2016
On the energy footprint of I/O management in Exascale HPC systems.
Future Gener. Comput. Syst., 2016
<i>iShare</i>: Balancing I/O performance isolation and disk I/O efficiency in virtualized environments.
Concurr. Comput. Pract. Exp., 2016
On the usability of shortest remaining time first policy in shared Hadoop clusters.
Proceedings of the 31st Annual ACM Symposium on Applied Computing, 2016
On the Root Causes of Cross-Application I/O Interference in HPC Storage Systems.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016
2015
Spatial Locality Aware Disk Scheduling in Virtualized Environment.
IEEE Trans. Parallel Distributed Syst., 2015
Inaccuracy in Private BitTorrent Measurements.
Int. J. Parallel Program., 2015
Exploring Energy-Consistency Trade-Offs in Cassandra Cloud Storage System.
Proceedings of the 27th International Symposium on Computer Architecture and High Performance Computing, 2015
An Eye on the Elephant in the Wild: A Performance Evaluation of Hadoop's Schedulers Under Failures.
Proceedings of the Adaptive Resource Management and Scheduling for Cloud Computing, 2015
On Understanding the Energy Impact of Speculative Execution in Hadoop.
Proceedings of the IEEE International Conference on Data Science and Data Intensive Systems, 2015
Energy-Aware Massively Distributed Cloud Facilities: The DISCOVERY Initiative.
Proceedings of the IEEE International Conference on Data Science and Data Intensive Systems, 2015
Chronos: Failure-aware scheduling in shared Hadoop clusters.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015
2014
A Taxonomy and Survey on eScience as a Service in the Cloud.
CoRR, 2014
Omnisc'IO: A Grammar-Based Approach to Spatial and Temporal I/O Patterns Prediction.
Proceedings of the International Conference for High Performance Computing, 2014
Towards Efficient Power Management in MapReduce: Investigation of CPU-Frequencies Scaling on Power Efficiency in Hadoop.
Proceedings of the Adaptive Resource Management and Scheduling for Cloud Computing, 2014
CALCioM: Mitigating I/O Interference in HPC Systems through Cross-Application Coordination.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
A performance and energy analysis of I/O management approaches for exascale systems.
Proceedings of the DIDC'14, 2014
Consistency Management in Cloud Storage Systems.
Proceedings of the Large Scale and Big Data - Processing and Management., 2014
2013
Petri net based Grid workflow verification and optimization.
J. Supercomput., 2013
Handling partitioning skew in MapReduce using LEEN.
Peer-to-Peer Netw. Appl., 2013
Flubber: Two-level disk scheduling in virtualized environment.
Future Gener. Comput. Syst., 2013
Exploiting Spatial Locality to Improve Disk Efficiency in Virtualized Environments.
Proceedings of the 2013 IEEE 21st International Symposium on Modelling, 2013
Consistency in the Cloud: When Money Does Matter!
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013
2012
Harmony: Towards Automated Self-Adaptive Consistency in Cloud Storage.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012
Efficient Disk I/O Scheduling with QoS Guarantee for Xen-based Hosting Platforms.
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
Maestro: Replica-Aware Map Scheduling for MapReduce.
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
2011
Adaptive Disk I/O Scheduling for MapReduce in Virtualized Environment.
Proceedings of the International Conference on Parallel Processing, 2011
Towards Pay-As-You-Consume Cloud Computing.
Proceedings of the IEEE International Conference on Services Computing, 2011
2010
Tools and Technologies for Building Clouds.
Proceedings of the Cloud Computing, Principles, Systems and Applications, 2010
MR-scope: a real-time tracing tool for MapReduce.
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010
LEEN: Locality/Fairness-Aware Key Partitioning for MapReduce in the Cloud.
Proceedings of the Cloud Computing, Second International Conference, 2010
Cloud Types and Services.
Proceedings of the Handbook of Cloud Computing., 2010
2009
CLOUDLET: towards mapreduce implementation on virtual machines.
Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, 2009
Evaluating MapReduce on Virtual Machines: The Hadoop Case.
Proceedings of the Cloud Computing, First International Conference, CloudCom 2009, Beijing, 2009
2005
A Proposal of Next Generation Grid-Operating System.
Proceedings of The 2005 International Conference on Grid Computing and Applications, 2005