2025

Towards Workload-aware Cloud Efficiency: A Large-scale Empirical Study of Cloud Workload Characteristics.

[DOI]

Anjaly Parayil

Jue Zhang

Proceedings of the 16th ACM/SPEC International Conference on Performance Engineering, 2025

2024

Towards Cloud Efficiency with Large-scale Workload Characterization.

[DOI]

CoRR, 2024

Workload Intelligence: Punching Holes Through the Cloud Abstraction.

[DOI]

CoRR, 2024

Fast and Accurate DNN Performance Estimation across Diverse Hardware Platforms.

[DOI]

Vishwas Vasudeva Kakrannaya

Siddhartha Balakrishna Rai

Anand Sivasubramaniam

Timothy Zhu

Proceedings of the 32nd International Conference on Modeling, 2024

TraceUpscaler: Upscaling Traces to Evaluate Systems at High Load.

[DOI]

Proceedings of the Nineteenth European Conference on Computer Systems, 2024

AutoBurst: Autoscaling Burstable Instances for Cost-effective Latency SLOs.

[DOI]

Rubaba Hasan

Timothy Zhu

Bhuvan Urgaonkar

Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2023

SplitRPC: A {Control + Data} Path Splitting RPC Stack for ML Inference Serving.

[DOI]

Adithya Kumar

Anand Sivasubramaniam

Timothy Zhu

Proc. ACM Meas. Anal. Comput. Syst., 2023

Kerveros: Efficient and Scalable Cloud Admission Control.

[DOI]

Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

2022

Overflowing emerging neural network inference tasks from the GPU to the CPU on heterogeneous servers.

[DOI]

Adithya Kumar

Anand Sivasubramaniam

Timothy Zhu

Proceedings of the SYSTOR '22: The 15th ACM International Systems and Storage Conference, Haifa, Israel, June 13, 2022

Metastable Failures in the Wild.

[DOI]

Lexiang Huang

Matthew Magnusson

Abishek Bangalore Muralikrishna

Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

2021

Metastable failures in distributed systems.

[DOI]

Proceedings of the HotOS '21: Workshop on Hot Topics in Operating Systems, 2021

TraceSplitter: a new paradigm for downscaling traces.

[DOI]

Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

tprof: Performance profiling via structural aggregation and automated analysis of distributed systems traces.

[DOI]

Lexiang Huang

Timothy Zhu

Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021

2020

The Fast and The Frugal: Tail Latency Aware Provisioning for Coping with Load Variations.

[DOI]

Adithya Kumar

Iyswarya Narayanan

Timothy Zhu

Anand Sivasubramaniam

Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020

Peafowl: in-application CPU scheduling to reduce power consumption of in-memory key-value stores.

[DOI]

Proceedings of the SoCC '20: ACM Symposium on Cloud Computing, 2020

2019

BurScale: Using Burstable Instances for Cost-Effective Autoscaling in the Public Cloud.

[DOI]

Ataollah Fatahi Baarzi

Timothy Zhu

Bhuvan Urgaonkar

Proceedings of the ACM Symposium on Cloud Computing, SoCC 2019, 2019

2018

RobinHood: Tail Latency Aware Caching - Dynamic Reallocation from Cache-Rich to Cache-Poor.

[DOI]

Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

2017

WorkloadCompactor: reducing datacenter cost while providing tail latency SLO guarantees.

[DOI]

Timothy Zhu

Michael A. Kozuch

Mor Harchol-Balter

Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

2016

TetriSched: global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters.

[DOI]

Proceedings of the Eleventh European Conference on Computer Systems, 2016

SNC-Meister: Admitting More Tenants with Tail Latency SLOs.

[DOI]

Timothy Zhu

Daniel S. Berger

Mor Harchol-Balter

Proceedings of the Seventh ACM Symposium on Cloud Computing, 2016

2014

PriorityMeister: Tail Latency QoS for Shared Networked Storage.

[DOI]

Proceedings of the ACM Symposium on Cloud Computing, 2014

2013

IOFlow: a software-defined storage architecture.

[DOI]

Antony I. T. Rowstron

Tom Talpey

Richard Black

Timothy Zhu

Proceedings of the ACM SIGOPS 24th Symposium on Operating Systems Principles, 2013

2012

SOFTScale: Stealing Opportunistically for Transient Scaling.

[DOI]

Proceedings of Middleware 2012

Saving Cash by Using Less Cache.

[DOI]

Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing, 2012