Pratyush Patel

Orcid: 0000-0003-3611-5160

According to our database, Pratyush Patel authored at least 17 papers between 2017 and 2024.

Bibliography

2024
Input-Dependent Power Usage in GPUs.
CoRR, 2024

Splitwise: Efficient Generative LLM Inference Using Phase Splitting.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Characterizing Power Management Opportunities for LLMs in the Cloud.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
POLCA: Power Oversubscription in LLM Cloud Providers.
CoRR, 2023

Hybrid Computing for Interactive Datacenter Applications.
CoRR, 2023

Towards Improved Power Management in Cloud GPUs.
IEEE Comput. Archit. Lett., 2023

An Agile Pathway Towards Carbon-aware Clouds.
Proceedings of the 2nd Workshop on Sustainable Computer Systems, 2023

2022
SoundWatch: deep learning for sound accessibility on smartwatches.
Commun. ACM, 2022

SRIFTY: Swift and Thrifty Distributed Neural Network Training on the Cloud.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

2021
The Demikernel Datapath OS Architecture for Microsecond-scale Datacenter Systems.
Proceedings of the SOSP '21: ACM SIGOPS 28th Symposium on Operating Systems Principles, 2021

2020
The Virtual Block Interface: A Flexible Alternative to the Conventional Virtual Memory Framework.
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

SoundWatch: Exploring Smartwatch-based Deep Learning Approaches to Support Sound Awareness for Deaf and Hard of Hearing Users.
Proceedings of the ASSETS '20: The 22nd International ACM SIGACCESS Conference on Computers and Accessibility, 2020

2018
A server-based approach for predictable GPU access with improved analysis.
J. Syst. Archit., 2018

Analytical Enhancements and Practical Insights for MPCP with Self-Suspensions.
Proceedings of the IEEE Real-Time and Embedded Technology and Applications Symposium, 2018

Gandiva: Introspective Cluster Scheduling for Deep Learning.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

2017
A server-based approach for predictable GPU access control.
Proceedings of the 23rd IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2017

TimerShield: Protecting High-Priority Tasks from Low-Priority Timer Interference (Outstanding Paper).
Proceedings of the 2017 IEEE Real-Time and Embedded Technology and Applications Symposium, 2017

