Jayashree Mohan

Orcid: 0009-0005-5260-3203

According to our database1, Jayashree Mohan authored at least 24 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inference.
CoRR, 2024

ASTRA: Accurate and Scalable ANNS-based Training of Extreme Classifiers.
CoRR, 2024

Metron: Holistic Performance Evaluation Framework for LLM Inference Systems.
CoRR, 2024

vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention.
CoRR, 2024

Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

VIDUR: A Large-Scale Simulation Framework for LLM Inference.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

2023
SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills.
CoRR, 2023


2022
Looking Beyond GPUs for DNN Scheduling on Multi-Tenant Clusters.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

2021
Analyzing and Mitigating Data Stalls in DNN Training.
Proc. VLDB Endow., 2021

Synergy: Resource Sensitive DNN Scheduling in Multi-Tenant Clusters.
CoRR, 2021

Memory Optimization for Deep Networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

CheckFreq: Frequent, Fine-Grained DNN Checkpointing.
Proceedings of the 19th USENIX Conference on File and Storage Technologies, 2021

2020
INSTalytics: Cluster Filesystem Co-design for Big-data Analytics.
ACM Trans. Storage, 2020

2019
CrashMonkey and ACE: Systematically Testing File-System Crash Consistency.
ACM Trans. Storage, 2019

Analyzing GDPR Compliance Through the Lens of Privacy Policy.
Proceedings of the Heterogeneous Data Management, Polystores, and Analytics for Healthcare, 2019

Recipe: converting concurrent DRAM indexes to persistent-memory indexes.
Proceedings of the 27th ACM Symposium on Operating Systems Principles, 2019

2018
Finding Crash-Consistency Bugs with Bounded Black-Box Crash Testing.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

2017
Analyzing IO Amplification in Linux File Systems.
CoRR, 2017

Storage on Your SmartPhone Uses More Energy Than You Think.
Proceedings of the 9th USENIX Workshop on Hot Topics in Storage and File Systems, 2017

A reinforcement learning approach to optimize downloads over mobile networks.
Proceedings of the 9th International Conference on Communication Systems and Networks, 2017

The Dangers and Complexities of SQLite Benchmarking.
Proceedings of the 8th Asia-Pacific Workshop on Systems, Mumbai, India, September 2, 2017, 2017

2016
Optimizing Downloads over Random Duration Links in Mobile Networks.
Proceedings of the 25th International Conference on Computer Communication and Networks, 2016

2015
Smart and Secure Monitoring of Industrial Environments using IoT.
Proceedings of the Third International Symposium on Women in Computing and Informatics, 2015


  Loading...