Amey Agrawal

Orcid: 0000-0003-2286-577X

According to our database1, Amey Agrawal authored at least 12 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations.
CoRR, 2024

Metron: Holistic Performance Evaluation Framework for LLM Inference Systems.
CoRR, 2024

Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

VIDUR: A Large-Scale Simulation Framework for LLM Inference.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

Inshrinkerator: Compressing Deep Learning Training Checkpoints via Dynamic Quantization.
Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2023
SARATHI: Efficient LLM Inference by Piggybacking Decodes with Chunked Prefills.
CoRR, 2023

DynaQuant: Compressing Deep Learning Training Checkpoints via Dynamic Quantization.
CoRR, 2023

2022
Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads.
CoRR, 2022

2019
Learning Digital Circuits: A Journey Through Weight Invariant Self-Pruning Neural Networks.
CoRR, 2019

Delog: A Privacy Preserving Log Filtering Framework for Online Compute Platforms.
CoRR, 2019

Logan: A Distributed Online Log Parser.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Delog: A High-Performance Privacy Preserving Log Filtering Framework.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019


  Loading...