Gabriele Oliaro

Orcid: 0000-0001-5406-0736

According to our database1, Gabriele Oliaro authored at least 9 papers between 2021 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning.
CoRR, 2024

Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models.
CoRR, 2024

SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Optimal Kernel Orchestration for Tensor Programs with Korch.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems.
CoRR, 2023

SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification.
CoRR, 2023

Direct Telemetry Access.
Proceedings of the ACM SIGCOMM 2023 Conference, 2023

2022
Direct Telemetry Access.
CoRR, 2022

2021
Zero-CPU Collection with Direct Telemetry Access.
Proceedings of the HotNets '21: The 20th ACM Workshop on Hot Topics in Networks, 2021


  Loading...