Shiyi Cao

According to our database1, Shiyi Cao authored at least 19 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference.
CoRR, 2024

Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version).
CoRR, 2024

GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism.
CoRR, 2024

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models.
CoRR, 2024

Optimizing LLM Queries in Relational Workloads.
CoRR, 2024

Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs.
Proceedings of the International Conference for High Performance Computing, 2024

Fairness in Serving Large Language Models.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

SLoRA: Scalable Serving of Thousands of LoRA Adapters.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

2023
Understanding Spatial-Temporal Interactions of Ecosystem Services and Their Drivers in a Multi-Scale Perspective of Miluo Using Multi-Source Remote Sensing Data.
Remote. Sens., July, 2023

Efficiently Programming Large Language Models using SGLang.
CoRR, 2023

S-LoRA: Serving Thousands of Concurrent LoRA Adapters.
CoRR, 2023

2022
Low-loss Mode Field Adapter Using Reverse Tapering for Fundamental Mode Transmission over MMFs.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2022

Novel Mirror-flipped Mode Permutation Technique for Long-haul Mode-division Multiplexing Transmissions.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2022

LightPro: Lightweight Probabilistic Workload Prediction Framework for Database-as-a-Service.
Proceedings of the IEEE International Conference on Web Services, 2022

Accelerating Data Serialization/Deserialization Protocols with In-Network Compute.
Proceedings of the IEEE/ACM International Workshop on Exascale MPI, 2022

2019
AdaM: An Adaptive Fine-Grained Scheme for Distributed Metadata Management.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2014
Demonstration of ultra-compact contentionless-ROADM based on flexible wavelength router.
Proceedings of the European Conference on Optical Communication, 2014

2013
A hybrid seasonal prediction model for tuberculosis incidence in China.
BMC Medical Informatics Decis. Mak., 2013

Green and agile petabit optical sub-wavelength switching prototype for the future OTN multi-chassis switch cluster.
Proceedings of the 2013 Optical Fiber Communication Conference and Exposition and the National Fiber Optic Engineers Conference (OFC/NFOEC), 2013


  Loading...