Jiangfei Duan

Orcid: 0000-0002-6327-2033

According to our database1, Jiangfei Duan authored at least 9 papers in 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Proteus: Simulating the Performance of Distributed DNN Training.
IEEE Trans. Parallel Distributed Syst., October, 2024

Efficient Training of Large Language Models on Distributed Infrastructures: A Survey.
CoRR, 2024

SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention.
CoRR, 2024

SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models.
CoRR, 2024

MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving.
CoRR, 2024

Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SpotServe: Serving Generative Large Language Models on Preemptible Instances.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Centauri: Enabling Efficient Scheduling for Communication-Computation Overlap in Large Model Training via Communication Partitioning.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024


  Loading...