Lansong Diao

Orcid: 0009-0000-6193-6126

According to our database¹, Lansong Diao authored at least 16 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2019

2020

2021

2022

2023

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Nineteenth European Conference on Computer Systems, 2024

FaPES: Enabling Efficient Elastic Scaling for Serverless Machine Learning Platforms.

[BibT_eX]

[DOI]

Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2023

HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis.

[BibT_eX]

[DOI]

Dataset, November, 2023

HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis.

[BibT_eX]

[DOI]

Dataset, November, 2023

Expediting Distributed DNN Training With Device Topology-Aware Graph Deployment.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., April, 2023

Ada-Grouper: Accelerating Pipeline Parallelism in Preempted Network by Adaptive Group-Scheduling for Micro-Batches.

[BibT_eX]

[DOI]

CoRR, 2023

Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform.

[BibT_eX]

[DOI]

CoRR, 2023

2022

Optimizing DNN Compilation for Distributed Training With Joint OP and Tensor Fusion.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2022

Accelerating large-scale distributed neural network training with SPMD parallelism.

[BibT_eX]

[DOI]

Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022

2021

DAPPLE: a pipelined data parallel approach for training large models.

[BibT_eX]

[DOI]

Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

DISC: A Dynamic Shape Compiler for Machine Learning Workloads.

[BibT_eX]

[DOI]

Proceedings of the EuroMLSys@EuroSys 2021, 2021

2020

FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads.

[BibT_eX]

[DOI]

CoRR, 2020

Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads.

[BibT_eX]

[DOI]

CoRR, 2020

Optimizing distributed training deployment in heterogeneous GPU clusters.

[BibT_eX]

[DOI]

Proceedings of the CoNEXT '20: The 16th International Conference on emerging Networking EXperiments and Technologies, 2020

2019

PAI-FCNN: FPGA Based CNN Inference System.

[BibT_eX]

[DOI]

Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

PAI-FCNN: FPGA Based Inference System for Complex CNN Models.

[BibT_eX]

[DOI]

Proceedings of the 30th IEEE International Conference on Application-specific Systems, 2019

Lansong Diao

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...