Lansong Diao

Orcid: 0009-0000-6193-6126

According to our database1, Lansong Diao authored at least 16 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

FaPES: Enabling Efficient Elastic Scaling for Serverless Machine Learning Platforms.
Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2023
HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis.
Dataset, November, 2023

HAP: SPMD DNN Training on Heterogeneous GPU Clusters with Automated Program Synthesis.
Dataset, November, 2023

Expediting Distributed DNN Training With Device Topology-Aware Graph Deployment.
IEEE Trans. Parallel Distributed Syst., April, 2023

Ada-Grouper: Accelerating Pipeline Parallelism in Preempted Network by Adaptive Group-Scheduling for Micro-Batches.
CoRR, 2023

Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform.
CoRR, 2023

2022
Optimizing DNN Compilation for Distributed Training With Joint OP and Tensor Fusion.
IEEE Trans. Parallel Distributed Syst., 2022

Accelerating large-scale distributed neural network training with SPMD parallelism.
Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022

2021
DAPPLE: a pipelined data parallel approach for training large models.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

DISC: A Dynamic Shape Compiler for Machine Learning Workloads.
Proceedings of the EuroMLSys@EuroSys 2021, 2021

2020
FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads.
CoRR, 2020

Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads.
CoRR, 2020

Optimizing distributed training deployment in heterogeneous GPU clusters.
Proceedings of the CoNEXT '20: The 16th International Conference on emerging Networking EXperiments and Technologies, 2020

2019
PAI-FCNN: FPGA Based CNN Inference System.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

PAI-FCNN: FPGA Based Inference System for Complex CNN Models.
Proceedings of the 30th IEEE International Conference on Application-specific Systems, 2019


  Loading...