Jun Yang

Affiliations:
  • NVIDIA Corp, Beijing, China
  • Alibaba Group, Computing Platform Department, Hangzhou, China


According to our database, Jun Yang authored at least 18 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.
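A collaborative distance such as an Erdős or Dijkstra number is the length of the shortest coauthorship path between two authors, which can be computed with a breadth-first search over the coauthorship graph. A minimal sketch, using a hypothetical toy graph rather than real coauthorship data:

```python
from collections import deque

def collab_distance(coauthors, start, target):
    """Shortest number of coauthorship hops between two authors
    (BFS over an undirected coauthorship graph)."""
    if start == target:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        author, dist = queue.popleft()
        for peer in coauthors.get(author, ()):
            if peer == target:
                return dist + 1
            if peer not in seen:
                seen.add(peer)
                queue.append((peer, dist + 1))
    return None  # no path: the distance is undefined

# Hypothetical toy graph (illustrative names, not real coauthor data):
graph = {
    "A": {"B"},
    "B": {"A", "C"},
    "C": {"B", "D"},
    "D": {"C"},
}
print(collab_distance(graph, "A", "D"))  # → 3
```

A distance of four, as listed above, means the shortest chain of coauthored papers linking the two authors has four hops.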

Bibliography

2024
Boosting the Convergence of Reinforcement Learning-Based Auto-Pruning Using Historical Data.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., February 2024

2022
Optimizing DNN Compilation for Distributed Training With Joint OP and Tensor Fusion.
IEEE Trans. Parallel Distributed Syst., 2022

Efficient Pipeline Planning for Expedited Distributed DNN Training.
Proceedings of IEEE INFOCOM, 2022

AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures.
Proceedings of ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 2022

2021
DAPPLE: a pipelined data parallel approach for training large models.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

DISC: A Dynamic Shape Compiler for Machine Learning Workloads.
Proceedings of EuroMLSys@EuroSys, 2021

2020
EasyTransfer - A Simple and Scalable Deep Transfer Learning Platform for NLP Applications.
CoRR, 2020

INT8 Winograd Acceleration for Conv1D Equipped ASR Models Deployed on Mobile Devices.
CoRR, 2020

FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads.
CoRR, 2020

Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads.
CoRR, 2020

Fast Training of Deep Learning Models over Multiple GPUs.
Proceedings of the Middleware '20: 21st International Middleware Conference, 2020

A History-Based Auto-Tuning Framework for Fast and High-Performance DNN Design on GPU.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

Optimizing distributed training deployment in heterogeneous GPU clusters.
Proceedings of the CoNEXT '20: The 16th International Conference on emerging Networking EXperiments and Technologies, 2020

2019
FusionStitching: Boosting Execution Efficiency of Memory Intensive Computations for DL Workloads.
CoRR, 2019

Characterizing Deep Learning Training Workloads on Alibaba-PAI.
Proceedings of the IEEE International Symposium on Workload Characterization, 2019

2018
Graph-Adaptive Pruning for Efficient Inference of Convolutional Neural Networks.
CoRR, 2018

FusionStitching: Deep Fusion and Code Generation for Tensorflow Computations on GPUs.
CoRR, 2018

Efficient Deep Learning Inference Based on Model Compression.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
