Compiling ONNX Neural Network Models Using MLIR.
CoRR, 2020
Acceleration of Large Deep Learning Training with Hybrid GPU Memory Management of Swapping and Re-computing.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020
Profiling based Out-of-core Hybrid Method for Large Neural Networks.
CoRR, 2019
Profiling based out-of-core hybrid method for large neural networks: poster.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019
High Resolution Medical Image Segmentation Using Data-Swapping Method.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019
Automatic GPU memory management for large neural models in TensorFlow.
Proceedings of the 2019 ACM SIGPLAN International Symposium on Memory Management, 2019
Large Data Flow Graphs in Limited GPU Memory.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019
Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method.
CoRR, 2018
TFLMS: Large Model Support in TensorFlow by Graph Rewriting.
CoRR, 2018
Involving CPUs into Multi-GPU Deep Learning.
Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering, 2018
A Systematic Approach to Regular-Expression-Based Queries on Big Graphs.
PhD thesis, 2016