swPTS: an efficient parallel Thomas split algorithm for tridiagonal systems on Sunway manycore processors.
J. Supercomput., March, 2024
STMS-YOLOv5: A Lightweight Algorithm for Gear Surface Defect Detection.
Sensors, July, 2023
SPM-GCN: An adaptive reordering algorithm for sparse LU factorization via GCN.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023
hcaPCG: A Heterogeneous and communication-avoid PCG with Jacobi preconditioner on SW26010-Pro architecture.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023
AH-TDMA: An Adaptive Heterogeneous Tridiagonal Matrix Algorithm on the New Sunway Supercomputer.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023
An optimized Hybrid Gauss-Seidel smoother In AMG solver of Hypre on Sunway Many-core Architecture.
Proceedings of the 7th International Conference on High Performance Compilation, 2023