29-Billion Atoms Molecular Dynamics Simulation With Ab Initio Accuracy on 35 Million Cores of New Sunway Supercomputer.
,
,
,
,
,
,
,
,
,
,
IEEE Trans. Computers, May, 2025
Hybrid 3D Reconstruction of Indoor Scenes Integrating Object Recognition.
Remote. Sens., February, 2024
Deep learning representations for quantum many-body systems on heterogeneous hardware.
Mach. Learn. Sci. Technol., March, 2023
Bridging the Gap between Deep Learning and Frustrated Quantum Spin System for Extreme-Scale Simulations on New Generation of Sunway Supercomputer.
IEEE Trans. Parallel Distributed Syst., 2022
AI for Quantum Mechanics: High Performance Quantum Many-Body Simulations via Deep Learning.
,
,
,
,
,
,
,
,
,
,
Proceedings of the SC22: International Conference for High Performance Computing, 2022
Machine Learning-enabled Performance Model for DNN Applications and AI Accelerator.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022
swFLOW: A large-scale distributed framework for deep learning on Sunway TaihuLight supercomputer.
Inf. Sci., 2021
Distributed deep learning system for cancerous region detection on Sunway TaihuLight.
CCF Trans. High Perform. Comput., 2020
Runtime Adaptive Matrix Multiplication for the SW26010 Many-Core Processor.
IEEE Access, 2020
Optimizing Astrophysical Simulation Software on Sunway Heterogeneous Manycore Architecture.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020
Degree-of-Node Task Scheduling of Fine-Grained Parallel Programs on Heterogeneous Systems.
J. Comput. Sci. Technol., 2019
Improving the Performance of Distributed MXNet with RDMA.
Int. J. Parallel Program., 2019
Gdarts: A GPU-Based Runtime System for Dataflow Task Programming on Dependency Applications.
Proceedings of the 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2019
swFLOW: A Dataflow Deep Learning Framework on Sunway TaihuLight Supercomputer.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019
TripletRun: A Dataflow Runtime Simulator and Its Performance Model.
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019