2025
NUMA-aware parallel sparse LU factorization for SPICE-based circuit simulators on ARM multi-core processors.
Int. J. High Perform. Comput. Appl., 2025

DIDS: A distributed inference framework with dynamic scheduling capability.
Future Gener. Comput. Syst., 2025

2024
COALA: A Compiler-Assisted Adaptive Library Routines Allocation Framework for Heterogeneous Systems.
IEEE Trans. Computers, July, 2024

HPS Cholesky: Hierarchical Parallelized Supernodal Cholesky with Adaptive Parameters.
ACM Trans. Parallel Comput., March, 2024

ABSS: An Adaptive Batch-Stream Scheduling Module for Dynamic Task Parallelism on Chiplet-based Multi-Chip Systems.
ACM Trans. Parallel Comput., March, 2024

Parallel algorithm design and optimization of geodynamic numerical simulation application on the Tianhe new-generation high-performance computer.
J. Supercomput., January, 2024

2023
PETCH-DB: a Portal for Exploring Tissue-specific and Complex disease-associated 5-Hydroxymethylcytosines.
Database J. Biol. Databases Curation, 2023

2021
STM-multifrontal QR: streaming task mapping multifrontal QR factorization empowered by GCN.
Proceedings of the International Conference for High Performance Computing, 2021

Parallel Sparse LU Factorization With Machine-Learning Method on Multi-core Processors.
Proceedings of the 7th International Conference on Systems and Informatics, 2021