Jiazhi Jiang

Orcid: 0000-0002-1417-3012

According to our database, Jiazhi Jiang authored at least 16 papers between 2020 and 2024.

Bibliography

2024
Sophisticated Orchestrating Concurrent DLRM Training on CPU/GPU Platform.
IEEE Trans. Parallel Distributed Syst., November, 2024

SAIH: A Scalable Evaluation Methodology for Understanding AI Performance Trend on HPC Systems.
J. Comput. Sci. Technol., March, 2024

HTDcr: a job execution framework for high-throughput computing on supercomputers.
Sci. China Inf. Sci., 2024

Liger: Interleaving Intra- and Inter-Operator Parallelism for Distributed Large Model Inference.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

Efficient Coupling Streaming AI and Ensemble Simulations on HPC Clusters.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

Communication-Efficient Model Parallelism for Distributed In-Situ Transformer Inference.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

2023
Improving Computation and Memory Efficiency for Real-world Transformer Inference on GPUs.
ACM Trans. Archit. Code Optim., December, 2023

Hierarchical Model Parallelism for Optimizing Inference on Many-core Processor via Decoupled 3D-CNN Structure.
ACM Trans. Archit. Code Optim., September, 2023

Optimizing massively parallel sparse matrix computing on ARM many-core processor.
Parallel Comput., September, 2023

Full-Stack Optimizing Transformer Inference on ARM Many-Core CPU.
IEEE Trans. Parallel Distributed Syst., July, 2023

MixRec: Orchestrating Concurrent Recommendation Model Training on CPU-GPU platform.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023

Accelerating Inference of 3D-CNN on ARM Many-core CPU via Hierarchical Model Partition.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

2022
Optimizing small channel 3D convolution on GPU with tensor core.
Parallel Comput., 2022

Handling heavy-tailed input of transformer inference on GPUs.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

Characterizing and Optimizing Transformer Inference on ARM Many-core Processor.
Proceedings of the 51st International Conference on Parallel Processing, 2022

2020
A mechanism for scheduling multi robot intelligent warehouse system face with dynamic demand.
J. Intell. Manuf., 2020

