Jiazhi Jiang

Orcid: 0000-0002-1417-3012

According to our database, Jiazhi Jiang authored at least 14 papers between 2020 and 2024.

Bibliography

2024
SAIH: A Scalable Evaluation Methodology for Understanding AI Performance Trend on HPC Systems.
J. Comput. Sci. Technol., March, 2024

HTDcr: a job execution framework for high-throughput computing on supercomputers.
Sci. China Inf. Sci., 2024

Liger: Interleaving Intra- and Inter-Operator Parallelism for Distributed Large Model Inference.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

Communication-Efficient Model Parallelism for Distributed In-Situ Transformer Inference.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

2023
Improving Computation and Memory Efficiency for Real-world Transformer Inference on GPUs.
ACM Trans. Archit. Code Optim., December, 2023

Hierarchical Model Parallelism for Optimizing Inference on Many-core Processor via Decoupled 3D-CNN Structure.
ACM Trans. Archit. Code Optim., September, 2023

Optimizing massively parallel sparse matrix computing on ARM many-core processor.
Parallel Comput., September, 2023

Full-Stack Optimizing Transformer Inference on ARM Many-Core CPU.
IEEE Trans. Parallel Distributed Syst., July, 2023

MixRec: Orchestrating Concurrent Recommendation Model Training on CPU-GPU platform.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023

Accelerating Inference of 3D-CNN on ARM Many-core CPU via Hierarchical Model Partition.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

2022
Optimizing small channel 3D convolution on GPU with tensor core.
Parallel Comput., 2022

Handling heavy-tailed input of transformer inference on GPUs.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

Characterizing and Optimizing Transformer Inference on ARM Many-core Processor.
Proceedings of the 51st International Conference on Parallel Processing, 2022

2020
A mechanism for scheduling multi robot intelligent warehouse system face with dynamic demand.
J. Intell. Manuf., 2020

