Han Zhao

Orcid: 0000-0002-1561-5329

Affiliations:
  • Shanghai Jiao Tong University, Shanghai, China


According to our database1, Han Zhao authored at least 17 papers between 2016 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
FLAPS: fluctuation-aware power auction strategy for reducing the power overload probability.
Frontiers Comput. Sci., May, 2025

2024
Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture.
CoRR, 2024

Towards Fast Setup and High Throughput of GPU Serverless Computing.
CoRR, 2024

A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters.
CoRR, 2024

FaaSMem: Improving Memory Efficiency of Serverless Computing with Memory Pool Architecture.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation.
IEEE Trans. Computers, December, 2023

ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-Grained Resource Management.
IEEE Trans. Computers, May, 2023

Maximizing the Utilization of GPUs Used by Cloud Gaming through Adaptive Co-location with Combo.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

2022
DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Services on GPUs.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021
E<sup>2</sup>bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services.
IEEE Trans. Parallel Distributed Syst., 2021

Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction.
Proceedings of the International Conference for High Performance Computing, 2021

Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks.
Proceedings of the 39th IEEE International Conference on Computer Design, 2021

2020
CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

2019
Bandwidth and Locality Aware Task-stealing for Manycore Architectures with Bandwidth-Asymmetric Memory.
ACM Trans. Archit. Code Optim., 2019

2016
Online Credit Card Fraud Detection: A Hybrid Framework with Big Data Technologies.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

Towards Scalable and Reliable In-Memory Storage System: A Case Study with Redis.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016


  Loading...