Han Zhao

Orcid: 0000-0002-1561-5329

Affiliations:

Shanghai Jiao Tong University, Shanghai, China

According to our database¹, Han Zhao authored at least 18 papers between 2016 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2016

2017

2018

2019

2020

2021

2022

2023

2024

2025

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

FLAPS: fluctuation-aware power auction strategy for reducing the power overload probability.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., May, 2025

Adaptive Kernel Fusion for Improving the GPU Utilization While Ensuring QoS.

[BibT_eX]

[DOI]

IEEE Trans. Computers, February, 2025

2024

Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., December, 2024

Towards Fast Setup and High Throughput of GPU Serverless Computing.

[BibT_eX]

[DOI]

CoRR, 2024

A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters.

[BibT_eX]

[DOI]

CoRR, 2024

FaaSMem: Improving Memory Efficiency of Serverless Computing with Memory Pool Architecture.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation.

[BibT_eX]

[DOI]

IEEE Trans. Computers, December, 2023

ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-Grained Resource Management.

[BibT_eX]

[DOI]

IEEE Trans. Computers, May, 2023

Maximizing the Utilization of GPUs Used by Cloud Gaming through Adaptive Co-location with Combo.

[BibT_eX]

[DOI]

Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

2022

DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Services on GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2022 USENIX Annual Technical Conference, 2022

Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021

E<sup>2</sup>bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2021

Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2021

Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks.

[BibT_eX]

[DOI]

Proceedings of the 39th IEEE International Conference on Computer Design, 2021

2020

CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs.

[BibT_eX]

[DOI]

Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

2019

Bandwidth and Locality Aware Task-stealing for Manycore Architectures with Bandwidth-Asymmetric Memory.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2019

2016

Online Credit Card Fraud Detection: A Hybrid Framework with Big Data Technologies.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

Towards Scalable and Reliable In-Memory Storage System: A Case Study with Redis.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

Han Zhao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...