Zhenhua Han

Orcid: 0000-0002-2880-7100

According to our database1, Zhenhua Han authored at least 54 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Automating Cloud Deployment for Real-Time Online Foundation Model Inference.
IEEE/ACM Trans. Netw., April, 2024

DAG Scheduling in Mobile Edge Computing.
ACM Trans. Sens. Networks, January, 2024

Online Streaming Video Super-Resolution With Convolutional Look-Up Table.
IEEE Trans. Image Process., 2024

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval.
CoRR, 2024

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention.
CoRR, 2024

Parrot: Efficient Serving of LLM-based Applications with Semantic Variable.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Online Container Caching with Late-Warm for IoT Data Processing.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023
Online Approximation Scheme for Scheduling Heterogeneous Utility Jobs in Edge Computing.
IEEE/ACM Trans. Netw., February, 2023

Energy efficiency optimization of water pump based on heuristic algorithm and computational fluid dynamics.
J. Comput. Des. Eng., January, 2023

Dissecting Arbitrary-scale Super-resolution Capability from Pre-trained Diffusion Generative Models.
CoRR, 2023

Online Video Streaming Super-Resolution with Adaptive Look-Up Table Fusion.
CoRR, 2023

SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation.
CoRR, 2023

Seismic P-wave first-arrival picking model based on EQK-IncResNet.
Concurr. Comput. Pract. Exp., 2023

PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Optimizing Dynamic Neural Networks with Brainstorm.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Dynamic Resource Allocation for Deep Learning Clusters with Separated Compute and Storage.
Proceedings of the IEEE INFOCOM 2023, 2023

SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Efficient Online Learning Based Cross-Tier Uplink Scheduling in HetNets.
IEEE/ACM Trans. Netw., 2022

Distributed Job Dispatching in Edge Computing Networks With Random Transmission Latency: A Low-Complexity POMDP Approach.
IEEE Internet Things J., 2022

Cross-Model Operator Batching for Neural Network Architecture Search.
Proceedings of the Wireless Algorithms, Systems, and Applications, 2022

PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

Chip Surface Defect Recognition based on Improved Faster R-CNN.
Proceedings of the 28th International Conference on Mechatronics and Machine Vision in Practice, 2022

Study on the Reliability of SemGCN in Gait Analysis.
Proceedings of the 28th International Conference on Mechatronics and Machine Vision in Practice, 2022

Arithmetic optimization algorithm to optimize support vector machine for chip defect Identification.
Proceedings of the 28th International Conference on Mechatronics and Machine Vision in Practice, 2022

Online File Caching in Latency-Sensitive Systems with Delayed Hits and Bypassing.
Proceedings of the IEEE INFOCOM 2022, 2022

2021
Regularization-Based Coflow Scheduling in Optical Circuit Switches.
IEEE/ACM Trans. Netw., 2021

Asymptotically Optimal Online Caching on Multiple Caches With Relaying and Bypassing.
IEEE/ACM Trans. Netw., 2021

SPIN: BSP Job Scheduling With Placement-Sensitive Execution.
IEEE/ACM Trans. Netw., 2021

2020
Online Deadline-Aware Task Dispatching and Scheduling in Edge Computing.
IEEE Trans. Parallel Distributed Syst., 2020

Cooperative Jobs Dispatching in Edge Computing Network with Unpredictable Uploading Delay.
J. Commun. Inf. Networks, 2020

Online Learning-Based Co-task Dispatching with Function Configuration in Edge Computing.
Proceedings of the Parallel and Distributed Computing, Applications and Technologies, 2020

HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Retiarii: A Deep Learning Exploratory-Training Framework.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Online Distributed Job Dispatching with Outdated and Partially-Observable Information.
Proceedings of the 16th International Conference on Mobility, Sensing and Networking, 2020

Online dispatching and scheduling of jobs with heterogeneous utilities in edge computing.
Proceedings of the Mobihoc '20: The Twenty-first ACM International Symposium on Theory, 2020

Automating Cloud Deployment for Deep Learning Inference of Real-time Online Services.
Proceedings of the 39th IEEE Conference on Computer Communications, 2020

Scheduling Placement-Sensitive BSP Jobs with Inaccurate Execution Time Estimation.
Proceedings of the 39th IEEE Conference on Computer Communications, 2020

2019
Joint Online Coflow Routing and Scheduling in Data Center Networks.
IEEE/ACM Trans. Netw., 2019

Energy-Efficient Dynamic Virtual Machine Management in Data Centers.
IEEE/ACM Trans. Netw., 2019

OnDisc: Online Latency-Sensitive Job Dispatching and Scheduling in Heterogeneous Edge-Clouds.
IEEE/ACM Trans. Netw., 2019

Cooperative Job Dispatching in Edge Computing Network with Unpredictable Uploading Delay.
CoRR, 2019

Dependent task placement and scheduling with function configuration in edge computing.
Proceedings of the International Symposium on Quality of Service, 2019

Camul: Online Caching on Multiple Caches with Relaying and Bypassing.
Proceedings of the 2019 IEEE Conference on Computer Communications, 2019

2018
Gandiva: Introspective Cluster Scheduling for Deep Learning.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

Online Learning based Uplink Scheduling in HetNets with Limited Backhaul Capacity.
Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

Scheduling CPU for GPU-based Deep Learning Jobs.
Proceedings of the ACM Symposium on Cloud Computing, 2018

2017
Congestion Game With Agent and Resource Failures.
IEEE J. Sel. Areas Commun., 2017

Online job dispatching and scheduling in edge-clouds.
Proceedings of the 2017 IEEE Conference on Computer Communications, 2017

2016
Cross-Layer Protocol Design for Wireless Communication in Hybrid Data Center Networks.
Proceedings of the 12th International Conference on Mobile Ad-Hoc and Sensor Networks, 2016

Dynamic virtual machine management via approximate Markov decision process.
Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016

2015
Selfish task-driven routing in hybrid networks.
Proceedings of the 13th International Symposium on Modeling and Optimization in Mobile, 2015

Optimal Rendezvous Strategies for Different Environments in Cognitive Radio Networks.
Proceedings of the 18th ACM International Conference on Modeling, 2015

2014
Channel Selection for Rendezvous with High Link Stability in Cognitive Radio Network.
Proceedings of the Wireless Algorithms, Systems, and Applications, 2014


  Loading...