Hanyu Zhao

ORCID: 0000-0002-2536-0016

According to our database, Hanyu Zhao authored at least 42 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number of three.
Erdős number of three.

Bibliography

2024

FlexNF: Flexible Network Function Orchestration for Scalable On-Path Service Chain Serving.

IEEE/ACM Trans. Netw., June, 2024

Class Hierarchy-Guided Generalized Few-Shot Ship Detection in Remote Sensing Images.

IEEE Geosci. Remote. Sens. Lett., 2024

Modifying the one-hot encoding technique can enhance the adversarial robustness of the visual model for symbol recognition.

Expert Syst. Appl., 2024

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models.

CoRR, 2024

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging.

CoRR, 2024

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency.

CoRR, 2024

Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling.

CoRR, 2024

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies.

CoRR, 2024

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach.

CoRR, 2024

Variational Continual Test-Time Adaptation.

CoRR, 2024

Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache.

CoRR, 2024

Llumnix: Dynamic Scheduling for Large Language Model Serving.

Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

A Class of Optimal Control Problems of Forward-Backward Systems with Input Constraint.

Jianhui Huang, Wenqiang Li, Hanyu Zhao

J. Optim. Theory Appl., December, 2023

Deep-Reinforcement-Learning-Based NOMA-Aided Slotted ALOHA for LEO Satellite IoT Networks.

IEEE Internet Things J., October, 2023

GoldMiner: Elastic Scaling of Training Data Pre-Processing Pipelines for Deep Learning.

Proc. ACM Manag. Data, 2023

ROAM: memory-efficient large DNN training via optimized operator ordering and memory layout.

CoRR, 2023

EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs.

Proceedings of the International Conference for High Performance Computing, 2023

SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters.

Proceedings of the Eighteenth European Conference on Computer Systems, 2023

Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Artificial Intelligence Security Competition (AISC).

CoRR, 2022

EasyScale: Accuracy-consistent Elastic Training for Deep Learning.

CoRR, 2022

Instance-wise Prompt Tuning for Pretrained Language Models.

CoRR, 2022

Deep Reinforcement Learning for the Joint AoI and Throughput Optimization of the Random Access System.

Proceedings of the 14th International Conference on Wireless Communications and Signal Processing, 2022

Zoomer: Boosting Retrieval on Web-scale Graphs by Regions of Interest.

Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Deep Learning-Based Joint Modulation and Coding Scheme Recognition for 5G New Radio Protocols.

Proceedings of the 22nd IEEE International Conference on Communication Technology, 2022

2021

WuDaoCorpora: A super large-scale Chinese corpora for pre-training language models.

AI Open, 2021

FlexNF: Flexible Network Function Orchestration on the Programmable Data Plane.

Proceedings of the 29th IEEE/ACM International Symposium on Quality of Service, 2021

A Chinese Machine Reading Comprehension Dataset Automatic Generated Based on Knowledge Graph.

Proceedings of the Chinese Computational Linguistics - 20th China National Conference, 2021

A Federated Adversarial Learning Method for Biomedical Named Entity Recognition.

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020

PASCAL: a pseudo cascade learning framework for breast cancer treatment entity normalization in Chinese clinical text.

BMC Medical Informatics Decis. Mak., 2020

HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees.

Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

A Topology-Aware Performance Prediction Model for Distributed Deep Learning on GPU Clusters.

Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019

MSD: Multi-Self-Distillation Learning via Multi-classifiers within Deep Neural Networks.

CoRR, 2019

A Word Segmentation Method of Ancient Chinese Based on Word Alignment.

Proceedings of the Natural Language Processing and Chinese Computing, 2019

SCHED²: Scheduling Deep Learning Training via Deep Reinforcement Learning.

Proceedings of the 2019 IEEE Global Communications Conference, 2019

Chinese Historical Term Translation Pairs Extraction Using Modern Chinese as a Pivot Language.

Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

2018

Gandiva: Introspective Cluster Scheduling for Deep Learning.

Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

Building efficient and available distributed transaction with Paxos-based coding consensus.

Proceedings of the IEEE INFOCOM 2018, 2018

Term Translation Extraction from Historical Classics Using Modern Chinese Explanation.

Xiaoting Wu, Hanyu Zhao, Chao Che

Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

SDPaxos: Building Efficient Semi-Decentralized Geo-replicated State Machines.

Proceedings of the ACM Symposium on Cloud Computing, 2018

Scheduling CPU for GPU-based Deep Learning Jobs.

Proceedings of the ACM Symposium on Cloud Computing, 2018
