Hanyu Zhao

ORCID: 0000-0002-2536-0016

According to our database, Hanyu Zhao authored at least 42 papers between 2018 and 2024.

Bibliography

2024
FlexNF: Flexible Network Function Orchestration for Scalable On-Path Service Chain Serving.
IEEE/ACM Trans. Netw., June, 2024

Class Hierarchy-Guided Generalized Few-Shot Ship Detection in Remote Sensing Images.
IEEE Geosci. Remote. Sens. Lett., 2024

Modifying the one-hot encoding technique can enhance the adversarial robustness of the visual model for symbol recognition.
Expert Syst. Appl., 2024

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models.
CoRR, 2024

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging.
CoRR, 2024

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency.
CoRR, 2024

Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling.
CoRR, 2024

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies.
CoRR, 2024

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach.
CoRR, 2024

Variational Continual Test-Time Adaptation.
CoRR, 2024

Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache.
CoRR, 2024

Llumnix: Dynamic Scheduling for Large Language Model Serving.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
A Class of Optimal Control Problems of Forward-Backward Systems with Input Constraint.
J. Optim. Theory Appl., December, 2023

Deep-Reinforcement-Learning-Based NOMA-Aided Slotted ALOHA for LEO Satellite IoT Networks.
IEEE Internet Things J., October, 2023

GoldMiner: Elastic Scaling of Training Data Pre-Processing Pipelines for Deep Learning.
Proc. ACM Manag. Data, 2023

ROAM: memory-efficient large DNN training via optimized operator ordering and memory layout.
CoRR, 2023

EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs.
Proceedings of the International Conference for High Performance Computing, 2023

SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Artificial Intelligence Security Competition (AISC).
CoRR, 2022

EasyScale: Accuracy-consistent Elastic Training for Deep Learning.
CoRR, 2022

Instance-wise Prompt Tuning for Pretrained Language Models.
CoRR, 2022

Deep Reinforcement Learning for the Joint AoI and Throughput Optimization of the Random Access System.
Proceedings of the 14th International Conference on Wireless Communications and Signal Processing, 2022

Zoomer: Boosting Retrieval on Web-scale Graphs by Regions of Interest.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Deep Learning-Based Joint Modulation and Coding Scheme Recognition for 5G New Radio Protocols.
Proceedings of the 22nd IEEE International Conference on Communication Technology, 2022

2021
WuDaoCorpora: A super large-scale Chinese corpora for pre-training language models.
AI Open, 2021

FlexNF: Flexible Network Function Orchestration on the Programmable Data Plane.
Proceedings of the 29th IEEE/ACM International Symposium on Quality of Service, 2021

A Chinese Machine Reading Comprehension Dataset Automatic Generated Based on Knowledge Graph.
Proceedings of the Chinese Computational Linguistics - 20th China National Conference, 2021

A Federated Adversarial Learning Method for Biomedical Named Entity Recognition.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020
PASCAL: a pseudo cascade learning framework for breast cancer treatment entity normalization in Chinese clinical text.
BMC Medical Informatics Decis. Mak., 2020

HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

A Topology-Aware Performance Prediction Model for Distributed Deep Learning on GPU Clusters.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019
MSD: Multi-Self-Distillation Learning via Multi-classifiers within Deep Neural Networks.
CoRR, 2019

A Word Segmentation Method of Ancient Chinese Based on Word Alignment.
Proceedings of the Natural Language Processing and Chinese Computing, 2019

SCHED²: Scheduling Deep Learning Training via Deep Reinforcement Learning.
Proceedings of the 2019 IEEE Global Communications Conference, 2019

Chinese Historical Term Translation Pairs Extraction Using Modern Chinese as a Pivot Language.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

2018
Gandiva: Introspective Cluster Scheduling for Deep Learning.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

Building efficient and available distributed transaction with Paxos-based coding consensus.
Proceedings of the IEEE INFOCOM 2018, 2018

Term Translation Extraction from Historical Classics Using Modern Chinese Explanation.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

SDPaxos: Building Efficient Semi-Decentralized Geo-replicated State Machines.
Proceedings of the ACM Symposium on Cloud Computing, 2018

Scheduling CPU for GPU-based Deep Learning Jobs.
Proceedings of the ACM Symposium on Cloud Computing, 2018

