Quanlu Zhang

ORCID: 0000-0003-0557-1104

According to our database, Quanlu Zhang authored at least 35 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number of three.
Erdős number of three.

Bibliography

2024

Automating Cloud Deployment for Real-Time Online Foundation Model Inference.

IEEE/ACM Trans. Netw., April, 2024

Efficient Large Language Models: A Survey.

Trans. Mach. Learn. Res., 2024

You Only Cache Once: Decoder-Decoder Architectures for Language Models.

CoRR, 2024

Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation.

Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training.

Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

2023

AutoTaskFormer: Searching Vision Transformers for Multi-task Learning.

CoRR, 2023

SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation.

CoRR, 2023

SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction.

CoRR, 2023

PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.

Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Efficient GPU Kernels for N:M-Sparse Weights in Deep Learning.

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters.

Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022

SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute.

Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training.

Proceedings of the IEEE 40th International Conference on Computer Design, 2022

Privacy-preserving Online AutoML for Domain-Specific Face Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

AceNAS: Learning to Rank Ace Neural Architectures with Weak Supervision of Weight Sharing.

CoRR, 2021

2020

How Does Supernet Help in Neural Architecture Search?

Yuge Zhang, Quanlu Zhang, Yaming Yang

CoRR, 2020

Deeper Insights into Weight Sharing in Neural Architecture Search.

CoRR, 2020

A Novel Hybrid Active Contour Model for Intracranial Tuberculosis MRI Segmentation Applications.

IEEE Access, 2020

AutoSys: The Design and Operation of Learning-Augmented Systems.

Proceedings of the 2020 USENIX Annual Technical Conference, 2020

HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees.

Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Retiarii: A Deep Learning Exploratory-Training Framework.

Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Automating Cloud Deployment for Deep Learning Inference of Real-time Online Services.

Proceedings of the 39th IEEE Conference on Computer Communications, 2020

LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression.

Proceedings of the 28th International Conference on Computational Linguistics, 2020

2018

Gandiva: Introspective Cluster Scheduling for Deep Learning.

Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

Building efficient and available distributed transaction with Paxos-based coding consensus.

Proceedings of the IEEE INFOCOM 2018, 2018

Towards Web-based Delta Synchronization for Cloud Storage Services.

Proceedings of the 16th USENIX Conference on File and Storage Technologies, 2018

SDPaxos: Building Efficient Semi-Decentralized Geo-replicated State Machines.

Proceedings of the ACM Symposium on Cloud Computing, 2018

Scheduling CPU for GPU-based Deep Learning Jobs.

Proceedings of the ACM Symposium on Cloud Computing, 2018

2017

DeltaCFS: Boosting Delta Sync for Cloud Storage Services by Learning from NFS.

Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017

2015

CHARM: A Cost-Efficient Multi-Cloud Data Hosting Scheme with High Availability.

IEEE Trans. Cloud Comput., 2015

UStore: A Low Cost Cold and Archival Data Storage System for Data Centers.

Proceedings of the 35th IEEE International Conference on Distributed Computing Systems, 2015

Understanding and Surpassing Dropbox: Efficient Incremental Synchronization in Cloud Storage Services.

Proceedings of the 2015 IEEE Global Communications Conference, 2015

DSwitch: a dual mode direct and network attached disk.

Quanlu Zhang, Yafei Dai, Lintao Zhang

Proceedings of the Sixth ACM Symposium on Cloud Computing, 2015
