Wei Zhang

Orcid: 0000-0002-4840-0324

Affiliations:
  • Shanghai Jiao Tong University, China


According to our database, Wei Zhang authored at least 11 papers between 2019 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2022
Adaptive Resource Efficient Microservice Deployment in Cloud-Edge Continuum.
IEEE Trans. Parallel Distributed Syst., 2022

Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs.
IEEE Trans. Computers, 2022

PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Astraea: towards QoS-aware and resource-efficient multi-stage GPU services.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

2021
QoS-Aware and Resource Efficient Microservice Deployment in Cloud-Edge Continuum.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

CHARM: Collaborative Host and Accelerator Resource Management for GPU Datacenters.
Proceedings of the 39th IEEE International Conference on Computer Design, 2021

2020
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters.
CoRR, 2020

URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

2019
URSA: Precise Capacity Planning and Contention-aware Scheduling for Public Clouds.
CoRR, 2019

Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters.
Proceedings of the ACM International Conference on Supercomputing, 2019

