Zhe Zhou

Orcid: 0000-0001-7929-8054

Affiliations:
  • Peking University, Beijing, China


According to our database, Zhe Zhou authored at least 24 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Toward CXL-Native Memory Tiering via Device-Side Profiling.
CoRR, 2024

LLM Inference Unveiled: Survey and Roofline Model Insights.
CoRR, 2024

NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

A Software-Hardware Co-design Solution for 3D Inner Structure Reconstruction.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

SpecPIM: Accelerating Speculative Inference on PIM-Enabled System via Architecture-Dataflow Co-Exploration.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
FD-CNN: A Frequency-Domain FPGA Acceleration Scheme for CNN-Based Image-Processing Applications.
ACM Trans. Embed. Comput. Syst., November, 2023

Energon: Toward Efficient Acceleration of Transformers Using Dynamic Sparse Attention.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2023

Automated Design of Chiplets.
Proceedings of the 2023 International Symposium on Physical Design, 2023

DIMM-Link: Enabling Efficient Inter-DIMM Communication for Near-Memory Processing.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

NMExplorer: An Efficient Exploration Framework for DIMM-based Near-Memory Tensor Reduction.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Polaris: Enhancing CXL-based Memory Expanders with Memory-side Prefetching.
Proceedings of the Advanced Parallel Processing Technologies, 2023

2022
PetS: A Unified Framework for Parameter-Efficient Transformers Serving.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

GNNear: Accelerating Full-Batch Training of Graph Neural Networks with near-Memory Processing.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021
GCNear: A Hybrid Architecture for Efficient GCN Training with Near-Memory Processing.
CoRR, 2021

Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention.
CoRR, 2021

METRO: A Software-Hardware Co-Design of Interconnections for Spatial DNN Accelerators.
CoRR, 2021

Rapid Configuration of Asynchronous Recurrent Neural Networks for ASIC Implementations.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

BlockGNN: Towards Efficient GNN Acceleration Using Block-Circulant Weight Matrices.
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

Reconfigurable ASIC Implementation of Asynchronous Recurrent Neural Networks.
Proceedings of the 27th IEEE International Symposium on Asynchronous Circuits and Systems, 2021

2020
Edge-Stream: a Stream Processing Approach for Distributed Applications on a Hierarchical Edge-computing System.
Proceedings of the 5th IEEE/ACM Symposium on Edge Computing, 2020

SaFace: Towards Scenario-aware Face Recognition via Edge Computing System.
Proceedings of the 3rd USENIX Workshop on Hot Topics in Edge Computing, 2020

Hardware-assisted Service Live Migration in Resource-limited Edge Computing Systems.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019
Accelerate service live migration in resource-limited edge computing systems.
Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, 2019

