Zhen Zheng
Orcid: 0009-0006-2692-713X
According to our database1,
Zhen Zheng
authored at least 48 papers
between 2006 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Research on Pedestrian and Cyclist Classification Method Based on Micro-Doppler Effect.
Sensors, October, 2024
Fast Robust Point Cloud Registration Based on Compatibility Graph and Accelerated Guided Sampling.
Remote. Sens., August, 2024
Multi-task aided face recognition network with convolution kernel spatial collaboration.
Signal Image Video Process., June, 2024
Classification of inland lake water quality levels based on Sentinel-2 images using convolutional neural networks and spatiotemporal variation and driving factors of algal bloom.
Ecol. Informatics, 2024
MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design.
CoRR, 2024
BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batching.
CoRR, 2024
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design.
CoRR, 2024
Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
RecFlex: Enabling Feature Heterogeneity-Aware Optimization for Deep Recommendation Models with Flexible Schedules.
Proceedings of the International Conference for High Performance Computing, 2024
MonoNN: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
Proceedings of the Nineteenth European Conference on Computer Systems, 2024
2023
Magnetic Anomaly Detection Based on a Compound Tri-Stable Stochastic Resonance System.
Sensors, November, 2023
Expanding the Edge: Enabling Efficient Winograd CNN Inference With Deep Reuse on Edge Device.
IEEE Trans. Knowl. Data Eng., October, 2023
BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach.
Proc. ACM Manag. Data, September, 2023
Flash-LLM: Enabling Low-Cost and Highly-Efficient Large Generative Model Inference With Unstructured Sparsity.
Proc. VLDB Endow., 2023
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks.
CoRR, 2023
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity.
CoRR, 2023
Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform.
CoRR, 2023
RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023
2022
IEEE Trans. Parallel Distributed Syst., 2022
Single-Stage Adaptive Multi-Scale Point Cloud Noise Filtering Algorithm Based on Feature Information.
Remote. Sens., 2022
Research on localisation algorithm of large irregular workpiece for industrial robot.
Int. J. Comput. Sci. Math., 2022
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022
Proceedings of the 2022 USENIX Annual Technical Conference, 2022
AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022
2021
Int. J. Wirel. Mob. Comput., 2021
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021
Proceedings of the EuroMLSys@EuroSys 2021, 2021
2020
CoRR, 2020
Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads.
CoRR, 2020
IEEE Access, 2020
Proceedings of the CoNEXT '20: The 16th International Conference on emerging Networking EXperiments and Technologies, 2020
GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020
2019
J. Intell. Fuzzy Syst., 2019
Adaptive Edge Detection Algorithm Based on Grey Entropy Theory and Textural Features.
IEEE Access, 2019
HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline Computations.
Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019
2018
Proceedings of the IEEE International Conference on Information and Automation, 2018
The Influence of Cambered Optical Fairing on the Light Beam of Underwater Laser Fuze.
Proceedings of the IEEE International Conference on Information and Automation, 2018
Optimization design of multi-sided reflecting prism in laser line scanning imaging system.
Proceedings of the IEEE International Conference on Information and Automation, 2018
2017
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017
2016
Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer.
Proceedings of the International Conference for High Performance Computing, 2016
2014
Holistic Modeling and Performance Evaluation for Converged Network-Cloud Service Provisioning.
Proceedings of the 28th IEEE International Conference on Advanced Information Networking and Applications, 2014
2013
Proceedings of the 9th International Conference on Information, 2013
2012
Proceedings of the Advances in Neural Networks - ISNN 2012, 2012
Proceedings of 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics, 2012
2006
Illumination Variation in Images in Independent Component Analysis and Principal Component Analysis Subspaces.
Proceedings of the Sixth International Conference on Intelligent Systems Design and Applications (ISDA 2006), 2006