Zhen Dong

Orcid: 0000-0002-5951-6170

Affiliations:
  • Peking University, Institute of Microelectronics, Beijing, China


According to our database, Zhen Dong authored at least 51 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
End-to-end codesign of Hessian-aware quantized neural networks for FPGAs.
ACM Trans. Reconfigurable Technol. Syst., September, 2024

Corrigendum: Applications and techniques for fast machine learning in science.
Frontiers Big Data, 2024

Stochastic Communication Avoidance for Recommendation Systems.
CoRR, 2024

DQRM: Deep Quantized Recommendation Models.
CoRR, 2024

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis.
CoRR, 2024

Interpolating Video-LLMs: Toward Longer-sequence LMMs in a Training-free Manner.
CoRR, 2024

K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences.
CoRR, 2024

Fisher-aware Quantization for DETR Detectors with Critical-category Objectives.
CoRR, 2024

LLM Inference Unveiled: Survey and Roofline Model Insights.
CoRR, 2024

Magic-Me: Identity-Specific Video Customized Diffusion.
CoRR, 2024

Integrating View Conditions for Image Synthesis.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

SqueezeLLM: Dense-and-Sparse Quantization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

PB-LLM: Partially Binarized Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

EPIM: Efficient Processing-In-Memory Accelerators based on Epitome.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Efficient Deweather Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation.
CoRR, 2023

CVPR 2023 Text Guided Video Editing Competition.
CoRR, 2023

Integrating View Conditions for Image Synthesis.
CoRR, 2023

QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources.
CoRR, 2023

PB-LLM: Partially Binarized Large Language Models.
CoRR, 2023

End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs.
CoRR, 2023

QD-BEV: Quantization-aware View-guided Distillation for Multi-view 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Q-Diffusion: Quantizing Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CSQ: Growing Mixed-Precision Quantization Scheme with Bi-level Continuous Sparsification.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Applications and Techniques for Fast Machine Learning in Science.
Frontiers Big Data, 2022

Analysis of Quantization on MLP-based Vision Models.
CoRR, 2022

Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data.
CoRR, 2022

UnrealNAS: Can We Search Neural Architectures with Unreal Data?
CoRR, 2022

Hessian-Aware Pruning and Optimal Neural Implant.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021
Applications and Techniques for Fast Machine Learning in Science.
CoRR, 2021

A Survey of Quantization Methods for Efficient Neural Network Inference.
CoRR, 2021

Hessian-Aware Pruning and Optimal Neural Implant.
CoRR, 2021

HAWQ-V3: Dyadic Neural Network Quantization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Cross-Domain Sentiment Classification with Contrastive Learning and Mutual Information Maximization.
Proceedings of the IEEE International Conference on Acoustics, 2021

CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAs.
Proceedings of the FPGA '21: The 2021 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, Virtual Event, USA, February 28, 2021

HAO: Hardware-aware Neural Architecture Optimization for Efficient Inference.
Proceedings of the 29th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2021

2020
Cross-Domain Sentiment Classification with In-Domain Contrastive Learning.
CoRR, 2020

HAWQV3: Dyadic Neural Network Quantization.
CoRR, 2020

CoDeNet: Algorithm-hardware Co-design for Deformable Convolution.
CoRR, 2020

HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ZeroQ: A Novel Zero Shot Quantization Framework.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Novel Convolution Computing Paradigm Based on NOR Flash Array With High Computing Speed and Energy Efficiency.
IEEE Trans. Circuits Syst. I Regul. Pap., 2019

HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks.
CoRR, 2019

Algorithm-hardware Co-design for Deformable Convolution.
Proceedings of the Fifth Workshop on Energy Efficient Machine Learning and Cognitive Computing, 2019

HAWQ: Hessian AWare Quantization of Neural Networks With Mixed-Precision.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

