Xiaofan Zhang

Orcid: 0000-0001-5081-3972

Affiliations:

University of Illinois Urbana-Champaign, IL, USA

According to our database¹, Xiaofan Zhang authored at least 44 papers between 2017 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2024

AutoAI2C: An Automated Hardware Generator for DNN Acceleration on Both FPGA and ASIC.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., October, 2024

Addressing Architectural Obstacles for Overlay with Stream Network Abstraction.

CoRR, 2024

TBA: Faster Large Language Model Training Using SSD-Based Activation Offloading.

Vikram Sharma Mailthody

Sitao Huang

Steven S. Lumetta

Wen-Mei W. Hwu

CoRR, 2024

New Solutions on LLM Acceleration, Optimization, and Application.

CoRR, 2024

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Invited: New Solutions on LLM Acceleration, Optimization, and Application.

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

HomeSGN: A Smarter Home with Novel Rule Mining Enabled by a Scorer-Generator GAN.

Proceedings of the 29th Asia and South Pacific Design Automation Conference, 2024

Invited Paper: Software/Hardware Co-design for LLM and Its Application for Design Verification.

Proceedings of the 29th Asia and South Pacific Design Automation Conference, 2024

2023

Augmenting Hessians with Inter-Layer Dependencies for Mixed-Precision Post-Training Quantization.

Clemens JS Schaefer

Navid Lambert-Shirzad

CoRR, 2023

Mixed Precision Post Training Quantization of Neural Networks with Sensitivity Guided Search.

Navid Lambert-Shirzad

CoRR, 2023

2022

Algorithm/Accelerator Co-Design and Co-Search for Edge AI.

IEEE Trans. Circuits Syst. II Express Briefs, 2022

Exploring HW/SW Co-Design for Video Analysis on CPU-FPGA Heterogeneous Systems.

Volodymyr V. Kindratenko

Deming Chen

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Efficient Machine Learning, Compilers, and Optimizations for Embedded Systems.

CoRR, 2022

AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models.

CoRR, 2022

YouHome System and Dataset: Making Your Home Know You Better.

Proceedings of the IEEE International Symposium on Smart Electronic Systems, 2022

2021

Efficient Methods for Mapping Neural Machine Translator on FPGAs.

IEEE Trans. Parallel Distributed Syst., 2021

EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search.

CoRR, 2021

Being-ahead: Benchmarking and Exploring Accelerators for Hardware-Efficient AI Deployment.

Xiaofan Zhang

Hanchen Ye

Deming Chen

CoRR, 2021

Exploring HW/SW Co-Optimizations for Accelerating Large-scale Texture Identification on Distributed GPUs.

Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

Scaling Up Hardware Accelerator Verification using A-QED with Functional Decomposition.

Saranyu Chattopadhyay

Proceedings of the Formal Methods in Computer Aided Design, 2021

F-CAD: A Framework to Explore Hardware Accelerators for Codec Avatar Decoding.

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

2020

SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems.

Proceedings of the Third Conference on Machine Learning and Systems, 2020

DNNExplorer: A Framework for Modeling and Exploring a Novel Paradigm of FPGA-based DNN Accelerator.

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

Effective Algorithm-Accelerator Co-design for AI Solutions on Edge Devices.

Proceedings of the GLSVLSI '20: Great Lakes Symposium on VLSI 2020, 2020

AutoDNNchip: An Automated DNN Chip Predictor and Builder for Both FPGAs and ASICs.

Proceedings of the FPGA '20: The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2020

HybridDNN: A Framework for High-Performance Hybrid DNN Accelerator Design and Implementation.

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

A-QED Verification of Hardware Accelerators.

Eshan Singh

Florian Lonsing

Saranyu Chattopadhyay

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions.

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019

SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection.

CoRR, 2019

A Bi-Directional Co-Design Approach to Enable Deep Learning on IoT Devices.

CoRR, 2019

SiamVGG: Visual Tracking using Deeper Siamese Networks.

Yuhong Li

Xiaofan Zhang

CoRR, 2019

T-DLA: An Open-source Deep Learning Accelerator for Ternarized DNN Models on Embedded FPGA.

Proceedings of the 2019 IEEE Computer Society Annual Symposium on VLSI, 2019

µL2Q: An Ultra-Low Loss Quantization Method for DNN Compression.

Proceedings of the International Joint Conference on Neural Networks, 2019

Cloud-DNN: An Open Framework for Mapping DNN Models to Cloud FPGAs.

Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge.

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

Implementing neural machine translation with bi-directional GRU and attention mechanism on FPGAs using HLS.

Proceedings of the 24th Asia and South Pacific Design Automation Conference, 2019

2018

DNNBuilder: an automated tool for building high-performance DNN hardware accelerators for FPGAs.

Proceedings of the International Conference on Computer-Aided Design, 2018

Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs.

Proceedings of the 2018 on Great Lakes Symposium on VLSI, 2018

Design Flow of Accelerating Hybrid Extremely Low Bit-Width Neural Network in Embedded FPGA.

Proceedings of the 28th International Conference on Field Programmable Logic and Applications, 2018

AccDNN: An IP-Based DNN Generator for FPGAs.

Proceedings of the 26th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2018

CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes.

Yuhong Li

Xiaofan Zhang

Deming Chen

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Machine learning on FPGAs to face the IoT revolution.

Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017

An energy efficient approach for C4.5 algorithm using OpenCL design flow.

Hai Peng

Xiaofan Zhang

Letian Huang

Proceedings of the International Conference on Field Programmable Technology, 2017

High-performance video content recognition with long-term recurrent convolutional network for FPGA.

Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017
