Inference-to-complete: A High-performance and Programmable Data-plane Co-processor for Neural-network-driven Traffic Analysis.
CoRR, 2024
Octopus: A Heterogeneous In-network Computing Accelerator Enabling Deep Learning for network.
CoRR, 2023
A low-latency LSTM accelerator using balanced sparsity based on FPGA.
Microprocess. Microsystems, March 2022
An automatic learning rate decay strategy for stochastic gradient descent optimization methods in neural networks.
Int. J. Intell. Syst., 2022
An energy-efficient convolutional neural network accelerator for speech classification based on FPGA and quantization.
CCF Trans. High Perform. Comput., 2021
A high-throughput scalable BNN accelerator with fully pipelined architecture.
CCF Trans. High Perform. Comput., 2021
COVID Edge-Net: Automated COVID-19 Lung Lesion Edge Detection in Chest CT Images.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, 2021
Compressed LSTM using balanced sparsity.
Proceedings of the Ninth International Conference on Advanced Cloud and Big Data, 2021
RFC-HyPGCN: A Runtime Sparse Feature Compress Accelerator for Skeleton-Based GCNs Action Recognition Model with Hybrid Pruning.
Proceedings of the 32nd IEEE International Conference on Application-specific Systems, 2021