2024
Inference-to-complete: A High-performance and Programmable Data-plane Co-processor for Neural-network-driven Traffic Analysis.
CoRR, 2024

2023
Octopus: A Heterogeneous In-network Computing Accelerator Enabling Deep Learning for network.
CoRR, 2023

2022
A low-latency LSTM accelerator using balanced sparsity based on FPGA.
Microprocess. Microsystems, March, 2022

An automatic learning rate decay strategy for stochastic gradient descent optimization methods in neural networks.
Int. J. Intell. Syst., 2022

2021
An energy-efficient convolutional neural network accelerator for speech classification based on FPGA and quantization.
CCF Trans. High Perform. Comput., 2021

A high-throughput scalable BNN accelerator with fully pipelined architecture.
CCF Trans. High Perform. Comput., 2021

COVID Edge-Net: Automated COVID-19 Lung Lesion Edge Detection in Chest CT Images.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, 2021

Compressed LSTM using balanced sparsity.
Proceedings of the Ninth International Conference on Advanced Cloud and Big Data, 2021

RFC-HyPGCN: A Runtime Sparse Feature Compress Accelerator for Skeleton-Based GCNs Action Recognition Model with Hybrid Pruning.
Proceedings of the 32nd IEEE International Conference on Application-specific Systems, 2021