2025
DFU-E: A Dataflow Architecture for Edge DSP and AI Applications.
IEEE Trans. Parallel Distributed Syst., June, 2025

2024
Multilayer Dataflow: Orchestrate Butterfly Sparsity to Accelerate Attention Computation.
CoRR, 2024

A Comprehensive Survey on GNN Characterization.
CoRR, 2024

Revisiting Edge Perturbation for Graph Neural Network in Graph Data Augmentation and Attack.
CoRR, 2024

Accelerating Mini-batch HGNN Training by Reducing CUDA Kernels.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2024

Disttack: Graph Adversarial Attacks Toward Distributed GNN Training.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

ADE-HGNN: Accelerating HGNNs Through Attention Disparity Exploitation.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

GDL-GNN: Applying GPU Dataloading of Large Datasets for Graph Neural Network Inference.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

2023
Accelerating Convolutional Neural Networks by Exploiting the Sparsity of Output Activation.
IEEE Trans. Parallel Distributed Syst., December, 2023

Characterizing and Understanding Defense Methods for GNNs on GPUs.
IEEE Comput. Archit. Lett., 2023

2021
An efficient scheduling algorithm for dataflow architecture using loop-pipelining.
Inf. Sci., 2021

2020
An efficient dataflow accelerator for scientific applications.
Future Gener. Comput. Syst., 2020

An Efficient Multicast Router using Shared-Buffer with Packet Merging for Dataflow Architecture.
Proceedings of the 14th IEEE/ACM International Symposium on Networks-on-Chip, 2020

2019
Applying CNN on a scientific application accelerator based on dataflow architecture.
CCF Trans. High Perform. Comput., 2019

2018
Accelerating CNN Algorithm with Fine-Grained Dataflow Architectures.
Proceedings of the 20th IEEE International Conference on High Performance Computing and Communications; 16th IEEE International Conference on Smart City; 4th IEEE International Conference on Data Science and Systems, 2018