DFU-E: A Dataflow Architecture for Edge DSP and AI Applications.
IEEE Trans. Parallel Distributed Syst., June, 2025
Multilayer Dataflow: Orchestrate Butterfly Sparsity to Accelerate Attention Computation.
CoRR, 2024
A Comprehensive Survey on GNN Characterization.
CoRR, 2024
Revisiting Edge Perturbation for Graph Neural Network in Graph Data Augmentation and Attack.
CoRR, 2024
Accelerating Mini-batch HGNN Training by Reducing CUDA Kernels.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2024
Disttack: Graph Adversarial Attacks Toward Distributed GNN Training.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024
ADE-HGNN: Accelerating HGNNs Through Attention Disparity Exploitation.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024
GDL-GNN: Applying GPU Dataloading of Large Datasets for Graph Neural Network Inference.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024
Accelerating Convolutional Neural Networks by Exploiting the Sparsity of Output Activation.
IEEE Trans. Parallel Distributed Syst., December, 2023
Characterizing and Understanding Defense Methods for GNNs on GPUs.
IEEE Comput. Archit. Lett., 2023
An efficient scheduling algorithm for dataflow architecture using loop-pipelining.
Inf. Sci., 2021
An efficient dataflow accelerator for scientific applications.
Future Gener. Comput. Syst., 2020
An Efficient Multicast Router using Shared-Buffer with Packet Merging for Dataflow Architecture.
Proceedings of the 14th IEEE/ACM International Symposium on Networks-on-Chip, 2020
Applying CNN on a scientific application accelerator based on dataflow architecture.
CCF Trans. High Perform. Comput., 2019
Accelerating CNN Algorithm with Fine-Grained Dataflow Architectures.
Proceedings of the 20th IEEE International Conference on High Performance Computing and Communications; 16th IEEE International Conference on Smart City; 4th IEEE International Conference on Data Science and Systems, 2018