2024
Scalable, Programmable and Dense: The HammerBlade Open-Source RISC-V Manycore.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Polynormer: Polynomial-Expressive Graph Transformer in Linear Time.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Comprehensive Evaluation of FPGA-Based Spatial Acceleration of LLMs.
Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2024

Less is More: Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

2023
Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference.
CoRR, 2023

Comprehensive Benchmarking of Binary Neural Networks on NVM Crossbar Architectures.
CoRR, 2023