2025
ACiS: Complex Processing in the Switch Fabric.
CoRR, January, 2025
2024
FPGA-Accelerated Range-Limited Molecular Dynamics.
IEEE Trans. Computers, June, 2024
Scrutinize What We Ignore: Reining Task Representation Shift In Context-Based Offline Meta Reinforcement Learning.
CoRR, 2024
SmartFuse: Reconfigurable Smart Switches to Accelerate Fused Collectives in HPC Applications.
Proceedings of the 38th ACM International Conference on Supercomputing, 2024
2023
A feedforward neural network based on Legendre polynomial for solving linear Fredholm integro-differential equations.
Int. J. Comput. Math., July, 2023
FASDA: An FPGA-Aided, Scalable and Distributed Accelerator for Range-Limited Molecular Dynamics.
Proceedings of the International Conference for High Performance Computing, 2023
FLASH: FPGA-Accelerated Smart Switches with GCN Case Study.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 37th International Conference on Supercomputing, 2023
Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training.
Proceedings of the 37th International Conference on Supercomputing, 2023
Flexible Functional Component for Fluidic Soft Robots.
Proceedings of the Intelligent Robotics and Applications - 16th International Conference, 2023
Design and Experimental Study of An Intelligent Soft Crawling Robot for Environmental Interactions.
Proceedings of the Intelligent Robotics and Applications - 16th International Conference, 2023
2022
Reconfigurable switches for high performance and flexible MPI collectives.
Concurr. Comput. Pract. Exp., 2022
A Parameter Optimization Method of 3D Printing Soft Materials for Soft Robots.
Proceedings of the Intelligent Robotics and Applications - 15th International Conference, 2022
Optimized Mappings for Symmetric Range-Limited Molecular Force Calculations on FPGAs.
Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022
A Framework for Neural Network Inference on FPGA-Centric SmartNICs.
Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022
H-GCN: A Graph Convolutional Network Accelerator on Versal ACAP Architecture.
Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022
FCsN: A FPGA-Centric SmartNIC Framework for Neural Networks.
Proceedings of the 30th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2022
2021
Workload Imbalance in HPC Applications: Effect on Performance of In-Network Processing.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021
A Survey: Handling Irregularities in Neural Network Acceleration with FPGAs.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021
2020
Shop The Look: Building a Large Scale Visual Shopping System at Pinterest.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020
A Reconfigurable Compute-in-the-Network FPGA Assistant for High-Level Collective Support with Distributed Matrix Multiply Case Study.
Proceedings of the International Conference on Field-Programmable Technology, 2020
FPGAs in the Network and Novel Communicator Support Accelerate MPI Collectives.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020
FP-AMG: FPGA-Based Acceleration Framework for Algebraic Multigrid Solvers.
Proceedings of the 28th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2020
2019
A Multi-Stage Model for the Electromagnetic Shielding Effectiveness Prediction of an Infinite Conductor Plane With Periodic Apertures.
IEEE Access, 2019