Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision.
ACM Trans. Embed. Comput. Syst., January, 2025
Learning Cache Coherence Traffic for NoC Routing Design.
Proceedings of the Great Lakes Symposium on VLSI 2025, GLSVLSI 2025, New Orleans, LA, USA, 30 June 2025, 2025
Domino-Pro-Max: Toward Efficient Network Simplification and Reparameterization for Embedded Hardware Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., December, 2024
Pearls Hide Behind Linearity: Simplifying Deep Convolutional Networks for Embedded Hardware Systems via Linearity Grafting.
Proceedings of the 29th Asia and South Pacific Design Automation Conference, 2024
iMAT: Energy-Efficient In-Memory Acceleration for Ternary Neural Networks With Sparse Dot Product.
Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2023