Wenqi Lou

Orcid: 0000-0002-2240-6672

According to our database1, Wenqi Lou authored at least 16 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
FlexBCM: Hybrid Block-Circulant Neural Network and Accelerator Co-Search on FPGAs.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2024

Unleashing Network/Accelerator Co-Exploration Potential on FPGAs: A Deeper Joint Search.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., October, 2024

MFNAS: Multi-fidelity Exploration in Neural Architecture Search with Stable Zero-Shot Proxy.
Proceedings of the PRICAI 2024: Trends in Artificial Intelligence, 2024

Fine-Grained Shared Cache Interference Analysis Using Basic Block's Execution Time.
Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

UniCoMo: A Unified Learning-Based Cost Model for Tensorized Program Tuning.
Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

AutoSparse: A Source-to-Source Format and Schedule Auto- Tuning Framework for Sparse Tensor Program.
Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

Enhancing Long Sequence Input Processing in FPGA-Based Transformer Accelerators through Attention Fusion.
Proceedings of the Great Lakes Symposium on VLSI 2024, 2024

Beyond Training: A Zero-Shot Framework to Neural Architecture and Accelerator Co-Exploration.
Proceedings of the IEEE International Conference on Cluster Computing, 2024

2023
hAP: A Spatial-von Neumann Heterogeneous Automata Processor with Optimized Resource and IO Overhead on FPGA.
Proceedings of the 2023 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2023

NAF: Deeper Network/Accelerator Co-Exploration for Customizing CNNs on FPGA.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

2022
OctCNN: A High Throughput FPGA Accelerator for CNNs Using Octave Convolution Algorithm.
IEEE Trans. Computers, 2022

TCL-Net: A Lightweight and Efficient Dehazing Network with Frequency-Domain Fusion and Multi-Angle Attention.
Proceedings of the Computer Vision - ACCV 2024, 2022

2021
Neural Network Instruction Set Extension and Code Mapping Mechanism.
Int. J. Softw. Informatics, 2021

2020
OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019
RV-CNN: Flexible and Efficient Instruction Set for CNNs Based on RISC-V Processors.
Proceedings of the Advanced Parallel Processing Technologies, 2019

2017
Reconfigurable Hardware Accelerators: Opportunities, Trends, and Challenges.
CoRR, 2017


  Loading...