Libo Huang
ORCID: 0000-0002-8307-6742
According to our database, Libo Huang authored at least 133 papers between 2007 and 2025.
Bibliography
2025
Frontiers Comput. Sci., January, 2025
2024
A Low-Cost Floating-Point FMA Unit Supporting Package Operations for HPC-AI Applications.
IEEE Trans. Circuits Syst. II Express Briefs, July, 2024
CCF Trans. High Perform. Comput., June, 2024
ACM Trans. Design Autom. Electr. Syst., May, 2024
IEEE Trans. Biomed. Eng., May, 2024
ACM Trans. Archit. Code Optim., March, 2024
MPRTA: An Efficient Multilevel Parallel Mobile Accelerator for High-Performance Ray Tracing.
IEEE Trans. Very Large Scale Integr. Syst., February, 2024
A Low-Cost Floating-Point Dot-Product-Dual-Accumulate Architecture for HPC-Enabled AI.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., February, 2024
IEEE Trans. Intell. Veh., January, 2024
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2024
Proceedings of the International Joint Conference on Neural Networks, 2024
KFC: Knowledge Reconstruction and Feedback Consolidation Enable Efficient and Effective Continual Generative Learning.
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024
Proceedings of the Great Lakes Symposium on VLSI 2024, 2024
ImSPU: Implicit Sharing of Computation Resources Between Vector and Scalar Processing Units.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024
Class-wise Image Mixture Guided Self-Knowledge Distillation for Image Classification.
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024
Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024
Out-of-Order and Recursive RAS: A Return Address Stack Design on High Performance Processor.
Proceedings of the 35th IEEE International Conference on Application-specific Systems, 2024
eTag: Class-Incremental Learning via Embedding Distillation and Task-Oriented Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Vis. Comput. Graph., December, 2023
J. Circuits Syst. Comput., July, 2023
IEEE Trans. Neural Networks Learn. Syst., May, 2023
IEEE Trans. Very Large Scale Integr. Syst., February, 2023
RCFusion: Fusing 4-D Radar and Camera With Bird's-Eye View Features for 3-D Object Detection.
IEEE Trans. Instrum. Meas., 2023
Tracking of Multiple Static and Dynamic Targets for 4D Automotive Millimeter-Wave Radar Point Cloud in Urban Environments.
Remote. Sens., 2023
CoRR, 2023
eTag: Class-Incremental Learning with Embedding Distillation and Task-Oriented Generation.
CoRR, 2023
Proceedings of the 41st IEEE International Conference on Computer Design, 2023
Proceedings of the Great Lakes Symposium on VLSI 2023, 2023
Proceedings of the Great Lakes Symposium on VLSI 2023, 2023
Proceedings of the Great Lakes Symposium on VLSI 2023, 2023
A Multi-level Parallel Integer/Floating-Point Arithmetic Architecture for Deep Learning Instructions.
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023
2022
A fast unsmoothed aggregation algebraic multigrid framework for the large-scale simulation of incompressible flow.
ACM Trans. Graph., 2022
Multi-Lane Detection and Tracking Using Temporal-Spatial Model and Particle Filtering.
IEEE Trans. Intell. Transp. Syst., 2022
J. Comput. Sci. Technol., 2022
SADD: A Novel Systolic Array Accelerator with Dynamic Dataflow for Sparse GEMM in Deep Learning.
Proceedings of the Network and Parallel Computing, 2022
Proceedings of the Network and Parallel Computing, 2022
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022
MMTP: Multi-Modal Trajectory Prediction with Interaction Attention and Adaptive Task Weighting.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022
Efficient Multiple-Precision and Mixed-Precision Floating-Point Fused Multiply-Accumulate Unit for HPC and AI Applications.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2022
Proceedings of the Algorithms and Architectures for Parallel Processing, 2022
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022
2021
ACM Trans. Archit. Code Optim., 2021
Dynamic Hand Gesture Recognition in In-Vehicle Environment Based on FMCW Radar and Transformer.
Sensors, 2021
Fast and Accurate Lane Detection via Graph Structure and Disentangled Representation Learning.
Sensors, 2021
Sensors, 2021
Sensors, 2021
CoRR, 2021
Proceedings of the IEEE International Conference on Robotics and Automation, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the CONF-CDS 2021: The 2nd International Conference on Computing and Data Science, 2021
2020
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020
DancerFly: An Order-Aware Network-on-Chip Router On-the-Fly Mitigating Multi-path Packet Reordering.
Int. J. Parallel Program., 2020
Coordinated Page Prefetch and Eviction for Memory Oversubscription Management in GPUs.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020
2019
IEEE Trans. Parallel Distributed Syst., 2019
Efficient architectural exploration of TAGE branch predictor for embedded processors.
Microelectron. J., 2019
SIMD stealing: Architectural support for efficient data parallel execution on multicores.
Microprocess. Microsystems, 2019
MT-DMA: A DMA Controller Supporting Efficient Matrix Transposition for Digital Signal Processing.
IEEE Access, 2019
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2019
An Efficient Direct Memory Access (DMA) Controller for Scientific Computing Accelerators.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019
Improving the DRAM Access Efficiency for Matrix Multiplication on Multicore Accelerators.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019
Proceedings of the ACM Turing Celebration Conference - China, 2019
2018
Frontiers Inf. Technol. Electron. Eng., 2018
J. Circuits Syst. Comput., 2018
Innov. Syst. Softw. Eng., 2018
Int. J. Parallel Program., 2018
IEEE Access, 2018
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2018
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2018
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2018
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018
Proceedings of the Algorithms and Architectures for Parallel Processing, 2018
Proceedings of the IEEE Frontiers in Education Conference, 2018
Proceedings of the 15th ACM International Conference on Computing Frontiers, 2018
HASS: High Accuracy Spike Sorting with Wavelet Package Decomposition and Mutual Information.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018
2017
ACM Trans. Archit. Code Optim., 2017
Proceedings of the Verification and Evaluation of Computer and Communication Systems, 2017
Proceedings of the 18th Annual Conference on Information Technology Education and the 6th Annual Conference on Research in Information Technology, 2017
Proceedings of the Network and Parallel Computing, 2017
SimpleBP: A Lightweight Branch Prediction Simulator for Effective Design Exploration.
Proceedings of the 2017 International Conference on Networking, Architecture, and Storage, 2017
Proceedings of the 2017 International Conference on Networking, Architecture, and Storage, 2017
Proceedings of the 6th International Conference on Modern Circuits and Systems Technologies, 2017
Unleashing the power of GPU for physically-based rendering via dynamic ray shuffling.
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017
Proceedings of the 2017 IEEE International Conference on Computer Design, 2017
Proceedings of the 2017 International Conference on Advances in Computing, 2017
Proceedings of the Great Lakes Symposium on VLSI 2017, 2017
Proceedings of the Computing Frontiers Conference, 2017
Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017
2016
Proceedings of the Computer Engineering and Technology - 20th CCF Conference, 2016
2015
Proceedings of the 10th International Design & Test Symposium, 2015
Proceedings of the Algorithms and Architectures for Parallel Processing, 2015
Fast FPGA system for microarchitecture optimization on synthesizable modern processor design.
Proceedings of the 25th International Conference on Field Programmable Logic and Applications, 2015
2014
Integrated Coherence Prediction: Towards Efficient Cache Coherence on NoC-Based Multicore Architectures.
ACM Trans. Design Autom. Electr. Syst., 2014
IEEE Trans. Computers, 2014
Comput. J., 2014
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014
2013
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2013
Adaptive communication mechanism for accelerating MPI functions in NoC-based multicore processors.
ACM Trans. Archit. Code Optim., 2013
Efficient multimedia coprocessor with enhanced SIMD engines for exploiting ILP and DLP.
Parallel Comput., 2013
Microprocess. Microsystems, 2013
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013
2012
IEEE Trans. Computers, 2012
Proceedings of the Great Lakes Symposium on VLSI 2012, 2012
Proceedings of the 23rd IEEE International Conference on Application-Specific Systems, 2012
2011
Proceedings of the Design, Automation and Test in Europe, 2011
2010
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010
Proceedings of the Algorithms and Architectures for Parallel Processing, 2010
Proceedings of the 16th International Conference on High-Performance Computer Architecture (HPCA-16 2010), 2010
2009
IET Comput. Digit. Tech., 2009
Implementation of OpenVG Path and Paint Algorithms on Synchronous Data Triggered Architecture with Optimization.
Proceedings of the International Conference on Networking, Architecture, and Storage, 2009
2008
Proceedings of the 2008 ACM Symposium on Applied Computing (SAC), 2008
A New CORDIC Algorithm and Software Implementation Based on Synchronized Data Triggering Architecture.
Proceedings of the 2008 International Conference on Multimedia and Ubiquitous Engineering (MUE 2008), 2008
Customizing computation accelerators for extensible multi-issue processors with effective optimization techniques.
Proceedings of the 45th Design Automation Conference, 2008
Proceedings of the Second International Conference on Complex, 2008
2007
Proceedings of the 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE 2007), 2007
A New Architecture For Multiple-Precision Floating-Point Multiply-Add Fused Unit Design.
Proceedings of the 18th IEEE Symposium on Computer Arithmetic (ARITH-18 2007), 2007