Xiaoming Chen

ACM Trans. Design Autom. Electr. Syst., January, 2024

PIMCOMP: An End-to-End DNN Compiler for Processing-In-Memory Accelerators.

[BibT_eX]

[DOI]

CoRR, 2024

BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices.

[BibT_eX]

[DOI]

CoRR, 2024

Older and Wiser: The Marriage of Device Aging and Intellectual Property Protection of Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2024

JANM-IK: Jacobian Argumented Nelder-Mead Algorithm for Inverse Kinematics and its Hardware Acceleration.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2024

TMiner: A Vertex-Based Task Scheduling Architecture for Graph Pattern Mining.

[BibT_eX]

[DOI]

Zerun Li

Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

Accelerating Frequency-domain Convolutional Neural Networks Inference using FPGAs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

AceMiner: Accelerating Graph Pattern Matching using PIM with Optimized Cache System.

[BibT_eX]

[DOI]

Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

MemSort: In-Memory Sorting Architecture.

[BibT_eX]

[DOI]

Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

PIMSIM-NN: An ISA-based Simulation Framework for Processing-in-Memory Accelerators.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

PIMSYN: Synthesizing Processing-in-Memory CNN Accelerators.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

Older and Wiser: The Marriage of Device Aging and Intellectual Property Protection of DNNs.

[BibT_eX]

[DOI]

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

ReCG: ReRAM-Accelerated Sparse Conjugate Gradient.

[BibT_eX]

[DOI]

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

ACES: Accelerating Sparse Matrix Multiplication with Adaptive Execution Flow and Concurrency-Aware Cache Optimizations.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

Frequency-Domain Inference Acceleration for Convolutional Neural Networks Using ReRAMs.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., December, 2023

MNSIM 2.0: A Behavior-Level Modeling Tool for Processing-In-Memory Architectures.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., November, 2023

FeCrypto: Instruction Set Architecture for Cryptographic Algorithms Based on FeFET-Based In-Memory Computing.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., September, 2023

RBDCore: Robot Rigid Body Dynamics Accelerator with Multifunctional Pipelines.

[BibT_eX]

[DOI]

CoRR, 2023

Dadu-RBD: Robot Rigid Body Dynamics Accelerator with Multifunctional Pipelines.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

Hardware-Software Co-Design for Content-Based Sparse Attention.

[BibT_eX]

[DOI]

Proceedings of the 41st IEEE International Conference on Computer Design, 2023

LIM-GEN: A Data-Guided Framework for Automated Generation of Heterogeneous Logic-in-Memory Architecture.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

Meltrix: A RRAM-Based Polymorphic Architecture Enhanced by Function Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

FSPA: An FeFET-based Sparse Matrix-Dense Vector Multiplication Accelerator.

[BibT_eX]

[DOI]

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

PIMCOMP: A Universal Compilation Framework for Crossbar-based PIM DNN Accelerators.

[BibT_eX]

[DOI]

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Accelerating Convolutional Neural Networks in Frequency Domain via Kernel-Sharing Approach.

[BibT_eX]

[DOI]

Proceedings of the 28th Asia and South Pacific Design Automation Conference, 2023

2022

Toward Efficient Computing for Robotics: From a Circuit and System View.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2022

Re-FeMAT: A Reconfigurable Multifunctional FeFET-Based Memory Architecture.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Search-Free Inference Acceleration for Sparse Convolutional Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Fast and High-Accuracy Approximate MAC Unit Design for CNN Computing.

[BibT_eX]

[DOI]

IEEE Embed. Syst. Lett., 2022

VNet: a versatile network to train real-time semantic segmentation models on a single GPU.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2022

Numerically-Stable and Highly-Scalable Parallel LU Factorization for Circuit Simulation.

[BibT_eX]

[DOI]

Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022

PME: Processing-in-memory Masking and Encoding for Secure NVM.

[BibT_eX]

[DOI]

Zhiwen Xie

Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

AIC-Bench: Workload Selection Methodology for Benchmarking AI Chips.

[BibT_eX]

[DOI]

Zhenyu Quan

P3S: A High Accuracy Probabilistic Prediction Processing System for CNN Acceleration.

[BibT_eX]

[DOI]

Proceedings of the GLSVLSI '22: Great Lakes Symposium on VLSI 2022, Irvine CA USA, June 6, 2022

Energy-Efficient In-SRAM Accumulation for CMOS-based CNN Accelerators.

[BibT_eX]

[DOI]

Wanqian Li

Proceedings of the GLSVLSI '22: Great Lakes Symposium on VLSI 2022, Irvine CA USA, June 6, 2022

GraphRing: an HMC-ring based graph processing framework with optimized data movement.

[BibT_eX]

[DOI]

Zerun Li

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

FeMIC: Multi-Operands in-Memory Computing Based on FeFETs.

[BibT_eX]

[DOI]

Proceedings of the 27th Asia and South Pacific Design Automation Conference, 2022

Optimal Data Allocation for Graph Processing in Processing-in-Memory Systems.

[BibT_eX]

[DOI]

Zerun Li

Proceedings of the 27th Asia and South Pacific Design Automation Conference, 2022

Solving Least-Squares Fitting in $O(1)$ Using RRAM-based Computing-in-Memory Technique.

[BibT_eX]

[DOI]

Proceedings of the 27th Asia and South Pacific Design Automation Conference, 2022

2021

FTT-NAS: Discovering Fault-tolerant Convolutional Neural Architecture.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2021

Chaotic Weights: A Novel Approach to Protect Intellectual Property of Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2021

Breaking the von Neumann bottleneck: architecture-level processing-in-memory technology.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2021

RRAM-based Analog In-Memory Computing : Invited Paper.

[BibT_eX]

[DOI]

Tao Song

Proceedings of the IEEE/ACM International Symposium on Nanoscale Architectures, 2021

CoPIM: A Concurrency-aware PIM Workload Offloading Architecture for Graph Applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2021

Eliminating Iterations of Iterative Methods: Solving Large-Scale Sparse Linear System in <i>O</i>(1) with RRAM-based In-Memory Accelerator.

[BibT_eX]

[DOI]

Tao Song

Proceedings of the GLSVLSI '21: Great Lakes Symposium on VLSI 2021, 2021

BRAHMS: Beyond Conventional RRAM-based Neural Network Accelerators Using Hybrid Analog Memory System.

[BibT_eX]

[DOI]

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

F3D: Accelerating 3D Convolutional Neural Networks in Frequency Space Using ReRAM.

[BibT_eX]

[DOI]

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

ChaoPIM: A PIM-based Protection Framework for DNN Accelerators Using Chaotic Encryption.

[BibT_eX]

[DOI]

Proceedings of the 30th IEEE Asian Test Symposium, 2021

FePIM: Contention-Free In-Memory Computing Based on Ferroelectric Field-Effect Transistors.

[BibT_eX]

[DOI]

Yuping Wu

Proceedings of the ASPDAC '21: 26th Asia and South Pacific Design Automation Conference, 2021

2020

Swallow: A Versatile Accelerator for Sparse Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Long Live TIME: Improving Lifetime and Security for NVM-Based Training-in-Memory Systems.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Enabling Secure NVM-Based in-Memory Neural Network Computing by Sparse Fast Gradient Encryption.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2020

The Impact of Ferroelectric FETs on Digital and Analog Circuits and Architectures.

[BibT_eX]

[DOI]

Ann Franchesca Laguna

IEEE Des. Test, 2020

FTT-NAS: Discovering Fault-Tolerant Neural Architecture.

[BibT_eX]

[DOI]

CoRR, 2020

DaDu Series - Fast and Efficient Robot Accelerators.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

Communication Lower Bound in Convolution Accelerators.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020

MNSIM 2.0: A Behavior-Level Modeling Tool for Memristor-based Neuromorphic Computing Systems.

[BibT_eX]

[DOI]

Proceedings of the GLSVLSI '20: Great Lakes Symposium on VLSI 2020, 2020

Accelerating RRT Motion Planning Using TCAM.

[BibT_eX]

[DOI]

Proceedings of the GLSVLSI '20: Great Lakes Symposium on VLSI 2020, 2020

TUPIM: A Transparent and Universal Processing-in-Memory Architecture for Unmodified Binaries.

[BibT_eX]

[DOI]

Proceedings of the GLSVLSI '20: Great Lakes Symposium on VLSI 2020, 2020

Dadu-CD: Fast and Efficient Processing-in-Memory Accelerator for Collision Detection.

[BibT_eX]

[DOI]

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

Black Box Search Space Profiling for Accelerator-Aware Neural Architecture Search.

[BibT_eX]

[DOI]

Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

An Energy-Efficient Quantized and Regularized Training Framework For Processing-In-Memory Accelerators.

[BibT_eX]

[DOI]

Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

Search-free Accelerator for Sparse Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

FTT-NAS: Discovering Fault-Tolerant Neural Architecture.

[BibT_eX]

[DOI]

Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

Soft Error Mitigation for Deep Convolution Neural Network on FPGA Accelerators.

[BibT_eX]

[DOI]

Proceedings of the 2nd IEEE International Conference on Artificial Intelligence Circuits and Systems, 2020

2019

Ferroelectric FETs-Based Nonvolatile Logic-in-Memory Circuits.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2019

moDNN: Memory Optimal Deep Neural Network Training on Graphics Processing Units.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2019

Dependable Visual Light-Based Indoor Localization with Automatic Anomaly Detection for Location-Based Service of Mobile Cyber-Physical Systems.

[BibT_eX]

[DOI]

ACM Trans. Cyber Phys. Syst., 2019

Power and Area Efficient FPGA Building Blocks Based on Ferroelectric FETs.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2019

Thread: Towards fine-grained precision reconfiguration in variable-precision neural network accelerator.

[BibT_eX]

[DOI]

IEICE Electron. Express, 2019

Accelerating DNN-based 3D point cloud processing for mobile computing.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2019

PIMSim: A Flexible and Detailed Processing-in-Memory Simulator.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2019

FeMAT: Exploring In-Memory Processing in Multifunctional FeFET-Based Memory Array.

[BibT_eX]

[DOI]

Xiaoyu Zhang

Proceedings of the 37th IEEE International Conference on Computer Design, 2019

A General Logic Synthesis Framework for Memristor-based Logic Design.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computer-Aided Design, 2019

Enabling Secure in-Memory Neural Network Computing by Sparse Fast Gradient Encryption.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computer-Aided Design, 2019

Merging Everything (ME): A Unified FPGA Architecture Based on Logic-in-Memory Techniques.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

CuckooPIM: an efficient and less-blocking coherence mechanism for processing-in-memory systems.

[BibT_eX]

[DOI]

Proceedings of the 24th Asia and South Pacific Design Automation Conference, 2019

Addressing the issue of processing element under-utilization in general-purpose systolic deep learning accelerators.

[BibT_eX]

[DOI]

Proceedings of the 24th Asia and South Pacific Design Automation Conference, 2019

2018

Hardware Trojan Detection in Third-Party Digital Intellectual Property Cores by Multilevel Feature Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018

Nonvolatile Lookup Table Design Based on Ferroelectric Field-Effect Transistors.

[BibT_eX]

[DOI]

Michael T. Niemier

Xiaobo Sharon Hu

Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Mixed size crossbar based RRAM CNN accelerator with overlapped mapping method.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computer-Aided Design, 2018

Model-based and data-driven approaches for building automation and control.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computer-Aided Design, 2018

Rescuing memristor-based computing with non-linear resistance levels.

[BibT_eX]

[DOI]

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

Design and optimization of FeFET-based crossbars for binary convolution neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

moDNN: Memory optimal DNN training on GPUs.

[BibT_eX]

[DOI]

Sandeep Krishna Thirumala

Danny Z. Chen

Xiaobo Sharon Hu

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

Computing with ferroelectric FETs: Devices, models, systems, and applications.

[BibT_eX]

[DOI]

Xunzhao Yin

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

Dadu-P: a scalable accelerator for robot motion planning in a dynamic environment.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Design Automation Conference, 2018

Long live TIME: improving lifetime for training-in-memory engines by structured gradient sparsification.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Design Automation Conference, 2018

2017

A General Framework for Hardware Trojan Detection in Digital Circuits by Statistical Learning Algorithms.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2017

A Deep Learning Approach for Blind Drift Calibration of Sensor Networks.

[BibT_eX]

[DOI]

CoRR, 2017

Low-overhead implementation of logic encryption using gate replacement techniques.

[BibT_eX]

[DOI]

Proceedings of the 18th International Symposium on Quality Electronic Design, 2017

Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs.

[BibT_eX]

[DOI]

Proceedings of the 54th Annual Design Automation Conference, 2017

2016

A Unified Methodology for Designing Hardware Random Number Generators Based on Any Probability Distribution.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2016

Modeling Random Telegraph Noise as a Randomness Source and its Application in True Random Number Generation.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2016

Virtual temperature measurement for smart buildings via Bayesian model fusion.

[BibT_eX]

[DOI]

Xin Li

Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

Overview of cyber-physical temperature estimation in smart buildings: From modeling to measurements.

[BibT_eX]

[DOI]

Xin Li

Sheldon X.-D. Tan

Proceedings of the IEEE Conference on Computer Communications Workshops, 2016

Sparsity-oriented sparse solver design for circuit simulation.

[BibT_eX]

[DOI]

Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016

2015

HS3-DPG: Hierarchical Simulation for 3-D P/G Network.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2015

GPU-Accelerated Sparse LU Factorization for Circuit Simulation with Performance Modeling.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2015

FASTrust: Feature analysis for third-party IP trust verification.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Test Conference, 2015

From Robust Chip to Smart Building: CAD Algorithms and Methodologies for Uncertainty Analysis of Building Performance.

[BibT_eX]

[DOI]

Xin Li

Sheldon X.-D. Tan

Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, 2015

A fast parallel sparse solver for SPICE-based circuit simulators.

[BibT_eX]

[DOI]

Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

A STT-RAM-based low-power hybrid register file for GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual Design Automation Conference, 2015

2014

Efficient region-aware P/G TSV planning for 3D ICs.

[BibT_eX]

[DOI]

Proceedings of the Fifteenth International Symposium on Quality Electronic Design, 2014

Run-Time Technique for Simultaneous Aging and Power Optimization in GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the 51st Annual Design Automation Conference 2014, 2014

Statistical analysis of random telegraph noise in digital circuits.

[BibT_eX]

[DOI]

Proceedings of the 19th Asia and South Pacific Design Automation Conference, 2014

2013

NICSLU: An Adaptive Sparse Matrix Solver for Parallel Circuit Simulation.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2013

Evaluation and mitigation of performance degradation under random telegraph noise for digital circuits.

[BibT_eX]

[DOI]

IET Circuits Devices Syst., 2013

Assessment of Circuit Optimization Techniques Under NBTI.

[BibT_eX]

[DOI]

IEEE Des. Test, 2013

Nonzero pattern analysis and memory access optimization in GPU-based sparse LU factorization for circuit simulation.

[BibT_eX]

[DOI]

Proceedings of the 3rd Workshop on Irregular Applications - Architectures and Algorithms, 2013

HS3DPG: Hierarchical simulation for 3D P/G network.

[BibT_eX]

[DOI]

Proceedings of the 18th Asia and South Pacific Design Automation Conference, 2013

2012

Variation-Aware Supply Voltage Assignment for Simultaneous Power and Aging Optimization.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2012

Parallel Circuit Simulation on Multi/Many-core Systems.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Sparse LU factorization for parallel circuit simulation on GPU.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual Design Automation Conference 2012, 2012

An adaptive LU factorization algorithm for parallel circuit simulation.

[BibT_eX]

[DOI]