Brucek Khailany

CoRR, 2024

Revisiting VerilogEval: Newer LLMs, In-Context Learning, and Specification-to-RTL Tasks.

CoRR, 2024

VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool.

Chia-Tung Ho

CoRR, 2024

PyHDL-Eval: An LLM Evaluation Framework for Hardware Design Using Python-Embedded DSLs.

Christopher Batten

Mingjie Liu

Proceedings of the 2024 ACM/IEEE International Symposium on Machine Learning for CAD, 2024

2023

Machine Learning and Algorithms: Let Us Team Up for EDA.

IEEE Des. Test, February, 2023

A 95.6-TOPS/W Deep Learning Inference Accelerator With Per-Vector Scaled 4-bit Quantization in 5 nm.

IEEE J. Solid State Circuits, 2023

ChipNeMo: Domain-Adapted LLMs for Chip Design.

CoRR, 2023

VerilogEval: Evaluating Large Language Models for Verilog Code Generation.

Mingjie Liu

CoRR, 2023

NVCell 2: Routability-Driven Standard Cell Layout in Advanced Nodes with Lattice Graph Routability Model.

Proceedings of the 2023 International Symposium on Physical Design, 2023

AutoDMP: Automated DREAMPlace-based Macro Placement.

Proceedings of the 2023 International Symposium on Physical Design, 2023

An Adversarial Active Sampling-Based Data Augmentation Framework for AI-Assisted Lithography Modeling.

Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

Invited Paper: VerilogEval: Evaluating Large Language Models for Verilog Code Generation.

Mingjie Liu

Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

GenFuzz: GPU-accelerated Hardware Fuzzing using Genetic Algorithm with Multiple Inputs.

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Late Breaking Results: Test Selection For RTL Coverage By Unsupervised Learning From Fast Functional Simulation.

Rongjian Liang

Yuji Chai

Haoxin Ren

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

Efficient Transformer Inference with Statically Structured Sparse Attention.

Hasan Genc

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022

LNS-Madam: Low-Precision Training in Logarithmic Number System Using Multiplicative Weight Update.

Jiawei Zhao

IEEE Trans. Computers, 2022

HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression.

CoRR, 2022

An Adversarial Active Sampling-based Data Augmentation Framework for Manufacturable Chip Design.

Saumyadip Mukhopadhyay

CoRR, 2022

Large Scale Mask Optimization Via Convolutional Fourier Neural Operator and Litho-Guided Self Training.

Haoyu Yang

Zongyi Li

Kumara Sastry

Saumyadip Mukhopadhyay

CoRR, 2022

A 17-95.6 TOPS/W Deep Learning Inference Accelerator with Per-Vector Scaled 4-bit Quantization for Transformers in 5nm.

Proceedings of the IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits 2022), 2022

XT-PRAGGMA: Crosstalk Pessimism Reduction Achieved with GPU Gate-level Simulations and Machine Learning.

Proceedings of the 2022 ACM/IEEE Workshop on Machine Learning for CAD, 2022

AutoCRAFT: Layout Automation for Custom Circuits in Advanced FinFET Technologies.

Proceedings of the ISPD 2022: International Symposium on Physical Design, Virtual Event, Canada, March 27, 2022

From RTL to CUDA: A GPU Acceleration Flow for RTL Simulation with Batch Stimulus.

Proceedings of the 51st International Conference on Parallel Processing, 2022

Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training.

Charbel Sakr

Brian Zimmer

William J. Dally

Proceedings of the International Conference on Machine Learning, 2022

TransSizer: A Novel Transformer-Based Fast Gate Sizer.

Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022

GATSPI: GPU accelerated gate-level simulation for power improvement.

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Generic lithography modeling with dual-band optics-inspired neural networks.

Haoyu Yang

Zongyi Li

Kumara Sastry

Saumyadip Mukhopadhyay

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Generative self-supervised learning for gate sizing: invited.

Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

2021

DREAMPlace: Deep Learning Toolkit-Enabled GPU Acceleration for Modern VLSI Placement.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2021

NVCell: Standard Cell Layout in Advanced Technology Nodes with Reinforcement Learning.

Matthew Fojtik

CoRR, 2021

Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update.

Jiawei Zhao

CoRR, 2021

Verifying High-Level Latency-Insensitive Designs with Formal Model Checking.

Alicia Klinefelter

CoRR, 2021

VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference.

CoRR, 2021

Simba: scaling deep-learning inference with chiplet-based architecture.

Yakun Sophia Shao

Jason Clemons

Commun. ACM, 2021

VS-Quant: Per-vector Scaled Quantization for Accurate Low-Precision Neural Network Inference.

Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

3.2 The A100 Datacenter GPU and Ampere Architecture.

Proceedings of the IEEE International Solid-State Circuits Conference, 2021

Optimizing VLSI Implementation with Reinforcement Learning - ICCAD Special Session Paper.

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

IPA: Floorplan-Aware SystemC Interconnect Performance Modeling and Generation for HLS-based SoCs.

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

2021 ICCAD CAD Contest Problem C: GPU Accelerated Logic Rewriting.

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2021

Parasitic-Aware Analog Circuit Sizing with Graph Neural Networks and Bayesian Optimization.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

MAVIREC: ML-Aided Vectored IR-Drop Estimation and Classification.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

Softermax: Hardware/Software Co-Design of an Efficient Softmax for Transformers.

Jacob R. Stevens

Anand Raghunathan

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

2020

ABCDPlace: Accelerated Batch-Based Concurrent Detailed Placement on Multithreaded CPUs and GPUs.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Accelerating Chip Design With Machine Learning.

Yanqing Zhang

Bryan Catanzaro

William J. Dally

IEEE Micro, 2020

A 0.32-128 TOPS, Scalable Multi-Chip-Module-Based Deep Neural Network Inference Accelerator With Ground-Referenced Signaling in 16 nm.

Brian Zimmer

IEEE J. Solid State Circuits, 2020

MAVIREC: ML-Aided Vectored IR-Drop Estimation and Classification.

CoRR, 2020

Accelerating Chip Design with Machine Learning.

Proceedings of the MLCAD '20: 2020 ACM/IEEE Workshop on Machine Learning for CAD, 2020

Problem C: GPU Accelerated Logic Re-simulation (Invited Talk).

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

Opportunities for RTL and Gate Level Simulation using GPUs (Invited Talk).

Yanqing Zhang

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

GRANNITE: Graph Neural Network Inference for Transferable Power Estimation.

Yanqing Zhang

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

PowerNet: Transferable Dynamic IR Drop Estimation via Maximum Convolutional Neural Network.

Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

FIST: A Feature-Importance Sampling and Tree-Based Method for Automatic Design Flow Parameter Tuning.

Erick Carvajal Barboza

Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

2019

A 0.11 pJ/Op, 0.32-128 TOPS, Scalable Multi-Chip-Module-based Deep Neural Network Accelerator with Ground-Reference Signaling in 16nm.

Brian Zimmer

Proceedings of the 2019 Symposium on VLSI Circuits, Kyoto, Japan, June 9-14, 2019, 2019

Simba: Scaling Deep-Learning Inference with Multi-Chip-Module-Based Architecture.

Yakun Sophia Shao

Jason Clemons

Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Timeloop: A Systematic Approach to DNN Accelerator Evaluation.

Stephen W. Keckler

Joel S. Emer

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2019

MAGNet: A Modular Accelerator Generator for Neural Networks.

Proceedings of the International Conference on Computer-Aided Design, 2019

A 0.11 pJ/Op, 0.32-128 TOPS, Scalable Multi-Chip-Module-Based Deep Neural Network Accelerator Designed with a High-Productivity VLSI Methodology.

Proceedings of the 2019 IEEE Hot Chips 31 Symposium (HCS), 2019

PRIMAL: Power Inference using Machine Learning.

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

Analog/Mixed-Signal Hardware Error Modeling for Deep Learning Inference.

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

High Performance Graph Convolutional Networks with Applications in Testability Analysis.

Karthikeyan Natarajan

Bei Yu

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

DREAMPlace: Deep Learning Toolkit-Enabled GPU Acceleration for Modern VLSI Placement.

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

A Fine-Grained GALS SoC with Pausible Adaptive Clocking in 16 nm FinFET.

Matthew Fojtik

Alicia Klinefelter

Proceedings of the 25th IEEE International Symposium on Asynchronous Circuits and Systems, 2019

2018

Hardware-Enabled Artificial Intelligence.

Proceedings of the 2018 IEEE Symposium on VLSI Circuits, 2018

A modular digital VLSI flow for high-productivity SoC design.

Evgeni Khmer

Proceedings of the 55th Annual Design Automation Conference, 2018

2017

SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks.

Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

2016

A real-time energy-efficient superpixel hardware accelerator for mobile computer vision applications.

Injoon Hong

Jason Clemons

Iuri Frosio

Stephen W. Keckler

Proceedings of the 53rd Annual Design Automation Conference, 2016

Modeling and Analysis of Power Supply Noise Tolerance with Fine-Grained GALS Adaptive Clocks.

Divya Akella Kamakshi

Proceedings of the 22nd IEEE International Symposium on Asynchronous Circuits and Systems, 2016

2015

A Pausible Bisynchronous FIFO for GALS Systems.

Matthew Fojtik

Proceedings of the 21st IEEE International Symposium on Asynchronous Circuits and Systems, 2015

2013

GPU design in a power-limited era.

Proceedings of the 2013 IEEE International Conference on Microelectronic Systems Education, 2013

2012

Unifying Primary Cache, Scratch, and Register File Memories in a Throughput Processor.

Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012

2011

GPUs and the Future of Parallel Computing.

IEEE Micro, 2011

CudaDMA: optimizing GPU memory bandwidth via warp specialization.

Michael Bauer

Henry Cook