Jungwook Choi

Wei Wang

Moriyoshi Ohara

Proceedings of the IEEE Symposium in Low-Power and High-Speed Chips, 2019

DLFloat: A 16-b Floating Point Format Designed for Deep Learning Training and Inference.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE Symposium on Computer Arithmetic, 2019

Approximate Computing Techniques for Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Approximate Circuits, Methodologies and CAD., 2019

2018

Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN).

[BibT_eX]

[DOI]

Pierce I-Jen Chuang

Zhuo Wang

CoRR, 2018

PACT: Parameterized Clipping Activation for Quantized Neural Networks.

[BibT_eX]

[DOI]

Zhuo Wang

Pierce I-Jen Chuang

CoRR, 2018

A Scalable Multi- TeraOPS Deep Learning Processor Core for AI Trainina and Inference.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Symposium on VLSI Circuits, 2018

Training Deep Neural Networks with 8-bit Floating Point Numbers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Taming the beast: Programming Peta-FLOP class Deep Learning Systems.

[BibT_eX]

[DOI]

Leland Chang

Proceedings of the International Symposium on Low Power Electronics and Design, 2018

Across the Stack Opportunities for Deep Learning Acceleration.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Low Power Electronics and Design, 2018

PROMISE: An End-to-End Design of a Programmable Mixed-Signal Accelerator for Machine-Learning Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

True Gradient-Based Training of Deep Binary Activated Neural Networks Via Continuous Binarization.

[BibT_eX]

[DOI]

Charbel Sakr

Zhuo Wang

Naresh R. Shanbhag

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Exploiting approximate computing for deep learning acceleration.

[BibT_eX]

[DOI]

Chia-Yu Chen

Viji Srinivasan

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

Compensated-DNN: energy efficient low-precision deep neural networks by compensating quantization errors.

[BibT_eX]

[DOI]

Shubham Jain

Pierce Chuang

Leland Chang

Proceedings of the 55th Annual Design Automation Conference, 2018

AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Toward a pixel-parallel architecture for graph cuts inference on FPGA.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017

Accelerator Design for Deep Learning Training: Extended Abstract: Invited.

[BibT_eX]

[DOI]

Ankur Agrawal

Chia-Yu Chen

Jinwook Oh

Sunil Shukla

Viji Srinivasan

Wei Zhang

Proceedings of the 54th Annual Design Automation Conference, 2017

POSTER: Design Space Exploration for Performance Optimization of Deep Neural Networks on Shared Memory Accelerators.

[BibT_eX]

[DOI]

Leland Chang

Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017

2016

Error Resilient and Energy Efficient MRF Message-Passing-Based Stereo Matching.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2016

Video-Rate Stereo Matching Using Markov Random Field TRW-S Inference on a Hybrid CPU+FPGA Computing Platform.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2016

Energy-Efficient Simultaneous Localization and Mapping via Compounded Approximate Computing.

[BibT_eX]

[DOI]

Jinwook Oh

Guilherme C. Januario

Proceedings of the 2016 IEEE International Workshop on Signal Processing Systems, 2016

Approximate computing: Challenges and opportunities.

[BibT_eX]

[DOI]

Ankur Agrawal

Zehra Sura

Proceedings of the IEEE International Conference on Rebooting Computing, 2016

Analysis of error resiliency of belief propagation in computer vision.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Configurable and scalable belief propagation accelerator for computer vision.

[BibT_eX]

[DOI]

Proceedings of the 26th International Conference on Field Programmable Logic and Applications, 2016

2015

High performance and error resilient probabilistic inference system for machine learning

[BibT_eX]

[DOI]

PhD thesis, 2015

Transmission Power Control with the Guaranteed Communication Reliability in WSN.

[BibT_eX]

[DOI]

Int. J. Distributed Sens. Networks, 2015

Fast hierarchical implementation of sequential tree-reweighted belief propagation for probabilistic inference.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Field Programmable Logic and Applications, 2015

2014

A robust message passing based stereo matching kernel via system-level error resiliency.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Error resilient MRF message passing architecture for stereo matching.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Signal Processing Systems, 2013

FPGA acceleration of Markov Random Field TRW-S inference for stereo matching.

[BibT_eX]

[DOI]

Proceedings of the 11th ACM/IEEE International Conference on Formal Methods and Models for Codesign, 2013

EMERALD: Characterization of emerging applications and algorithms for low-power devices.

[BibT_eX]

[DOI]

Vijaykrishnan Narayanan

Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2013

2012

Flexible and Expandable Speech Recognition Hardware with Weighted Finite State Transducers.

[BibT_eX]

[DOI]

Kisun You

Wonyong Sung

J. Signal Process. Syst., 2012

Deformable Carbon Nanotube-Contact Pads for Inertial Microswitch to Extend Contact Time.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Electron., 2012

Hardware implementation of MRF map inference on an FPGA platform.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Field Programmable Logic and Applications (FPL), 2012

2011

Memory Access Optimized VLSI for 5000-Word Continuous Speech Recognition.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2011

2010

A Real-Time FPGA-Based 20 000-Word Speech Recognizer With Optimized DRAM Access.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2010

Supporting handover in an IEEE 802.11p-based wireless access system.

[BibT_eX]

[DOI]

Hyukjoon Lee

Proceedings of the Seventh International Workshop on Vehicular Ad Hoc Networks, 2010

An FPGA implementation of speech recognition with weighted finite state transducers.

[BibT_eX]

[DOI]