Ardavan Pedram

Ali Shafiee Ardestani

J. Syst. Archit., 2022

Griffin: Rethinking Sparse Optimization for Deep Learning Architectures.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021

Design Space Exploration of Sparse Accelerators for Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2021

Rethinking Floating Point Overheads for Mixed Precision DNN Accelerators.

[BibT_eX]

[DOI]

Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

2020

Campfire: Compressible, Regularization-Free, Structured Sparse Training for Hardware Accelerators.

[BibT_eX]

[DOI]

CoRR, 2020

2018

Plasticine: A Reconfigurable Accelerator for Parallel Patterns.

[BibT_eX]

[DOI]

IEEE Micro, 2018

Spatial: a language and compiler for application accelerators.

[BibT_eX]

[DOI]

Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2018

2017

Dark Memory and Accelerator-Rich System Optimization in the Dark Silicon Era.

[BibT_eX]

[DOI]

IEEE Des. Test, 2017

Plasticine: A Reconfigurable Architecture For Parallel Paterns.

[BibT_eX]

[DOI]

Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

CATERPILLAR: Coarse Grain Reconfigurable Architecture for accelerating the training of Deep Neural Networks.

[BibT_eX]

[DOI]

Yuanfang Li

Proceedings of the 28th IEEE International Conference on Application-specific Systems, 2017

2016

A Systematic Approach to Blocking Convolutional Neural Networks.

[BibT_eX]

[DOI]

Jonathan Ragan-Kelley

Mark Horowitz

CoRR, 2016

Simulator calibration for accelerator-rich architecture studies.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Embedded Computer Systems: Architectures, 2016

Evaluating programmable architectures for imaging and vision applications.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Improving energy efficiency of DRAM by exploiting half page row access.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

EIE: Efficient Inference Engine on Compressed Deep Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Deep compression and EIE: Efficient inference engine on compressed deep neural network.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Hot Chips 28 Symposium (HCS), 2016

2014

A Highly Efficient Multicore Floating-Point FFT Architecture Based on Hybrid Linear Algebra/FFT Cores.

[BibT_eX]

[DOI]

John D. McCalpin

J. Signal Process. Syst., 2014

Algorithm, Architecture, and Floating-Point Unit Codesign of a Matrix Factorization Accelerator.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2014

2013

Algorithm/Architecture Codesign of Low Power and High Performance Linear Algebra Compute Fabrics.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Transforming a linear algebra core to an FFT accelerator.

[BibT_eX]

[DOI]

John D. McCalpin

Proceedings of the 24th International Conference on Application-Specific Systems, 2013

Floating Point Architecture Extensions for Optimized Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE Symposium on Computer Arithmetic, 2013

2012

Codesign Tradeoffs for High-Performance, Low-Power Linear Algebra Architectures.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2012

On the Efficiency of Register File versus Broadcast Interconnect for Collective Communications in Data-Parallel Hardware Accelerators.

[BibT_eX]

[DOI]

Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012

A Linear Algebra Core Design for Efficient Level-3 BLAS.

[BibT_eX]

[DOI]

Syed Zohaib Gilani

Nam Sung Kim

Michael J. Schulte

Proceedings of the 23rd IEEE International Conference on Application-Specific Systems, 2012

2011

A high-performance, low-power linear algebra core.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Conference on Application-specific Systems, 2011

2009

Modeling Cache Effects at the Transaction Level.

[BibT_eX]

[DOI]

David Craven