Ardavan Pedram
ORCID: 0000-0002-6348-6701
According to our database, Ardavan Pedram authored at least 27 papers between 2009 and 2023.
Bibliography
2023
Retrospective: EIE: Efficient Inference Engine on Sparse and Compressed Neural Network.
CoRR, 2023
2022
Algorithm/architecture solutions to improve beyond uniform quantization in embedded DNN accelerators.
J. Syst. Archit., 2022
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022
2021
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021
2020
Campfire: Compressible, Regularization-Free, Structured Sparse Training for Hardware Accelerators.
CoRR, 2020
2018
Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2018
2017
IEEE Des. Test, 2017
Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017
CATERPILLAR: Coarse Grain Reconfigurable Architecture for accelerating the training of Deep Neural Networks.
Proceedings of the 28th IEEE International Conference on Application-specific Systems, 2017
2016
Proceedings of the International Conference on Embedded Computer Systems: Architectures, 2016
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016
Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016
Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016
Deep compression and EIE: Efficient inference engine on compressed deep neural network.
Proceedings of the 2016 IEEE Hot Chips 28 Symposium (HCS), 2016
2014
A Highly Efficient Multicore Floating-Point FFT Architecture Based on Hybrid Linear Algebra/FFT Cores.
J. Signal Process. Syst., 2014
Algorithm, Architecture, and Floating-Point Unit Codesign of a Matrix Factorization Accelerator.
IEEE Trans. Computers, 2014
2013
Algorithm/Architecture Codesign of Low Power and High Performance Linear Algebra Compute Fabrics.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013
Proceedings of the 24th International Conference on Application-Specific Systems, 2013
Proceedings of the 21st IEEE Symposium on Computer Arithmetic, 2013
2012
IEEE Trans. Computers, 2012
On the Efficiency of Register File versus Broadcast Interconnect for Collective Communications in Data-Parallel Hardware Accelerators.
Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012
Proceedings of the 23rd IEEE International Conference on Application-Specific Systems, 2012
2011
Proceedings of the 22nd IEEE International Conference on Application-specific Systems, 2011
2009