Muhammad Waqar Azhar

ACM Trans. Archit. Code Optim., June, 2024

Moving Forward: A Review of Autonomous Driving Software and Hardware Systems.

[BibT_eX]

[DOI]

CoRR, 2024

Simulation of Quantum Computers: Review and Acceleration Opportunities.

[BibT_eX]

[DOI]

CoRR, 2024

Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs.

[BibT_eX]

[DOI]

Proceedings of the Workshop Proceedings of the 53rd International Conference on Parallel Processing, 2024

Scratchpad Memory Management for Deep Learning Accelerators.

[BibT_eX]

[DOI]

Proceedings of the 53rd International Conference on Parallel Processing, 2024

DNNOPT: A Framework for Efficiently Selecting On-chip Memory Loop Optimizations of DNN Accelerators.

[BibT_eX]

[DOI]

Piyumal Ranawaka

Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024

2023

Approx-RM: Reducing Energy on Heterogeneous Multicore Processors under Accuracy and Timing Constraints.

[BibT_eX]

[DOI]

Madhavan Manivannan

ACM Trans. Archit. Code Optim., September, 2023

Exploiting the Potential of Flexible Processing Units.

[BibT_eX]

[DOI]

Mateo Vázquez

Proceedings of the 35th IEEE International Symposium on Computer Architecture and High Performance Computing, 2023

RAINBOW: Multi-Dimensional Hardware-Software Co-Design for DL Accelerator On-Chip Memory.

[BibT_eX]

[DOI]

Stavroula Zouzoula

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2023

Evaluation of heterogeneous AIoT Accelerators within VEDLIoT.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

VEDLIoT: Next generation accelerated AIoT systems and applications.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM International Conference on Computing Frontiers, 2023

ARADA: Adaptive Resource Allocation for Improving Energy Efficiency in Deep Learning Accelerators.

[BibT_eX]

[DOI]

Stavroula Zouzoula

Proceedings of the 20th ACM International Conference on Computing Frontiers, 2023

2022

Task-RM: A Resource Manager for Energy Reduction in Task-Parallel Applications under Quality of Service Constraints.

[BibT_eX]

[DOI]

Miquel Pericàs

ACM Trans. Archit. Code Optim., 2022

FiBHA: Fixed Budget Hybrid CNN Accelerator.

[BibT_eX]

[DOI]

Fareed Qararyah

Proceedings of the 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2022

VSA: A Hybrid Vector-Systolic Architecture.

[BibT_eX]

[DOI]

Mateo Vázquez Maceiras

Proceedings of the IEEE 40th International Conference on Computer Design, 2022

2019

SaC: Exploiting Execution-Time Slack to Save Energy in Heterogeneous Multicore Systems.

[BibT_eX]

[DOI]

Miquel Pericàs

Proceedings of the 48th International Conference on Parallel Processing, 2019

2017

SLOOP: QoS-Supervised Loop Execution to Reduce Energy on Heterogeneous Architectures.

[BibT_eX]

[DOI]

Vassilis Papaefstathiou

ACM Trans. Archit. Code Optim., 2017

2012

Viterbi Accelerator for Embedded Processor Datapaths.

[BibT_eX]

[DOI]

Kashan Khurshid Ansari

Per Larsson-Edefors

Proceedings of the 23rd IEEE International Conference on Application-Specific Systems, 2012

2010

Cyclic Redundancy Checking (CRC) Accelerator for the FlexCore Processor.

[BibT_eX]

[DOI]