Proceedings of the 14th Workshop on Parallel Programming and Run-Time Management Techniques for Many-Core Architectures and 12th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms, 2023

HULK-V: a Heterogeneous Ultra-low-power Linux capable RISC-V SoC.

[BibT_eX]

[DOI]

Luca Valente

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

TransLib: A Library to Explore Transprecision Floating-Point Arithmetic on Multi-Core IoT End-Nodes.

[BibT_eX]

[DOI]

Seyed Ahmad Mirsalari

Giuseppe Tagliavini

Davide Rossi

Luca Benini

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

End-to-End DNN Inference on a Massively Parallel Analog In Memory Computing Architecture.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

2022

A Low-Power Transprecision Floating-Point Cluster for Efficient Near-Sensor Data Analytics.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2022

Optimizing Random Forest-Based Inference on RISC-V MCUs at the Extreme Edge.

[BibT_eX]

[DOI]

Enrico Tabanelli

Giuseppe Tagliavini

Luca Benini

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Vega: A Ten-Core SoC for IoT Endnodes With DNN Acceleration and Cognitive Wake-Up From MRAM-Based State-Retentive Sleep Mode.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2022

An Optimized Heart Rate Detection System Based on Low-Power Microcontroller Platforms for Biosignal Processing.

[BibT_eX]

[DOI]

Proceedings of the Advances in System-Integrated Intelligence, 2022

PULP-TrainLib: Enabling On-Device Training for RISC-V Multi-core MCUs Through Performance-Driven Autotuning.

[BibT_eX]

[DOI]

Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2022

Scale up your In-Memory Accelerator: Leveraging Wireless-on-Chip Communication for AIMC-based CNN Inference.

[BibT_eX]

[DOI]

Albert Cabellos-Aparicio

Proceedings of the 4th IEEE International Conference on Artificial Intelligence Circuits and Systems, 2022

2021

Energy-Efficient Hardware-Accelerated Synchronization for Shared-L1-Memory Multiprocessor Clusters.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2021

XpulpNN: Enabling Energy Efficient and Flexible Inference of Quantized Neural Networks on RISC-V Based IoT End Nodes.

[BibT_eX]

[DOI]

IEEE Trans. Emerg. Top. Comput., 2021

DORY: Automatic End-to-End Deployment of Real-World DNNs on Low-Cost IoT MCUs.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2021

Efficient Transform Algorithms for Parallel Ultra-Low-Power IoT End Nodes.

[BibT_eX]

[DOI]

IEEE Embed. Syst. Lett., 2021

Vega: A 10-Core SoC for IoT End-Nodes with DNN Acceleration and Cognitive Wake-Up From MRAM-Based State-Retentive Sleep Mode.

[BibT_eX]

[DOI]

CoRR, 2021

RVfplib: A Fast and Compact Open-Source Floating-Point Emulation Library for Tiny RISC-V Processors.

[BibT_eX]

[DOI]

Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2021

4.4 A 1.3TOPS/W @ 32GOPS Fully Integrated 10-Core SoC for IoT End-Nodes with 1.7μW Cognitive Wake-Up From MRAM-Based State-Retentive Sleep Mode.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid-State Circuits Conference, 2021

GVSoC: A Highly Configurable, Fast and Accurate Full-Platform Simulator for RISC-V based IoT Processors.

[BibT_eX]

[DOI]

Proceedings of the 39th IEEE International Conference on Computer Design, 2021

A 1.15 TOPS/W, 16-Cores Parallel Ultra-Low Power Cluster with 2b-to-32b Fully Flexible Bit-Precision and Vector Lockstep Execution Mode.

[BibT_eX]

[DOI]

Proceedings of the 47th ESSCIRC 2021, 2021

Source Code Classification for Energy Efficiency in Parallel Ultra Low-Power Microcontrollers.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

Towards Long-term Non-invasive Monitoring for Epilepsy via Wearable EEG Devices.

[BibT_eX]

[DOI]

Thorir Mar Ingolfsson

Proceedings of the IEEE Biomedical Circuits and Systems Conference, BioCAS 2021, 2021

Streamlining the OpenMP Programming Model on Ultra-Low-Power Multi-core MCUs.

[BibT_eX]

[DOI]

Proceedings of the Architecture of Computing Systems - 34th International Conference, 2021

2020

FlexFloat: A Software Library for Transprecision Computing.

[BibT_eX]

[DOI]

Giuseppe Tagliavini

Andrea Marongiu

Luca Benini

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

XpulpNN: Enabling Energy Efficient and Flexible Inference of Quantized Neural Network on RISC-V based IoT End Nodes.

[BibT_eX]

[DOI]

CoRR, 2020

A transprecision floating-point cluster for efficient near-sensor data analytics.

[BibT_eX]

[DOI]

CoRR, 2020

A Mixed-Precision RISC-V Processor for Extreme-Edge DNN Inference.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE Computer Society Annual Symposium on VLSI, 2020

TRANSPIRE: An energy-efficient TRANSprecision floating-point Programmable archItectuRE.

[BibT_eX]

[DOI]

Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

XpulpNN: Accelerating Quantized Neural Networks on RISC-V Processors Through ISA Extensions.

[BibT_eX]

[DOI]

Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

Enabling mixed-precision quantized neural networks in extreme-edge devices.

[BibT_eX]

[DOI]

Proceedings of the 17th ACM International Conference on Computing Frontiers, 2020

Combining learning and optimization for transprecision computing.

[BibT_eX]

[DOI]

Proceedings of the 17th ACM International Conference on Computing Frontiers, 2020

2019

BioWolf: A Sub-10-mW 8-Channel Advanced Brain-Computer Interface Platform With a Nine-Core Processor and BLE Connectivity.

[BibT_eX]

[DOI]

Victor Javier Kartsch

IEEE Trans. Biomed. Circuits Syst., 2019

Mr.Wolf: An Energy-Precision Scalable Parallel Ultra Low Power SoC for IoT Edge Processing.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2019

Design and Evaluation of SmallFloat SIMD extensions to the RISC-V ISA.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

2018

Unleashing Fine-Grained Parallelism on Embedded Many-Core Accelerators with Lightweight OpenMP Tasking.

[BibT_eX]

[DOI]

Giuseppe Tagliavini

Daniele Cesarini

Andrea Marongiu

IEEE Trans. Parallel Distributed Syst., 2018

Synergistic HW/SW Approximation Techniques for Ultralow-Power Parallel Computing.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018

Optimizing memory bandwidth exploitation for OpenVX applications on embedded many-core accelerators.

[BibT_eX]

[DOI]

J. Real Time Image Process., 2018

A Transprecision Floating-Point Architecture for Energy-Efficient Embedded Computing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

A transprecision floating-point platform for ultra-low power computing.

[BibT_eX]

[DOI]

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

The transprecision computing paradigm: Concept, design, and applications.

[BibT_eX]

[DOI]

A. Cristiano I. Malossi

Dimitrios S. Nikolopoulos

Eric Flamand

Norbert Wehn

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

2017

Optimization Techniques for Parallel Programming of Embedded Many-Core Computing Platforms.

[BibT_eX]

[DOI]

Giuseppe Tagliavini

PhD thesis, 2017

2016

Always-on motion detection with application-level error control on a near-threshold approximate computing platform.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Electronics, Circuits and Systems, 2016

Enabling OpenVX support in mW-scale parallel accelerators.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Compilers, 2016

2015

Simplifying Many-Core-Based Heterogeneous SoC Programming With Offload Directives.

[BibT_eX]

[DOI]

IEEE Trans. Ind. Informatics, 2015

A framework for optimizing OpenVX applications performance on embedded manycore accelerators.

[BibT_eX]

[DOI]

Proceedings of the 18th International Workshop on Software and Compilers for Embedded Systems, 2015

ADRENALINE: An OpenVX Environment to Optimize Embedded Vision Applications on Many-core Accelerators.

[BibT_eX]

[DOI]

Proceedings of the IEEE 9th International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2015

Synergistic Architecture and Programming Model Support for Approximate Micropower Computing.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Computer Society Annual Symposium on VLSI, 2015

Exploring architectural heterogeneity in intelligent vision systems.

[BibT_eX]

[DOI]

Nandhini Chandramoorthy

Vijaykrishnan Narayanan

Luca Benini

Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

PULP: A parallel ultra low power platform for next generation IoT applications.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Hot Chips 27 Symposium (HCS), 2015

2014

Tightly-coupled hardware support to dynamic parallelism acceleration in embedded shared memory clusters.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

Optimizing memory bandwidth in OpenVX graph execution on embedded many-core accelerators.

[BibT_eX]

[DOI]

Giuseppe Tagliavini

Germain Haugou

Luca Benini

Proceedings of the 2014 Conference on Design and Architectures for Signal and Image Processing, 2014

Supporting localized OpenVX kernel execution for efficient computer vision application development on STHORM many-core platform.

[BibT_eX]

[DOI]

Giuseppe Tagliavini

Germain Haugou

Luca Benini

Proceedings of the Computing Frontiers Conference, CF'14, 2014

2013

Improving the programmability of STHORM-based heterogeneous systems with offload-enabled OpenMP.

[BibT_eX]

[DOI]

Proceedings of the 1st International Workshop on Many-core Embedded Systems 2013, 2013

Enabling fine-grained OpenMP tasking on tightly-coupled shared memory clusters.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation and Test in Europe, 2013

2011

MPOpt-Cell: a high-performance data-flow programming environment for the CELL BE processor.

[BibT_eX]

[DOI]

Alessio Franceschelli

Proceedings of the 8th Conference on Computing Frontiers, 2011

Giuseppe Tagliavini

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...