Carlos Álvarez

IEEE Trans. Computers, January, 2024

Automated parallel execution of distributed task graphs with FPGA clusters.

Future Gener. Comput. Syst., 2024

The TEXTAROSSA Project: Cool all the Way Down to the Hardware.

Proceedings of the 27th Euromicro Conference on Digital System Design, 2024

2023

FPGA Framework Improvements for HPC Applications.

Proceedings of the International Conference on Field Programmable Technology, 2023

Accelerating SpMV on FPGAs Through Block-Row Compress: A Task-Based Approach.

Proceedings of the 33rd International Conference on Field-Programmable Logic and Applications, 2023

Improving Performance of HPC Kernels on FPGAs Using High-Level Resource Management.

Alberto Riccardo Martinelli

Proceedings of the 31st IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2023

b8c: SpMV accelerator implementation leveraging high memory bandwidth.

Proceedings of the 31st IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2023

2022

Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach.

Pier Stanislao Paolucci

Microprocess. Microsystems, November, 2022

OmpSs@cloudFPGA: An FPGA Task-Based Programming Model with Message Passing.

Rubén Cano

Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Towards Reconfigurable Accelerators in HPC: Designing a Multipurpose eFPGA Tile for Heterogeneous SoCs.

Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

2021

OmpSs@FPGA Framework for High Performance FPGA Computing.

IEEE Trans. Computers, 2021

The AXIOM Project: IoT on Heterogeneous Embedded Platforms.

Marc Mateu

Dionisios N. Pnevmatikatos

Dimitrios Theodoropoulos

IEEE Des. Test, 2021

An FPGA cached sparse matrix vector product (SpMV) for unstructured computational fluid dynamics simulations.

CoRR, 2021

TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale.

Alberto Riccardo Martinelli

Pier Stanislao Paolucci

Proceedings of the 24th Euromicro Conference on Digital System Design, 2021

Task-Based Programming Models for Heterogeneous Recurrent Workloads.

Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2021

2020

Asynchronous runtime with distributed manager for task-based programming models.

Parallel Comput., 2020

Breaking master-slave model between host and FPGAs.

Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

LEGaTO: Low-Energy, Secure, and Resilient Toolset for Heterogeneous Computing.

Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

2019

A Hardware Runtime for Task-Based Programming Models.

Xubin Tan

Andrew Anesetti-Rothermel

IEEE Trans. Parallel Distributed Syst., 2019

Individual Mobility and Uncertain Geographic Context: Real-time Versus Neighborhood Approximated Exposure to Retail Tobacco Outlets Across the US.

Thomas R. Kirchner

Hong Gao

Daniel J. Lewis

Brian House

J. Heal. Informatics Res., 2019

LEGaTO: Low-Energy, Secure, and Resilient Toolset for Heterogeneous Computing.

CoRR, 2019

Adding Tightly-Integrated Task Scheduling Acceleration to a RISC-V Multi-core Processor.

Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

2018

LEGaTO: first steps towards energy-efficient toolset for heterogeneous computing.

Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, 2018

Application Acceleration on FPGAs with OmpSs@FPGA.

Proceedings of the International Conference on Field-Programmable Technology, 2018

LEGaTO: towards energy-efficient, secure, fault-tolerant toolset for heterogeneous computing.

Proceedings of the 15th ACM International Conference on Computing Frontiers, 2018

2017

The AXIOM platform for next-generation cyber physical systems.

Dimitris Theodoropoulos

Dionisis N. Pnevmatikatos

Francesco Montefoschi

David Oro

Microprocess. Microsystems, 2017

Implementation of the K-Means Algorithm on Heterogeneous Devices: A Use Case Based on an Industrial Dataset.

Filippo Mantovani

Proceedings of the Parallel Computing is Everywhere, 2017

General Purpose Task-Dependence Management Hardware for Task-Based Dataflow Programming Models.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Characterizing and Improving the Performance of Many-Core Task-Based Parallel Programming Runtimes.

Xubin Tan

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Picos, A Hardware Task-Dependence Manager for Task-Based Dataflow Programming Models.

Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017

Exploiting Parallelism on GPUs and FPGAs with OmpSs.

Proceedings of the 1st Workshop on AutotuniNg and aDaptivity AppRoaches for Energy efficient HPC Systems, 2017

2016

MInGLE: An Efficient Framework for Domain Acceleration Using Low-Power Specialized Functional Units.

Cecilia González-Alvarez

Jennifer B. Sartor

Lieven Eeckhout

ACM Trans. Archit. Code Optim., 2016

The AXIOM software layers.

Microprocess. Microsystems, 2016

The Secrets of the Accelerators Unveiled: Tracing Heterogeneous Executions Through OMPT.

Germán Llort

Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

Performance analysis of a hardware accelerator of dependence management for task-based dataflow programming models.

Xubin Tan

Proceedings of the 2016 IEEE International Symposium on Performance Analysis of Systems and Software, 2016

AXIOM: A Hardware-Software Platform for Cyber Physical Systems.

Dionisios N. Pnevmatikatos

Francesco Montefoschi

David Oro

Antonio Rizzo

Dimitris Theodoropoulos

Roberto Giorgi

Proceedings of the 2016 Euromicro Conference on Digital System Design, 2016

2015

Picos: A hardware runtime architecture support for OmpSs.

Fahimeh Yazdanpanah

Rosa M. Badia

Future Gener. Comput. Syst., 2015

Coarse-Grain Performance Estimator for Heterogeneous Parallel Computing Architectures like Zynq All-Programmable SoC.

Dionisios N. Pnevmatikatos

CoRR, 2015

The AXIOM project (Agile, eXtensible, fast I/O Module).

Dimitris Theodoropoulos

Javier Rodríguez Saeta

Paolo Gai

Antonio Rizzo

Roberto Giorgi

Proceedings of the 2015 International Conference on Embedded Computer Systems: Architectures, 2015

The AXIOM Software Layers.

Proceedings of the 2015 Euromicro Conference on Digital System Design, 2015

Automatic design of domain-specific instructions for low-power processors.

Cecilia González-Alvarez

Jennifer B. Sartor

Lieven Eeckhout

Proceedings of the 26th IEEE International Conference on Application-specific Systems, 2015

2014

Hybrid Dataflow/von-Neumann Architectures.

Fahimeh Yazdanpanah

Yoav Etsion

IEEE Trans. Parallel Distributed Syst., 2014

OmpSs@Zynq all-programmable SoC ecosystem.

Eduard Gil

Proceedings of the 2014 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2014

2013

Accelerating an application domain with specialized functional units.

Cecilia González-Alvarez

Jennifer B. Sartor

Lieven Eeckhout

ACM Trans. Archit. Code Optim., 2013

Heterogeneous tasking on SMP/FPGA SoCs: The case of OmpSs and the Zynq.

Eduard Gil

Jan Langer

Juanjo Noguera

Proceedings of the 21st IEEE/IFIP International Conference on VLSI and System-on-Chip, 2013

Analysis of the Task Superscalar Architecture Hardware Design.

Fahimeh Yazdanpanah

Yoav Etsion

Rosa M. Badia

Proceedings of the International Conference on Computational Science, 2013

2012

Dynamic Tolerance Region Computing for Multimedia.

Jesús Corbal

IEEE Trans. Computers, 2012

2007

Computación difusa.

PhD thesis, 2007

2005

Fuzzy Memoization for Floating-Point Multimedia Applications.

Jesús Corbal