Roger Ferrer

Orcid: 0000-0003-3306-8610

According to our database, Roger Ferrer authored at least 33 papers between 2006 and 2024.

RAVE: RISC-V Analyzer of Vector Executions, a QEMU tracing plugin.
CoRR, 2024

Exploiting long vectors with a CFD code: a co-design show case.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Compressed Real Numbers for AI: a case-study using a RISC-V CPU.
CoRR, 2023

Acceleration with long vector architectures: Implementation and evaluation of the FFT kernel on NEC SX-Aurora and RISC-V vector extension.
Concurr. Comput. Pract. Exp., 2023

Software Development Vehicles to Enable Extended and Early Co-design: A RISC-V and HPC Case of Study.
Proceedings of the High Performance Computing, 2023

DPU Offloading Programming with the OpenMP API.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

OpenMP Offloading to DPU.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

A Novel Set of Directives for Multi-device Programming with OpenMP.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

An OpenMP Free Agent Threads Implementation.
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

Static Analysis to Enhance Programmability and Performance in OmpSs-2.
Proceedings of the High Performance Computing, 2020

Performance and energy effects on task-based parallelized applications - User-directed versus manual vectorization.
J. Supercomput., 2018

MPI+X: task-based parallelization and dynamic load balance of finite element assembly.
CoRR, 2018

Evaluating the Impact of OpenMP 4.0 Extensions on Relevant Parallel Workloads.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Optimizing Overlapped Memory Accesses in User-directed Vectorization.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015

Compiler analysis for OpenMP tasks correctness.
Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015

ACOTES Project: Advanced Compiler Technologies for Embedded Streaming.
Int. J. Parallel Program., 2011

Quantifying the Potential Task-Based Dataflow Parallelism in MPI Applications.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

Extending OpenMP to Survive the Heterogeneous Multi-Core Era.
Int. J. Parallel Program., 2010

Optimizing the Exploitation of Multicore Processors and GPUs with OpenMP and OpenCL.
Proceedings of the Languages and Compilers for Parallel Computing, 2010

A Proposal for User-Defined Reductions in OpenMP.
Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010

An Extension to Improve OpenMP Tasking Control.
Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010

Analysis of Task Offloading for Accelerators.
Proceedings of the High Performance Embedded Architectures and Compilers, 2010

A Proposal to Extend the OpenMP Tasking Model with Dependent Tasks.
Int. J. Parallel Program., 2009

Achieving high memory performance from heterogeneous architectures with the SARC programming model.
Proceedings of the 10th workshop on MEmory performance, 2009

Unrolling Loops Containing Task Parallelism.
Proceedings of the Languages and Compilers for Parallel Computing, 2009

Barcelona OpenMP Tasks Suite: A Set of Benchmarks Targeting the Exploitation of Task Parallelism in OpenMP.
Proceedings of the ICPP 2009, 2009

Nebelung: Execution Environment for Transactional OpenMP.
Int. J. Parallel Program., 2008

Evaluation of memory performance on the cell BE with the SARC programming model.
Proceedings of the 9th workshop on MEmory performance, 2008

A Proposal for Error Handling in OpenMP.
Int. J. Parallel Program., 2007

Multithreaded software transactional memory and OpenMP.
Proceedings of the 2007 workshop on MEmory performance, 2007

Transactional Memory and OpenMP.
Proceedings of the A Practical Programming Model for the Multi-Core Era, 2007

Support for OpenMP tasks in Nanos v4.
Proceedings of the 2007 conference of the Centre for Advanced Studies on Collaborative Research, 2007

Techniques supporting threadprivate in OpenMP.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
