Bérenger Bramas

Orcid: 0000-0003-0281-9709

According to our database, Bérenger Bramas authored at least 34 papers between 2012 and 2024.

Bibliography

2024
Autovesk: Automatic Vectorized Code Generation from Unstructured Static Kernels Using Graph Transformations.
ACM Trans. Archit. Code Optim., March, 2024

Exploiting ray tracing technology through OptiX to compute particle interactions with cutoff in a 3D environment on GPU.
CoRR, 2024

Efficient GPU Implementation of Particle Interactions with Cutoff Radius and Few Particles per Cell.
CoRR, 2024

SPC5: An efficient SpMV framework vectorized using ARM SVE and x86 AVX-512.
Comput. Sci. Inf. Syst., 2024

Dynamic Tasks Scheduling with Multiple Priorities on Heterogeneous Computing Systems.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
Specx: a C++ task-based runtime system for heterogeneous distributed architectures.
CoRR, 2023

Autovesk: Automatic vectorization of unstructured static kernels by graph transformations.
CoRR, 2023

GPU Code Generation of Cardiac Electrophysiology Simulation with MLIR.
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023

Lifting Code Generation of Cardiac Physiology Simulation to Novel Compiler Technology.
Proceedings of the 21st ACM/IEEE International Symposium on Code Generation and Optimization, 2023

2022
Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach.
Microprocess. Microsystems, November, 2022

Automated prioritizing heuristics for parallel task graph scheduling in heterogeneous computing.
PeerJ Comput. Sci., 2022

An efficient particle tracking algorithm for large-scale parallel pseudo-spectral simulations of turbulence.
Comput. Phys. Commun., 2022

2021
A fast vectorized sorting implementation based on the ARM scalable vector extension (SVE).
PeerJ Comput. Sci., 2021

Automatic task-based parallelization of C++ applications by source-to-source transformations.
CoRR, 2021


2020
Improving parallel executions by increasing task granularity in task-based runtime systems using acyclic DAG clustering.
PeerJ Comput. Sci., 2020

TBFMM: A C++ generic and parallel fast multipole method library.
J. Open Source Softw., 2020

On the improvement of the in-place merge algorithm parallelization.
CoRR, 2020

2019
Impact study of data locality on task-based applications through the Heteroprio scheduler.
PeerJ Prepr., 2019

Increasing the degree of parallelism using speculative execution in task-based runtime systems.
PeerJ Comput. Sci., 2019

An Integral Equation Formulation of the N-body Dielectric Spheres Problem. Part II: Complexity Analysis.
CoRR, 2019

2018
Computing the sparse matrix vector product using block-based kernels without zero padding on processors with AVX-512 instructions.
PeerJ Comput. Sci., 2018

2017
Bridging the Gap Between OpenMP and Task-Based Runtime Systems for the Fast Multipole Method.
IEEE Trans. Parallel Distributed Syst., 2017

Inastemp: A Novel Intrinsics-as-Template Library for Portable SIMD-Vectorization.
Sci. Program., 2017

Fast Sorting Algorithms using AVX-512 on Intel Knights Landing.
CoRR, 2017

2016
Optimization and parallelization of the boundary element method for the wave equation in time domain. (Optimisation et parallélisation de la méthode des éléments frontières pour l'équation des ondes dans le domaine temporel).
PhD thesis, 2016

Task-based FMM for heterogeneous architectures.
Concurr. Comput. Pract. Exp., 2016

2015
Time-domain BEM for the wave equation on distributed-heterogeneous architectures: A blocking approach.
Parallel Comput., 2015

2014
Task-Based FMM for Multicore Architectures.
SIAM J. Sci. Comput., 2014

Time-Domain BEM for the Wave Equation: Optimization and Hybrid Parallelization.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2012
Optimized M2L Kernels for the Chebyshev Interpolation based Fast Multipole Method.
CoRR, 2012

Pipelining the Fast Multipole Method over a Runtime System.
CoRR, 2012



