Gregorio Bernabé

Orcid: 0000-0002-7265-3508

According to our database, Gregorio Bernabé authored at least 43 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.




Expanding the deep-learning model to diagnosis LVNC: limitations and trade-offs.
Comput. Methods Biomech. Biomed. Eng. Imaging Vis., December, 2024

POAS: a framework for exploiting accelerator level parallelism in heterogeneous environments.
J. Supercomput., July, 2024

Code Detection for Hardware Acceleration Using Large Language Models.
IEEE Access, 2024

Matching linear algebra and tensor code to specialized hardware accelerators.
Dataset, January, 2023

Matching Linear Algebra and Tensor Code to Specialized Hardware Accelerators.
Proceedings of the 32nd ACM SIGPLAN International Conference on Compiler Construction, 2023

HDNN: a cross-platform MLIR dialect for deep neural networks.
J. Supercomput., 2022

Performance portability in a real world application: PHAST applied to Caffe.
Int. J. High Perform. Comput. Appl., 2022

POAS: A high-performance scheduling framework for exploiting Accelerator Level Parallelism.
CoRR, 2022

Applying Intel's oneAPI to a machine learning case study.
Concurr. Comput. Pract. Exp., 2022

Left ventricular non-compaction cardiomyopathy automatic diagnosis using a deep learning approach.
Comput. Methods Programs Biomed., 2022

FARMIT: continuous assessment of crop quality using machine learning and deep learning techniques for IoT-based smart farming.
Clust. Comput., 2022

Deploying deep learning approaches to left ventricular non-compaction measurement.
J. Supercomput., 2021

Deep learning approach to left ventricular non-compaction measurement.
CoRR, 2020

Using PHAST to port Caffe library: First experiences and lessons learned.
CoRR, 2020

A highly accurate method for quantifying LVNC cardiomyopathy.
Proceedings of the AMIA 2020, 2020

A self-optimized software tool for quantifying the degree of left ventricle hyper-trabeculation.
J. Supercomput., 2019

Parallel implementations of the 3D fast wavelet transform on a Raspberry Pi 2 cluster.
J. Supercomput., 2018

On the Parallelization of Stream Compaction on a Low-Cost SDC Cluster.
Sci. Program., 2018

Exploiting Hybrid Parallelism in the Kinematic Analysis of Multibody Systems Based on Group Equations.
Proceedings of the International Conference on Computational Science, 2017

A Training Engine for Automatic Quantification of Left Ventricular Trabeculation from Cardiac MRI.
Proceedings of the International Conference on Computational Science, 2016

Auto-tuning techniques for linear algebra routines on hybrid platforms.
J. Comput. Sci., 2015

An Autotuning Engine for the 3D Fast Wavelet Transform on Clusters with Hybrid CPU + GPU Platforms.
Int. J. Parallel Program., 2015

A Software Tool for the Automatic Quantification of the Left Ventricle Myocardium Hyper-trabeculation Degree.
Proceedings of the International Conference on Computational Science, 2015

Improving an autotuning engine for 3D Fast Wavelet Transform on manycore systems.
J. Supercomput., 2014

A High Performance Computing Course Guided by the LU Factorization.
Proceedings of the International Conference on Computational Science, 2014

Tuning basic Linear Algebra Routines for Hybrid CPU+GPU Platforms.
Proceedings of the International Conference on Computational Science, 2014

Optimizing a 3D-FWT Code in a Heterogeneous Cluster of Multicore CPUs and Manycore GPUs.
Proceedings of the 25th International Symposium on Computer Architecture and High Performance Computing, 2013

Optimization Techniques for 3D-FWT on Systems with Manycore GPUs and Multicore CPUs.
Proceedings of the International Conference on Computational Science, 2013

The 2D wavelet transform on emerging architectures: GPUs and multicores.
J. Real Time Image Process., 2012

Parallel 3D fast wavelet transform on manycore GPUs and multicore CPUs.
Proceedings of the International Conference on Computational Science, 2010

A lossy 3D wavelet transform for high-quality compression of medical video.
J. Syst. Softw., 2009

A Parallel Implementation of the 2D Wavelet Transform Using CUDA.
Proceedings of the 17th Euromicro International Conference on Parallel, 2009

Multicore Platforms for Scientific Computing: Cell BE and NVIDIA Tesla.
Proceedings of the 2008 International Conference on Scientific Computing, 2008

An efficient implementation of a 3D wavelet transform based encoder on hyper-threading technology.
Parallel Comput., 2007

Reducing 3D Fast Wavelet Transform Execution Time Using Blocking and the Streaming SIMD Extensions.
J. VLSI Signal Process., 2005

Optimizing a 3D-FWT Video Encoder for SMPs and HyperThreading Architectures.
Proceedings of the 13th Euromicro Workshop on Parallel, 2005

Reducing 3D Wavelet Transform Execution Time through the Streaming SIMD Extensions.
Proceedings of the 11th Euromicro Workshop on Parallel, 2003

Virtual surgical tele-simulations in ophthalmology.
Proceedings of the CARS 2003. Computer Assisted Radiology and Surgery. Proceedings of the 17th International Congress and Exhibition, 2003

A Virtual surgical telesimulation in micrographic dermatologic surgery.
Proceedings of the CARS 2003. Computer Assisted Radiology and Surgery. Proceedings of the 17th International Congress and Exhibition, 2003

Memory Conscious 3D Wavelet Transform.
Proceedings of the 28th EUROMICRO Conference 2002, 4-6 September 2002, Dortmund, Germany, 2002
