Ruymán Reyes

Orcid: 0000-0002-9108-9615

According to our database1, Ruymán Reyes authored at least 29 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Programming Model Extensions for General-Purpose Processing-In-Memory.
Proceedings of the ISC High Performance 2024 Research Paper Proceedings (39th International Conference), 2024

2020
Towards Cross-Platform Performance Portability of DNN Models using SYCL.
Proceedings of the IEEE/ACM International Workshop on Performance, 2020

SYCL 2020: More than meets the eye.
Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020

Bringing performant support for Nvidia® hardware to SYCL.
Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020

2019
Towards Heterogeneous and Distributed Computing in C++.
Proceedings of the International Workshop on OpenCL, 2019

2018
What's New in SYCL 1.2.1 and How to Explore the Features.
Proceedings of the International Workshop on OpenCL, 2018

Introducing Parallelism to the Ranges TS.
Proceedings of the International Workshop on OpenCL, 2018

2017
SYCL-BLAS: Combining Expression Trees and Kernel Fusion on Heterogeneous Systems.
Proceedings of the Parallel Computing is Everywhere, 2017

Heterogeneous Computing Using Modern C++ with OpenCL Devices: Tutorial at IWOCL 2017.
Proceedings of the 5th International Workshop on OpenCL, 2017

SYCL-BLAS: Leveraging Expression Trees for Linear Algebra.
Proceedings of the 5th International Workshop on OpenCL, 2017

2016
C++ for OpenCL Workshop, IWOCL 2016.
Proceedings of the 4th International Workshop on OpenCL, 2016

2015
SYCL: Single-source C++ accelerator programming.
Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Khronos SYCL for OpenCL: a tutorial.
Proceedings of the 3rd International Workshop on OpenCL, 2015

2014
Leveraging task-parallelism in message-passing dense matrix factorizations using SMPSs.
Parallel Comput., 2014

Performance Evaluation of OpenACC Compilers.
Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

2013
A preliminary evaluation of OpenACC implementations.
J. Supercomput., 2013

Exploring large macromolecular functional motions on clusters of multicore processors.
J. Comput. Phys., 2013

Programming for GPUs: The Directive-Based Approach.
Proceedings of the Eighth International Conference on P2P, 2013

2012
Optimization strategies in different CUDA architectures using llCoMP.
Microprocess. Microsystems, 2012

accULL: An User-directed Approach to Heterogeneous Programming.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Binding Performance and Power of Dense Linear Algebra Operations.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Tools for Power-Energy Modelling and Analysis of Parallel Scientific Applications.
Proceedings of the 41st International Conference on Parallel Processing, 2012

Directive-based Programming for GPUs: A Comparative Study.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

accULL: An OpenACC Implementation with CUDA and OpenCL Support.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011
Automatic code generation for GPUs in llc.
J. Supercomput., 2011

Optimize or Wait? Using llc Fast-Prototyping Tool to Evaluate CUDA Optimizations.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011

Symmetric Rank-k Update on Clusters of Multicore Processors with SMPSs.
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

2010
Case Studies in Automatic GPGPU Code Generation with llc.
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

2009
Automatic Hybrid MPI+OpenMP Code Generation with llc.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009


  Loading...