Juan Touriño

Orcid: 0000-0001-9670-1933

Affiliations:
  • University of A Coruña, Spain


According to our database, Juan Touriño authored at least 136 papers between 1996 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
CUDA acceleration of MI-based feature selection methods.
J. Parallel Distributed Comput., 2024

Serverless-like platform for container-based YARN clusters.
Future Gener. Comput. Syst., 2024

Automated Approach for Accurate CPU Power Modelling.
Proceedings of the IEEE International Conference on Cluster Computing, 2024

2023
SeQual-Stream: approaching stream processing to quality control of NGS datasets.
BMC Bioinform., December, 2023

A pipeline architecture for feature-based unsupervised clustering using multivariate time series from HPC jobs.
Inf. Fusion, May, 2023

PATO: genome-wide prediction of lncRNA-DNA triple helices.
Bioinform., March, 2023

ParRADMeth: Identification of Differentially Methylated Regions on Multicore Clusters.
IEEE ACM Trans. Comput. Biol. Bioinform., 2023

pRIblast: A highly efficient parallel application for comprehensive lncRNA-RNA interaction prediction.
Future Gener. Comput. Syst., 2023

Clupiter: a Raspberry Pi mini-supercomputer for educational purposes.
Proceedings of the 22nd IEEE International Conference on Trust, 2023

Accelerating the quality control of genetic sequences through stream processing.
Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing, 2023

Parallel construction of RNA databases for extensive lncRNA-RNA interaction prediction.
Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing, 2023

2022
Parallel-FST: A feature selection library for multicore clusters.
J. Parallel Distributed Comput., 2022

SparkEC: speeding up alignment-based DNA error correction tools.
BMC Bioinform., 2022

MARTA: Multi-configuration Assembly pRofiler and Toolkit for performance Analysis.
Proceedings of the International IEEE Symposium on Performance Analysis of Systems and Software, 2022

Custom High-Performance Vector Code Generation for Data-Specific Sparse Computations.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021
Optimizing Coherence Traffic in Manycore Processors Using Closed-Form Caching/Home Agent Mappings.
IEEE Access, 2021

2020
SMusket: Spark-based DNA error correction on distributed-memory systems.
Future Gener. Comput. Syst., 2020

Real-time resource scaling platform for Big Data workloads on serverless environments.
Future Gener. Comput. Syst., 2020

Coherence Traffic in Manycore Processors with Opaque Distributed Directories.
CoRR, 2020

Enabling Hardware Affinity in JVM-Based Applications: A Case Study for Big Data.
Proceedings of the Computational Science - ICCS 2020, 2020

Power Budgeting of Big Data Applications in Container-based Clusters.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019
Performance Evaluation of Big Data Analysis.
Proceedings of the Encyclopedia of Big Data Technologies, 2019

Affine Modeling of Program Traces.
IEEE Trans. Computers, 2019

Parallel feature selection for distributed-memory clusters.
Inf. Sci., 2019

Simulating the Network Activity of Modern Manycores.
IEEE Access, 2019

Effect of Distributed Directories in Mesh Interconnects.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019

2018
Enhancing in-memory efficiency for MapReduce-based data processing.
J. Parallel Distributed Comput., 2018

Big Data-Oriented PaaS Architecture with Disk-as-a-Resource Capability and Container-Based Virtualization.
J. Grid Comput., 2018

BDEv 3.0: Energy efficiency and microarchitectural characterization of Big Data processing frameworks.
Future Gener. Comput. Syst., 2018

BDWatchdog: Real-time monitoring and profiling of Big Data applications and frameworks.
Future Gener. Comput. Syst., 2018

Architectural exploration of heterogeneous memory systems.
CoRR, 2018

Optimization of Real-World MapReduce Applications With Flame-MR: Practical Use Cases.
IEEE Access, 2018

2017
MarDRe: efficient MapReduce-based removal of duplicate DNA reads in the cloud.
Bioinform., 2017

2016
Parallel Pairwise Epistasis Detection on Heterogeneous Computing Architectures.
IEEE Trans. Parallel Distributed Syst., 2016

Multithreaded and Spark parallelization of feature selection filters.
J. Comput. Sci., 2016

Locality-Aware Automatic Parallelization for GPGPU with OpenHMPP Directives.
Int. J. Parallel Program., 2016

Flame-MR: An event-driven architecture for MapReduce applications.
Future Gener. Comput. Syst., 2016

Performance Evaluation of Data-Intensive Computing Applications on a Public IaaS Cloud.
Comput. J., 2016

Analysis and evaluation of MapReduce solutions on an HPC cluster.
Comput. Electr. Eng., 2016

MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems.
Bioinform., 2016

Trace-based affine reconstruction of codes.
Proceedings of the 2016 International Symposium on Code Generation and Optimization, 2016

Performance evaluation of big data frameworks for large-scale data analytics.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

2015
Nonblocking collectives for scalable Java communications.
Concurr. Comput. Pract. Exp., 2015

Low-latency Java communication devices on RDMA-enabled networks.
Concurr. Comput. Pract. Exp., 2015

MREv: An Automatic MapReduce Evaluation Tool for Big Data Workloads.
Proceedings of the International Conference on Computational Science, 2015

2014
A 2D algorithm with asymmetric workload for the UPC conjugate gradient method.
J. Supercomput., 2014

Volatile STT-RAM Scratchpad Design and Data Allocation for Low Energy.
ACM Trans. Archit. Code Optim., 2014

FastMPJ: a scalable and efficient Java message-passing library.
Clust. Comput., 2014

A parallelizing compiler for multicore systems.
Proceedings of the 17th International Workshop on Software and Compilers for Embedded Systems, 2014

The HPS3 Service: Reduction of Cost and Transfer Time for Storing Data on Clouds.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

2013
Parallel simulation of Brownian dynamics on shared memory systems with OpenMP and Unified Parallel C.
J. Supercomput., 2013

Performance evaluation of sparse matrix products in UPC.
J. Supercomput., 2013

Java in the High Performance Computing arena: Research, practice and experience.
Sci. Comput. Program., 2013

Evaluation of messaging middleware for high-performance cloud computing.
Pers. Ubiquitous Comput., 2013

A novel compiler support for automatic parallelization on multicore systems.
Parallel Comput., 2013

Design and Implementation of an Extended Collectives Library for Unified Parallel C.
J. Comput. Sci. Technol., 2013

Compiler-Assisted Checkpointing of Parallel Codes: The Cetus and LLVM Experience.
Int. J. Parallel Program., 2013

Analysis of I/O Performance on an Amazon EC2 Cluster Compute and High I/O Platform.
J. Grid Comput., 2013

Performance analysis of HPC applications in the cloud.
Future Gener. Comput. Syst., 2013

Web-GIS tool for the management of rural land markets - Application to the Land Bank of Galicia (NW Spain).
Earth Sci. Informatics, 2013

Parallel Brownian dynamics simulations with the message-passing and PGAS programming models.
Comput. Phys. Commun., 2013

General-purpose computation on GPUs for high performance cloud computing.
Concurr. Comput. Pract. Exp., 2013

Design of Scalable Java Communication Middleware for Multi-Core Systems.
Comput. J., 2013

The Servet 3.0 benchmark suite: Characterization of network performance degradation.
Comput. Electr. Eng., 2013

Evaluation of Java for General Purpose GPU Computing.
Proceedings of the 27th International Conference on Advanced Information Networking and Applications Workshops, 2013

2012
F-MPJ: scalable Java message-passing communications on parallel systems.
J. Supercomput., 2012

Design of scalable Java message-passing communications over InfiniBand.
J. Supercomput., 2012

UPCBLAS: a library for parallel matrix computations in Unified Parallel C.
Concurr. Comput. Pract. Exp., 2012

Automatic mapping of parallel applications on multicore architectures using the Servet benchmark suite.
Comput. Electr. Eng., 2012

Communication avoiding and overlapping for numerical linear algebra.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Design and Performance Issues of Cholesky and LU Solvers Using UPCBLAS.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

2011
Design of efficient Java message-passing collectives on multi-core clusters.
J. Supercomput., 2011

Special issue on "Theory and practice of high-performance computing, communications, and security".
J. Supercomput., 2011

Dense Triangular Solvers on Multicore Clusters using UPC.
Proceedings of the International Conference on Computational Science, 2011

Device level communication libraries for high-performance computing in Java.
Concurr. Comput. Pract. Exp., 2011

Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case Study.
Comput. J., 2011

Extending the Globus Information Service with the Common Information Model.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2011

Design and Implementation of MapReduce Using the PGAS Programming Model with UPC.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Scalable Java Communication Middleware for Hybrid Shared/Distributed Memory Architectures.
Proceedings of the 13th IEEE International Conference on High Performance Computing & Communication, 2011

2010
Performance analysis of message-passing libraries on high-speed clusters.
Comput. Syst. Sci. Eng., 2010

CPPC: a compiler-assisted tool for portable checkpointing of message-passing applications.
Concurr. Comput. Pract. Exp., 2010

Servet: A benchmark suite for autotuning on multicore clusters.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009
A Heuristic Approach for the Automatic Insertion of Checkpoints in Message-Passing Codes.
J. Univers. Comput. Sci., 2009

Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Java for high performance computing: assessment of current research and practice.
Proceedings of the 7th International Conference on Principles and Practice of Programming in Java, 2009

NPB-MPJ: NAS Parallel Benchmarks Implementation for Message-Passing in Java.
Proceedings of the 17th Euromicro International Conference on Parallel, 2009

Ontological Configuration Management for Wireless Mesh Routers.
Proceedings of the IP Operations and Management, 9th IEEE International Workshop, 2009

Performance Evaluation of Unified Parallel C Collective Communications.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

Efficient Java Communication Libraries over InfiniBand.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

A Parallel Numerical Library for UPC.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008
XARK: An extensible framework for automatic recognition of computational kernels.
ACM Trans. Program. Lang. Syst., 2008

Java Fast Sockets: Enabling high-speed Java communications on high performance clusters.
Comput. Commun., 2008

Integrating the common information model with MDS4.
Proceedings of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan, September 29, 2008

Topic 6: Grid and Cluster Computing.
Proceedings of the Euro-Par 2008, 2008

Efficiently Building the Gated Single Assignment Form in Codes with Pointers in Modern Optimizing Compilers.
Proceedings of the Euro-Par 2008, 2008

2007
Guest Editorial Grid Education and Grid-Based Technologies Applied to Education: Ongoing Activities.
IEEE Trans. Educ., 2007

Special Issue: Current Trends in Compilers for Parallel Computers.
Concurr. Comput. Pract. Exp., 2007

Automated and accurate cache behavior analysis for codes with irregular access patterns.
Concurr. Comput. Pract. Exp., 2007

Enhancing Fault-Tolerance of Large-Scale MPI Scientific Applications.
Proceedings of the Parallel Computing Technologies, 2007

High Performance Java Remote Method Invocation for Parallel Computing on Clusters.
Proceedings of the 12th IEEE Symposium on Computers and Communications (ISCC 2007), 2007

High Performance Java Sockets for Parallel Computing on Clusters.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Program Behavior Characterization Through Advanced Kernel Recognition.
Proceedings of the Euro-Par 2007, 2007

Towards Low-Latency Model-Oriented Distributed Systems Management.
Proceedings of the Managing Next Generation Networks and Services, 2007

2006
Controller/Precompiler for Portable Checkpointing.
IEICE Trans. Inf. Syst., 2006

Non-blocking Java Communications Support on Clusters.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Efficient Java Communication Protocols on High-speed Cluster Interconnects.
Proceedings of the LCN 2006, 2006

2005
A Grid Portal for an Undergraduate Parallel Programming Course.
IEEE Trans. Educ., 2005

Designing Efficient Java Communications on Clusters.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

A Framework Focus on Configuration Modeling and Integration with Transparent Persistence.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

2004
A compiler tool to predict memory hierarchy performance of scientific codes.
Parallel Comput., 2004

A middleware architecture for distributed systems management.
J. Parallel Distributed Comput., 2004

A Grid Portal to Support High-Performance Scientific Computing on Distributed Resources.
IEICE Trans. Inf. Syst., 2004

An Inspector-Executor Algorithm for Irregular Assignment Parallelization.
Proceedings of the Parallel and Distributed Processing and Applications, 2004

Compiler Support for Parallel Code Generation through Kernel Recognition.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

2003
Research Article: A GIS-embedded system to support land consolidation plans in Galicia.
Int. J. Geogr. Inf. Sci., 2003

Performance Modeling and Evaluation of Java Message-Passing Primitives on a Cluster.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29, 2003

A GSA-based compiler infrastructure to extract parallelism from complex loops.
Proceedings of the 17th Annual International Conference on Supercomputing, 2003

Performance Analysis of Java Message-Passing Libraries on Fast Ethernet, Myrinet and SCI Clusters.
Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

2002
Performance Modeling and Evaluation of MPI-I/O on a Cluster.
J. Inf. Sci. Eng., 2002

Performance analysis of MPI-I/O primitives on a PC cluster.
Proceedings of the 2002 ACM Symposium on Applied Computing (SAC), 2002

Irregular Assignment Computations on cc-NUMA Multiprocessors.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

Exploiting Locality in the Run-Time Parallelization of Irregular Loops.
Proceedings of the 31st International Conference on Parallel Processing (ICPP 2002), 2002

Improving Locality in the Parallelization of Doacross Loops (Research Note).
Proceedings of the Euro-Par 2002, 2002

Towards Detection of Coarse-Grain Loop-Level Parallelism in Irregular Computations.
Proceedings of the Euro-Par 2002, 2002

2001
Efficient parallel numerical solver for the elastohydrodynamic Reynolds-Hertz problem.
Parallel Comput., 2001

Hierarchical Radiosity on Multicomputers: a Load-Balanced Approach.
Proceedings of the Tenth SIAM Conference on Parallel Processing for Scientific Computing, 2001

A Compiler Framework to Detect Parallelism in Irregular Codes.
Proceedings of the Languages and Compilers for Parallel Computing, 2001

Characterization of Message-Passing Overhead on the AP3000 Multicomputer.
Proceedings of the 2001 International Conference on Parallel Processing, 2001

COPA: a GIS-based Tool for Land Consolidation Projects.
Proceedings of the ACM-GIS 2001, 2001

1999
Modeling MPI Collective Communications on the AP3000 Multicomputer.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 1999

A Parallel Approach for Solving a Lubrication Problem in Industrial Devices.
Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999

Performance Evaluation and Modeling of the Fujitsu AP3000 Message-Passing Libraries.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

1998
A PVM-Based Library for Sparse Matrix Factorizations.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 1998

HPF-2 Support for Dynamic Sparse Computations.
Proceedings of the Languages and Compilers for Parallel Computing, 1998

1996
Sparse Householder QR Factorization on a Mesh.
Proceedings of the 4th Euromicro Workshop on Parallel and Distributed Processing (PDP '96), 1996

Parallel Sparse Modified Gram-Schmidt QR Decomposition.
Proceedings of the High-Performance Computing and Networking, 1996

