Ravi Reddy
Orcid: 0000-0001-9181-3290Affiliations:
- University College Dublin, Ireland
According to our database1,
Ravi Reddy
authored at least 63 papers
between 2000 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications.
J. Parallel Distributed Comput., January, 2024
OpenH: A Novel Programming Model and API for Developing Portable Parallel Programs on Heterogeneous Hybrid Servers.
IEEE Access, 2024
2023
Efficient exact algorithms for continuous bi-objective performance-energy optimization of applications with linear energy and monotonically increasing performance profiles on heterogeneous high performance computing platforms.
Concurr. Comput. Pract. Exp., 2023
Acceleration of Bi-Objective Optimization of Data-Parallel Applications for Performance and Energy on Heterogeneous Hybrid Platforms.
IEEE Access, 2023
2022
Novel bi-objective optimization algorithms minimizing the max and sum of vectors of functions.
CoRR, 2022
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022
2021
Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution.
IEEE Trans. Parallel Distributed Syst., 2021
Improving the accuracy of energy predictive models for multicore CPUs by combining utilization and performance events model variables.
J. Parallel Distributed Comput., 2021
Energy Predictive Models of Computing: Theory, Practical Implications and Experimental Analysis on Multicore Processors.
IEEE Access, 2021
A Novel Algorithm for Bi-objective Performance-Energy Optimization of Applications with Continuous Performance and Linear Energy Profiles on Heterogeneous HPC Platforms.
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021
2020
The 27th International Heterogeneity in Computing Workshop and the 16th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms.
Concurr. Comput. Pract. Exp., 2020
A novel data partitioning algorithm for dynamic energy optimization on heterogeneous high-performance computing platforms.
Concurr. Comput. Pract. Exp., 2020
A Comparative Study of Techniques for Energy Predictive Modeling Using Performance Monitoring Counters on Modern Multicore CPUs.
IEEE Access, 2020
A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes.
IEEE Access, 2020
Accurate Energy Modelling of Hybrid Parallel Applications on Modern Heterogeneous Computing Platforms Using System-Level Measurements.
IEEE Access, 2020
2019
ACM Comput. Surv., 2019
Modern Multicore CPUs are not Energy Proportional: Opportunity for Bi-objective Optimization for Performance and Energy.
CoRR, 2019
Bi-objective Optimisation of Data-parallel Applications on Heterogeneous Platforms for Performance and Energy via Workload Distribution.
CoRR, 2019
Energy of Computing on Multicore CPUs: Predictive Models and Energy Conservation Law.
CoRR, 2019
Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution.
Concurr. Comput. Pract. Exp., 2019
Improving the Accuracy of Energy Predictive Models for Multicore CPUs Using Additivity of Performance Monitoring Counters.
Proceedings of the Parallel Computing Technologies, 2019
SummaGen: Parallel Matrix-Matrix Multiplication Based on Non-rectangular Partitions for Heterogeneous HPC Platforms.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Dynamic Energy Through Workload Distribution.
Proceedings of the Euro-Par 2019: Parallel Processing Workshops, 2019
Proceedings of the Ultrascale Computing Systems, 2019
Proceedings of the Ultrascale Computing Systems, 2019
2018
A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms.
IEEE Trans. Parallel Distributed Syst., 2018
J. Supercomput., 2018
J. Supercomput., 2018
Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy.
IEEE Trans. Computers, 2018
Novel Model-based Methods for Performance Optimization of Multithreaded 2D Discrete Fourier Transform on Multicore Processors.
CoRR, 2018
libhclooc: Software Library Facilitating Out-of-core Implementations of Accelerator Kernels on Hybrid Computing Platforms.
CoRR, 2018
Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy.
IEEE Access, 2018
Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method.
IEEE Access, 2018
Proceedings of the High Performance Computing, 2018
Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches.
Proceedings of the 25th IEEE International Conference on High Performance Computing Workshops, 2018
2017
New Model-Based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters.
IEEE Trans. Parallel Distributed Syst., 2017
Additivity: A Selection Criterion for Performance Events for Reliable Energy Predictive Modeling.
Supercomput. Front. Innov., 2017
ACM Comput. Surv., 2017
2016
Design of a dual-hormone model predictive control for artificial pancreas with exercise model.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016
2011
Design and implementation of self-adaptable parallel algorithms for scientific computing on highly heterogeneous HPC platforms
CoRR, 2011
2010
Experimental Study of Six Different Implementations of Parallel Matrix Multiplication on Heterogeneous Computational Clusters of Multicore Processors.
Proceedings of the 18th Euromicro Conference on Parallel, 2010
2009
HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters.
Scalable Comput. Pract. Exp., 2009
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009
Two-Dimensional Matrix Partitioning for Parallel Computing on Heterogeneous Processors Based on Their Functional Performance Models.
Proceedings of the Euro-Par 2009, 2009
Distributed Data Partitioning for Heterogeneous Processors Based on Partial Estimation of Their Functional Performance Models.
Proceedings of the Euro-Par 2009, 2009
2008
Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC 2008), 2008
Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC 2008), 2008
2007
Parallel Comput., 2007
Int. J. High Perform. Comput. Appl., 2007
A Novel Algorithm of Optimal Matrix Partitioning for Parallel Dense Factorization on Heterogeneous Processors.
Proceedings of the Parallel Computing Technologies, 2007
2006
HeteroMPI: Towards a message-passing library for heterogeneous networks of computers.
J. Parallel Distributed Comput., 2006
Proceedings of the 2006 ACM Symposium on Applied Computing (SAC), 2006
HeteroMPI+ScaLAPACK: Towards a ScaLAPACK (Dense Linear Solvers) on Heterogeneous Networks of Computers.
Proceedings of the High Performance Computing, 2006
2005
Data partitioning for multiprocessors with memory heterogeneity and memory constraints.
Sci. Program., 2005
A Variable Group Block Distribution Strategy for Dense Factorizations on Networks of Heterogeneous Computers.
Proceedings of the Parallel Processing and Applied Mathematics, 2005
2004
Data Partitioning with a Realistic Performance Model of Networks of Heterogeneous Computers with Task Size Limits.
Proceedings of the 3rd International Symposium on Parallel and Distributed Computing (ISPDC 2004), 2004
Data Partitioning with a Realistic Performance Model of Networks of Heterogeneous Computers.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004
2003
Proceedings of the Parallel Processing and Applied Mathematics, 2003
Proceedings of the Parallel Computing Technologies, 2003
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
2000
A phased approach towards an open standards based, highly available, scalable architecture with asynchronous message processing.
Proceedings of the Networked Planet: Management Beyond 2000, 2000