Simon McIntosh-Smith
Orcid: 0000-0002-5312-0378Affiliations:
- University of Bristol
According to our database1,
Simon McIntosh-Smith
authored at least 78 papers
between 1994 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on twitter.com
-
on orcid.org
-
on d-nb.info
On csauthors.net:
Bibliography
2024
Isambard-AI: a leadership class supercomputer optimised specifically for Artificial Intelligence.
CoRR, 2024
Preliminary report: Initial evaluation of StdPar implementations on AMD GPUs for HPC.
CoRR, 2024
Assessing the GPU Offload Threshold of GEMM and GEMV Kernels on Modern Heterogeneous HPC Systems.
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
Federated Single Sign-On and Zero Trust Co-design for AI and HPC Digital Research Infrastructures.
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
Optimisation and Evaluation of Breadth First Search with oneAPI/SYCL on Intel FPGAs: from Describing Algorithms to Describing Architectures.
Proceedings of the 12th International Workshop on OpenCL and SYCL, 2024
2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Time Machine: Generative Real-Time Model for Failure (and Lead Time) Prediction in HPC Systems.
Proceedings of the 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Network, 2023
2022
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022
Proceedings of the IEEE/ACM International Workshop on Performance, 2022
2021
A Performance Analysis of Modern Parallel Programming Models Using a Compute-Bound Application.
Proceedings of the High Performance Computing - 36th International Conference, 2021
Applying Recent Machine Learning Approaches to Accelerate the Algebraic Multigrid Method for Fluid Simulations.
Proceedings of the Driving Scientific and Engineering Discoveries Through the Integration of Experiment, Big Data, and Modeling and Simulation, 2021
Proceedings of the IEEE/ACM International Workshop on Hierarchical Parallelism for Exascale Computing, 2021
Proceedings of the 2021 International Workshop on Performance Modeling, 2021
Proceedings of the International Workshop on Performance, 2021
Proceedings of the International Workshop on Performance, 2021
On measuring the maturity of SYCL implementations by tracking historical performance improvements.
Proceedings of the IWOCL'21: International Workshop on OpenCL, Munich Germany, April, 2021, 2021
2020
Concurr. Comput. Pract. Exp., 2020
Proceedings of the Driving Scientific and Engineering Discoveries Through the Convergence of HPC, Big Data and AI, 2020
Proceedings of the IEEE/ACM International Workshop on Performance, 2020
Proceedings of the IEEE/ACM International Workshop on Performance, 2020
Proceedings of the IEEE/ACM Workshop on Memory Centric High Performance Computing, 2020
Enabling System Wide Shared Memory for Performance Improvement in PyCOMPSs Applications.
Proceedings of the 9th IEEE/ACM Workshop on Python for High-Performance and Scientific Computing, 2020
Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020
Proceedings of the Euro-Par 2020: Parallel Processing, 2020
Proceedings of the IEEE International Conference on Cluster Computing, 2020
2019
J. Signal Process. Syst., 2019
Concurr. Comput. Pract. Exp., 2019
Exploiting Hardware-Accelerated Ray Tracing for Monte Carlo Particle Transport with OpenMC.
Proceedings of the 2019 IEEE/ACM Performance Modeling, 2019
Proceedings of the 2019 IEEE/ACM International Workshop on Performance, 2019
2018
Int. J. High Perform. Comput. Appl., 2018
Int. J. High Perform. Comput. Appl., 2018
Evaluating attainable memory bandwidth of parallel programming models via BabelStream.
Int. J. Comput. Sci. Eng., 2018
Proceedings of the Euro-Par 2018: Parallel Processing Workshops, 2018
ASPEN: An Efficient Algorithm for Data Redistribution Between Producer and Consumer Grids.
Proceedings of the Euro-Par 2018: Parallel Processing Workshops, 2018
Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018
UnSNAP: A Mini-App for Exploring the Performance of Deterministic Discrete Ordinates Transport on Unstructured Meshes.
Proceedings of the IEEE International Conference on Cluster Computing, 2018
2017
Assessing the performance portability of modern parallel programming models using TeaLeaf.
Concurr. Comput. Pract. Exp., 2017
Exploiting Auto-tuning to Analyze and Improve Performance Portability on Many-Core Architectures.
Proceedings of the High Performance Computing, 2017
On the Mitigation of Cache Hostile Memory Access Patterns on Many-Core CPU Architectures.
Proceedings of the High Performance Computing, 2017
A Survey of Application Memory Usage on a National Supercomputer: An Analysis of Memory Requirements on ARCHER.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2017
The Productivity, Portability and Performance of OpenMP 4.5 for Scientific Applications Targeting Intel CPUs, IBM CPUs, and NVIDIA GPUs.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017
On the Performance of Parallel Tasking Runtimes for an Irregular Fast Multipole Method Application.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017
Analyzing and improving performance portability of OpenCL applications via auto-tuning.
Proceedings of the 5th International Workshop on OpenCL, 2017
Application-Based Fault Tolerance Techniques for Fully Protecting Sparse Matrix Solvers.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
TeaLeaf: A Mini-Application to Enable Design-Space Explorations for Iterative Sparse Linear Solvers.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
The Arch Project: Physics Mini-Apps for Algorithmic Exploration and Evaluating Programming Environments on HPC Architectures.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
Exploring On-Node Parallelism with Neutral, a Monte Carlo Neutral Particle Transport Mini-App.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
2016
GPU-STREAM v2.0: Benchmarking the Achievable Memory Bandwidth of Many-Core Processors Across Diverse Parallel Programming Models.
Proceedings of the High Performance Computing, 2016
Proceedings of the High Performance Computing - 31st International Conference, 2016
Proceedings of the 7th International Workshop on Performance Modeling, 2016
Unprotected computing: a large-scale study of DRAM raw error rate on a supercomputer.
Proceedings of the International Conference for High Performance Computing, 2016
Proceedings of the 7th International Workshop on Programming Models and Applications for Multicores and Manycores, 2016
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016
2015
Int. J. High Perform. Comput. Appl., 2015
Proceedings of the Parallel Computing: On the Road to Exascale, 2015
Improving Auto-Tuning Convergence Times with Dynamically Generated Predictive Performance Models.
Proceedings of the IEEE 9th International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2015
Proceedings of the 3rd International Workshop on OpenCL, 2015
Proceedings of the 3rd International Workshop on OpenCL, 2015
Proceedings of the 3rd International Workshop on OpenCL, 2015
Exploiting Spatial Information in Datasets to Enable Fault Tolerant Sparse Matrix Solvers.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
2014
On the Performance Portability of Structured Grid Codes on Many-Core Computer Architectures.
Proceedings of the Supercomputing - 29th International Conference, 2014
Proceedings of the Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, 2014
Proceedings of the International Workshop on OpenCL, 2014
Proceedings of the International Workshop on OpenCL, 2014
2013
Special issue of the Journal of Parallel and Distributed Computing (JDPC) on novel architectures for high-performance computing.
J. Parallel Distributed Comput., 2013
2012
Benchmarking Energy Efficiency, Power Costs and Carbon Emissions on Heterogeneous Systems.
Comput. J., 2012
Proceedings of the 2012 SC Companion: High Performance Computing, 2012
2011
SIGMETRICS Perform. Evaluation Rev., 2011
2010
J. Comput. Chem., 2010
2008
Proceedings of the Spring Conference on Computer Graphics, 2008
1994
Intelligent Algorithm Decomposition for Parallelism.
Proceedings of the Massively Parallel Processing Applications and Develompent, 1994