Stephen Olivier

Orcid: 0000-0001-6247-8980

According to our database1, Stephen Olivier authored at least 57 papers between 2006 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Performance Insights Into Supporting Kokkos Views in the Kokkos Comm MPI Library.
Proceedings of the IEEE International Conference on Cluster Computing, 2024

2023
Enabling power measurement and control on Astra: The first petascale Arm supercomputer.
Concurr. Comput. Pract. Exp., 2023

Observed Memory Bandwidth and Power Usage on FPGA Platforms with OneAPI and Vitis HLS: A Comparison with GPUs.
Proceedings of the High Performance Computing, 2023

Latency and Bandwidth Microbenchmarks of US Department of Energy Systems in the June 2023 Top 500 List.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

View-aware Message Passing Through the Integration of Kokkos and ExaMPI.
Proceedings of the 30th European MPI Users' Group Meeting, 2023

The Kokkos OpenMPTarget Backend: Implementation and Lessons Learned.
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023

Latency and Bandwidth Microbenchmarks of Six US Department of Energy Systems in the Top500.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

Performance Insights into Device-initiated RMA Using Kokkos Remote Spaces.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022
Characterizing the Performance of Task Reductions in OpenMP 5.X Implementations.
Proceedings of the OpenMP in a Modern World: From Multi-device Support to Meta Programming, 2022

MultiGrid on FPGA Using Data Parallel C++.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

2021
Publisher Correction: Computer-aided interpretation of chest radiography reveals the spectrum of tuberculosis in rural South Africa.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
npj Digit. Medicine, 2021

Computer-aided interpretation of chest radiography reveals the spectrum of tuberculosis in rural South Africa.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
npj Digit. Medicine, 2021

Performance Portability of an SpMV Kernel Across Scientific Computing and Data Science Applications.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

2020
ALAMO: Autonomous Lightweight Allocation, Management, and Optimization.
Proceedings of the Driving Scientific and Engineering Discoveries Through the Convergence of HPC, Big Data and AI, 2020

Implementing Flexible Threading Support in Open MPI.
Proceedings of the Workshop on Exascale MPI, 2020

Cache Oblivious Strategies to Exploit Multi-Level Memory on Manycore Systems.
Proceedings of the IEEE/ACM Workshop on Memory Centric High Performance Computing, 2020

Evaluating the Efficiency of OpenMP Tasking for Unbalanced Computation on Diverse CPU Architectures.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

Exploring Chapel Productivity Using Some Graph Algorithms.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Small scale to extreme: Methods for characterizing energy efficiency in supercomputing applications.
Sustain. Comput. Informatics Syst., 2019

Scalable generation of graphs for benchmarking HPC community-detection algorithms.
Proceedings of the International Conference for High Performance Computing, 2019

Making OpenMP Ready for C++ Executors.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

2018
The Ongoing Evolution of OpenMP.
Proc. IEEE, 2018

Assessing Task-to-Data Affinity in the LLVM OpenMP Runtime.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

A Comparison of Power Management Mechanisms: P-States vs. Node-Level Power Cap Control.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Optimizing for KNL Usage Modes When Data Doesn't Fit in MCDRAM.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
OpenMPIR: Implementing OpenMP Tasks with Tapir.
Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC, 2017

Improving Energy Efficiency in Memory-constrained Applications Using Core-specific Power Control.
Proceedings of the 5th International Workshop on Energy Efficient Supercomputing, 2017

Double Buffering for MCDRAM on Second Generation $$\hbox {Intel}^{\circledR }$$ Xeon Phi $$^{\text {TM}}$$ Processors with OpenMP.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

An Adaptive Core-Specific Runtime for Energy Efficiency.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Scheduling Chapel Tasks with Qthreads on Manycore: A Tale of Two Schedulers.
Proceedings of the 7th International Workshop on Runtime and Operating Systems for Supercomputers, 2017

Evaluating energy and power profiling techniques for HPC workloads.
Proceedings of the Eighth International Green and Sustainable Computing Conference, 2017

2016
Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout.
CoRR, 2016

Standardizing Power Monitoring and Control at Exascale.
Computer, 2016

Cactus Environment Machine - Shared Environment Call-by-Need.
Proceedings of the Trends in Functional Programming - 17th International Conference, 2016

Approaches for Task Affinity in OpenMP.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

Overcoming Challenges in Scalable Power Monitoring with the Power API.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Kokkos/Qthreads task-parallel approach to linear algebra based graph analytics.
Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

2015
Early experiences with node-level power capping on the Cray XC40 platform.
Proceedings of the 3rd International Workshop on Energy Efficient Supercomputing, 2015

Toward an evolutionary task parallel integrated MPI + X programming model.
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

Towards Task-Parallel Reductions in OpenMP.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

2014
Using a complementary emulation-simulation co-design approach to assess application readiness for processing-in-memory systems.
Proceedings of the 1st International Workshop on Hardware-Software Co-Design for High Performance Computing, 2014

Early experiences co-scheduling work and communication tasks for hybrid MPI+X applications.
Proceedings of the 2014 Workshop on Exascale MPI, 2014

Metrics for Evaluating Energy Saving Techniques for Resilient HPC Systems.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Exploiting Geometric Partitioning in Task Mapping for Parallel Computers.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

2013
Characterizing and mitigating work time inflation in task parallel programs.
Sci. Program., 2013

A Proposal for Task-Generating Loops in OpenMP.
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

Power Measurement and Concurrency Throttling for Energy Reduction in OpenMP Programs.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

HIPS Introduction.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

2012
Locality Awareness for Task Parallel Computation.
PhD thesis, 2012

OpenMP task scheduling strategies for multicore NUMA systems.
Int. J. High Perform. Comput. Appl., 2012

2010
Comparison of OpenMP 3.0 and Other Task Parallel Frameworks on Unbalanced Task Graphs.
Int. J. Parallel Program., 2010

2009
Evaluating OpenMP 3.0 Run Time Systems on Unbalanced Task Graphs.
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

2008
A message passing benchmark for unbalanced applications.
Simul. Model. Pract. Theory, 2008

Scalable Dynamic Load Balancing Using UPC.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

2007
Porting the GROMACS Molecular Dynamics Code to the Cell Processor.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Dynamic Load Balancing of Unbalanced Computations Using Message Passing.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006
UTS: An Unbalanced Tree Search Benchmark.
Proceedings of the Languages and Compilers for Parallel Computing, 2006


  Loading...