Thomas Scogland

Orcid: 0000-0001-7234-5743

According to our database, Thomas Scogland authored at least 59 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.



Bibliography

2024
sqlelf: a SQL-centric Approach to ELF Analysis.
CoRR, 2024

Enabling RAJA on Intel GPUs with SYCL.
Proceedings of the 12th International Workshop on OpenCL and SYCL, 2024

Shared Virtual Memory: Its Design and Performance Implications for Diverse Applications.
Proceedings of the 38th ACM International Conference on Supercomputing, 2024

BLP: Block-Level Pipelining for GPUs.
Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024

2023
GPU First - Execution of Legacy CPU Codes on GPUs.
CoRR, 2023

OpenMP Kernel Language Extensions for Performance Portable GPU Codes.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Fluxion: A Scalable Graph-Based Resource Model for HPC Scheduling Challenges.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

2022
OpenMP application experiences: Porting to accelerated nodes.
Parallel Comput., 2022

An analytical performance model of generalized hierarchical scheduling.
Int. J. High Perform. Comput. Appl., 2022

Reliabuild: Searching for High-Fidelity Builds Using Active Learning.
CoRR, 2022

Mapping Out the HPC Dependency Chaos.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Piper: Pipelining OpenMP Offloading Execution Through Compiler Optimization For Performance.
Proceedings of the IEEE/ACM International Workshop on Performance, 2022

Searching for High-Fidelity Builds Using Active Learning.
Proceedings of the 19th IEEE/ACM International Conference on Mining Software Repositories, 2022

Extending OpenMP to Support Automated Function Specialization Across Translation Units.
Proceedings of the OpenMP in a Modern World: From Multi-device Support to Meta Programming, 2022

Breaking the Vendor Lock: Performance Portable Programming through OpenMP as Target Independent Runtime Layer.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021
Beyond Explicit Transfers: Shared and Managed Memory in OpenMP.
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

Inter-loop optimization in RAJA using loop chains.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

2020
Flux: Overcoming scheduling challenges for exascale workflows.
Future Gener. Comput. Syst., 2020

Unified Sequential Optimization Directives in OpenMP.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

FAROS: A Framework to Analyze OpenMP Compilation Through Benchmarking and Compiler Optimization Analysis.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

2019
RAJA: Portable Performance for Large-Scale Scientific Applications.
Proceedings of the 2019 IEEE/ACM International Workshop on Performance, 2019

Performance portable C++ programming with RAJA.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

A Framework for Enabling OpenMP Autotuning.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

Making OpenMP Ready for C++ Executors.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

Extending OpenMP Metadirective Semantics for Runtime Adaptation.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

2018
The Ongoing Evolution of OpenMP.
Proc. IEEE, 2018

Introduction.
Int. J. High Perform. Comput. Appl., 2018

Extending OpenMP to Facilitate Loop Optimization.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

2017
The Design and Implementation of OpenMP 4.5 and OpenACC Backends for the RAJA C++ Performance Portability Layer.
Proceedings of the Accelerator Programming Using Directives - 4th International Workshop, 2017

Custom Data Mapping for Composable Data Management.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

Directive-Based Partitioning and Pipelining for Graphics Processing Units.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016
A Case for Extending Task Dependencies.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

Scalable I/O-Aware Job Scheduling for Burst Buffer Enabled HPC Clusters.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

Directive-Based Pipelining Extension for OpenMP.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

2015
CoreTSAR: Core Task-Size Adapting Runtime.
IEEE Trans. Parallel Distributed Syst., 2015

Design and Evaluation of Scalable Concurrent Queues for Many-Core Architectures.
Proceedings of the 6th ACM/SPEC International Conference on Performance Engineering, Austin, TX, USA, January 31, 2015

Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG.
Proceedings of the International Conference for High Performance Computing, 2015

Supporting Indirect Data Mapping in OpenMP.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Enabling Region Merging Optimizations in OpenMP.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

2014
Runtime Adaptation for Autonomic Heterogeneous Computing.
PhD thesis, 2014

A power-measurement methodology for large-scale, high-performance computing.
Proceedings of the ACM/SPEC International Conference on Performance Engineering, 2014

CoreTSAR: Adaptive Worksharing for Heterogeneous Systems.
Proceedings of the Supercomputing - 29th International Conference, 2014

Runtime Adaptation for Autonomic Heterogeneous Computing.
Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014

Locality-aware memory association for multi-target worksharing in OpenMP.
Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013
The Green500 list: escapades to exascale.
Comput. Sci. Res. Dev., 2013

On the Programmability and Performance of Heterogeneous Platforms.
Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems, 2013

Trends in energy-efficient computing: A perspective from the Green500.
Proceedings of the International Green Computing Conference, 2013

2012
OpenCL and the 13 dwarfs: a work in progress.
Proceedings of the Third Joint WOSP/SIPEW International Conference on Performance Engineering, 2012

Heterogeneous Task Scheduling for Accelerated OpenMP.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011
Emerging Trends on the Evolving Green500: Year Three.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

StreamMR: An Optimized MapReduce Framework for AMD GPUs.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Architecture-Aware Mapping and Optimization on a 1600-Core GPU.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Towards accelerating molecular modeling via multi-scale approximation on a GPU.
Proceedings of the IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, 2011

2010
A first look at integrated GPUs for green high-performance computing.
Comput. Sci. Res. Dev., 2010

2009
The Green500 List: Year one.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Multi-dimensional characterization of temporal data mining on graphics processors.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

2008
Asymmetric interactions in symmetric multi-core systems: analysis, enhancements and evaluation.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

