Thomas L. Sterling

Orcid: 0009-0000-3554-9931

Affiliations:
  • Indiana University Bloomington, Center for Research in Extreme Scale Technologies, USA
  • Louisiana State University, Department of Electrical & Computer Engineering, USA
  • California Institute of Technology, Center for Advanced Computing Research, USA


According to our database1, Thomas L. Sterling authored at least 105 papers between 1986 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
AM-CCA: A Memory-Driven System for Fine-Grain and Dynamic Computations.
CoRR, 2024

Structures and Techniques for Streaming Dynamic Graph Processing on Decentralized Message-Driven Systems.
Proceedings of the Workshop Proceedings of the 53rd International Conference on Parallel Processing, 2024

2018
Continuum Computing - on a New Performance Trajectory beyond Exascale.
Supercomput. Front. Innov., 2018

TabulaROSA: Tabular Operating System Architecture for Massively Parallel Heterogeneous Compute Engines.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

2017
A Survey: Runtime Software Systems for High Performance Computing.
Supercomput. Front. Innov., 2017

Simultac Fonton: A Fine-Grain Architecture for Extreme Performance beyond Moore's Law.
Supercomput. Front. Innov., 2017

DASHMM Accelerated Adaptive Fast Multipole Poisson-Boltzmann Solver on Distributed Memory Architecture.
CoRR, 2017

Runtime System Architecture for Dynamic Adaptive Execution.
Proceedings of the Big Data and HPC: Ecosystem and Convergence, TopHPC 2017, 2017

Accelerating the 3-D FFT Using a Heterogeneous FPGA Architecture.
Proceedings of the Euro-Par 2017: Parallel Processing Workshops, 2017

2016
Keynote: The quantum step in parallel execution through dynamic adaptive runtime and programming strategies.
Proceedings of the Second International Workshop on Extreme Scale Programming Models and Middleware, 2016

Network-Managed Virtual Global Address Space for Message-driven Runtimes.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

A non von neumann continuum computer architecture for scalability beyond Moore's law.
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2015
An Autonomic Performance Environment for Exascale.
Supercomput. Front. Innov., 2015

2014
SLOWER: A performance model for Exascale computing.
Supercomput. Front. Innov., 2014

A Dynamic Execution Model Applied to Distributed Collision Detection.
Proceedings of the Supercomputing - 29th International Conference, 2014

Towards Exascale Co-design in a Runtime System.
Proceedings of the Solving Software Challenges for Exascale, 2014

2013
Exascale Computing [Guest Editorial].
Comput. Sci. Eng., 2013

Performance Modeling of Gyrokinetic Toroidal Simulations for a Many-Tasking Runtime System.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

Tabulated Equations of State with a Many-tasking Execution Model.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Towards Brain-Inspired System Architectures.
Proceedings of the Brain-Inspired Computing - International Workshop, 2013

2012
Improving the scalability of parallel <i>N</i>-body applications with an event-driven constraint-based execution model.
Int. J. High Perform. Comput. Appl., 2012

Neutron Star Evolutions using Tabulated Equations of State with a New Execution Model
CoRR, 2012

Achieving scalability in the presence of Asynchrony for Exascale Computing.
Proceedings of the Transition of HPC Towards Exascale Computing, 2012

2011
Clusters.
Proceedings of the Encyclopedia of Parallel Computing, 2011

Preliminary design examination of the ParalleX system from a software and hardware perspective.
SIGMETRICS Perform. Evaluation Rev., 2011

The International Exascale Software Project roadmap.
Int. J. High Perform. Comput. Appl., 2011

Adaptive Mesh Refinement for Astrophysics Applications with ParalleX
CoRR, 2011

An Application Driven Analysis of the ParalleX Execution Model
CoRR, 2011

Improving the scalability of parallel N-body applications with an event driven constraint based execution model
CoRR, 2011

2010
Advanced Architectures and Execution Models to Support Green Computing.
Comput. Sci. Eng., 2010

HPC in Phase Change: Towards a New Execution Model.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

Beowulf bootcamp: teaching local high schools about HPC.
Proceedings of the 2010 TeraGrid Conference, 2010

2009
The Biggest Need: a New Model of Computation.
Int. J. High Perform. Comput. Appl., 2009

Models of Computation - Enabling Exascale.
Int. J. High Perform. Comput. Appl., 2009

A High-Performance Computing Forecast: Partly Cloudy.
Comput. Sci. Eng., 2009

ParalleX.
Proceedings of the ICPPW 2009, 2009

Distance Learning with High-definition Low Latency Video and Multi-media Infrastructure.
Proceedings of the CSEDU 2009 - Proceedings of the First International Conference on Computer Supported Education, Lisboa, Portugal, March 23-26, 2009, 2009

2008
Productivity in High Performance Computing.
Adv. Comput., 2008

Improving code compression using clustered modalities.
Proceedings of the 46th Annual Southeast Regional Conference, 2008

2007
HCW Keynote Address ParalleX: An Asynchronous Execution Model for Scalable Heterogeneous Computing.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

ParalleX: A Study of A New Parallel Computation Model.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Societies of Cores and Their Computing Culture.
Proceedings of the Euro-Par 2007 Workshops: Parallel Processing, 2007

2006
Multi-core issues - Multi-Core for HPC: breakthrough or breakdown?
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Computer science - architecture - Beyond the beyond and the extremes of computing.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Exotic technologies I - HPC computational systems of 2020.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Hierarchical multithreading: programming model and system software.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005
High-performance computing: clusters, constellations, MPPs, and future directions.
Comput. Sci. Eng., 2005

Towards a Technology and Architecture Hybrid?
Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Challenges to Evaluating Petaflops Systems.
Proceedings of the Second International Conference on the Quantitative Evaluaiton of Systems (QEST 2005), 2005

2004
Productivity Metrics and Models for High Performance Computing.
Int. J. High Perform. Comput. Appl., 2004

Analysis and Modeling of Advanced PIM Architecture Design Tradeoffs.
Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004

Towards Memory Oriented Scalable Computer Architecture and High Efficiency Petaflops Computing.
Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

The "MIND" scalable PIM architecture.
Proceedings of the Grid Computing: The New Frontier of High Performance Computing [post-proceedings of the High Performance Computing Workshop, 2004

2003
InfiniBand: The "De Facto" Future Standard for System and Local Area Networks or Just a Scalable Replacement for PCI Buses?
Clust. Comput., 2003

An Executable Analytical Performance Evaluation Approach for Early Performance Prediction.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Programming Models and System Software for Future High-End Computing Systems: Work-in-Progress.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

2002
Gilgamesh: a multithreaded processor-in-memory architecture for petaflops computing.
Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

The Gilgamesh MIND Processor-in-Memory Architecture for Petaflops-Scale Computing.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

Next Generation System Software for Future High-End Computing Systems.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Launching into the Future of Commodity Cluster Computing.
Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002

2001
What Are the Top Ten Most Influential Parallel and Distributed Processing Concepts of the Past Millenium?
J. Parallel Distributed Comput., 2001

High Performance Computing Systems for Autonomous Spaceborne Missions.
Int. J. High Perform. Comput. Appl., 2001

An Introduction to PC Clusters for High Performance Computing.
Int. J. High Perform. Comput. Appl., 2001

Continuum computer architecture for exaflops computation.
Commun. ACM, 2001

An Introduction to the Gilgamesh PIM Architecture.
Proceedings of the Euro-Par 2001: Parallel Processing, 2001

2000
Characterizing and representing workloads for parallel computer architectures.
J. Syst. Archit., 2000

Symbolic Computing with Beowulf-Class PC Clusters.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2000

Macroservers: An Object-Based Programming and Execution Model for Processor-in-Memory Arrays.
Proceedings of the High Performance Computing, Third International Symposium, 2000

Support for Irregular Computations in Massively Parallel PIM Arrays, Using an Object-Based Execution Model.
Proceedings of the Parallel and Distributed Processing, 2000

PC Clusters - The Convergence of High Performance Systems Architecture.
Proceedings of the 2000 IEEE International Conference on Cluster Computing (CLUSTER 2000), November 28th, 2000

1999
A Vector-space Model for Parallel Workload Characterization.
J. King Saud Univ. Comput. Inf. Sci., 1999

From Toys to Teraflops: Bridging the Beowulf Gap.
Int. J. High Perform. Comput. Appl., 1999

A design analysis of a hybrid technology multithreaded architecture for petaflops scale computation3.
Proceedings of the 13th international conference on Supercomputing, 1999

Microservers: a new memory semantics for massively parallel computing.
Proceedings of the 13th international conference on Supercomputing, 1999

A Coming of Age for Beowulf-Class Computing.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

1998
Scaling of Beowulf-class Distributed Systems.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1998

Beowulf PC clusters: breaking the cost barrier to high end application computing (keynote address - abstract).
Proceedings of the 1998 ACM symposium on Applied Computing, 1998

Convergence Points on Commercial Parallel Systems: Do We Have the Node Architecture? Do We Have the Network? Do We Have the Programming Paradigm?
Proceedings of the 1998 International Conference on Parallel Processing (ICPP '98), 1998

Scaling of Beowulf-Class Distributed Systems.
Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing, 1998

1997
Halo World: Tools for Parallel Cluster Finding in Astrophysical N-body Simulations.
Data Min. Knowl. Discov., 1997

Pentium Pro Inside: I. A Treecode at 430 Gigaflops on ASCI Red, II. Price/Performance of $50/Mflop on Loki and Hyglac.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1997

Parallel Supercomputing with Commodity Components.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1997

1996
An Empirical Evaluation of the Convex SPP-1000 Hierarchical Shared Memory System.
Int. J. Parallel Program., 1996

Improving Application Performance on the HP/Convex Exemplar.
Computer, 1996

The Scientific Workstation of the Future May Be a Pile of PCs.
Commun. ACM, 1996

A Quantitative Approach for Architecture-Invariant Parallel Workload Characterization.
Proceedings of the Applied Parallel Computing, 1996

Achieving a Balanced Low-Cost Architecture for Mass Storage Management through Multiple Fast Ethernet Channels on the Beowulf Parallel Workstation.
Proceedings of IPPS '96, 1996

A Design Study of Alternative Network Topologies for the Beowulf Parallel Workstation.
Proceedings of the 5th International Symposium on High Performance Distributed Computing (HPDC '96), 1996

1995
An initial evaluation of the Convex SPP-1000 for earth and space science applications.
Future Gener. Comput. Syst., 1995

A Performance Evaluation of the Convex SPP-1000 Scalable Shared Memory Parallel Computer.
Proceedings of the Proceedings Supercomputing '95, San Diego, CA, USA, December 4-8, 1995, 1995

System Software and Tools for High-Performance Computing Environments.
Proceedings of the Proceedings Supercomputing '95, San Diego, CA, USA, December 4-8, 1995, 1995

BEOWULF: A Parallel Workstation for Scientific Computation.
Proceedings of the 1995 International Conference on Parallel Processing, 1995

Communication Overhead for Space Science Applications on the Beowulf Parallel Workstation.
Proceedings of the 4th International Symposium on High Performance Distributed Computing (HPDC '95), 1995

An Initial Evaluation of the Convex SPP-1000 for Earth and Space Science Application.
Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture (HPCA 1995), 1995

1994
Findings of the joint NSF-NASA initiative in evaluation.
Proceedings of the Proceedings Supercomputing '94, 1994

JNNIE: Evaluation of Scalable Parallel Processing Architectures.
Proceedings of the High-Performance Computing and Networking, 1994

1993
Fine Grain Dataflow Computation without Tokens for Balanced Execution.
J. Parallel Distributed Comput., 1993

The Realities of Parallel Processing and Dataflow's Role in It: Lessons from the NASA HPCC Program.
Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism, 1993

1992
The Effectiveness of Random Mapping on Fine-Grain MIMD Architectures.
Proceedings of the 1992 International Conference on Parallel Processing, 1992

1988
Tokenless static data flow using associative templates.
Proceedings of the Proceedings Supercomputing '88, Orlando, FL, USA, November 12-17, 1988, 1988

Multiprocessor Performance Measurement Using Embedded Instrumentation.
Proceedings of the International Conference on Parallel Processing, 1988

A Practical Static Data Flow Computer Based on Associative Methods.
Proceedings of the International Conference on Parallel Processing, 1988

Efficient Dynamic Scheduling of Medium-Grained Tasks for General Purpose Parallel Processing.
Proceedings of the International Conference on Parallel Processing, 1988

1987
Effective Implementation of a Parallel Language on a Multiprocessor.
IEEE Micro, 1987

1986
Concert: Design of a Multiprocessor Development System.
Proceedings of the 13th Annual Symposium on Computer Architecture, Tokyo, Japan, June 1986, 1986


  Loading...