Thomas L. Sterling

Proceedings of the Workshop Proceedings of the 53rd International Conference on Parallel Processing, 2024

2018

Continuum Computing - on a New Performance Trajectory beyond Exascale.

[BibT_eX]

[DOI]

Supercomput. Front. Innov., 2018

TabulaROSA: Tabular Operating System Architecture for Massively Parallel Heterogeneous Compute Engines.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

2017

A Survey: Runtime Software Systems for High Performance Computing.

[BibT_eX]

[DOI]

Supercomput. Front. Innov., 2017

Simultac Fonton: A Fine-Grain Architecture for Extreme Performance beyond Moore's Law.

[BibT_eX]

[DOI]

Supercomput. Front. Innov., 2017

DASHMM Accelerated Adaptive Fast Multipole Poisson-Boltzmann Solver on Distributed Memory Architecture.

[BibT_eX]

[DOI]

CoRR, 2017

Runtime System Architecture for Dynamic Adaptive Execution.

[BibT_eX]

[DOI]

Bo Zhang

Proceedings of the Big Data and HPC: Ecosystem and Convergence, TopHPC 2017, 2017

Accelerating the 3-D FFT Using a Heterogeneous FPGA Architecture.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2017: Parallel Processing Workshops, 2017

2016

Keynote: The quantum step in parallel execution through dynamic adaptive runtime and programming strategies.

[BibT_eX]

[DOI]

Proceedings of the Second International Workshop on Extreme Scale Programming Models and Middleware, 2016

Network-Managed Virtual Global Address Space for Message-driven Runtimes.

[BibT_eX]

[DOI]

Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

A non von neumann continuum computer architecture for scalability beyond Moore's law.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2015

An Autonomic Performance Environment for Exascale.

[BibT_eX]

[DOI]

Supercomput. Front. Innov., 2015

2014

SLOWER: A performance model for Exascale computing.

[BibT_eX]

[DOI]

Supercomput. Front. Innov., 2014

A Dynamic Execution Model Applied to Distributed Collision Detection.

[BibT_eX]

[DOI]

Proceedings of the Supercomputing - 29th International Conference, 2014

Towards Exascale Co-design in a Runtime System.

[BibT_eX]

[DOI]

Proceedings of the Solving Software Challenges for Exascale, 2014

2013

Exascale Computing [Guest Editorial].

[BibT_eX]

[DOI]

Steven Gottlieb

Comput. Sci. Eng., 2013

Performance Modeling of Gyrokinetic Toroidal Simulations for a Many-Tasking Runtime System.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

Tabulated Equations of State with a Many-tasking Execution Model.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Towards Brain-Inspired System Architectures.

[BibT_eX]

[DOI]

Timur Gilmanov

Proceedings of the Brain-Inspired Computing - International Workshop, 2013

2012

Improving the scalability of parallel <i>N</i>-body applications with an event-driven constraint-based execution model.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2012

Neutron Star Evolutions using Tabulated Equations of State with a New Execution Model

[BibT_eX]

[DOI]

Hartmut Kaiser

CoRR, 2012

Achieving scalability in the presence of Asynchrony for Exascale Computing.

[BibT_eX]

[DOI]

Proceedings of the Transition of HPC Towards Exascale Computing, 2012

2011

Clusters.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Parallel Computing, 2011

Preliminary design examination of the ParalleX system from a software and hardware perspective.

[BibT_eX]

[DOI]

SIGMETRICS Perform. Evaluation Rev., 2011

The International Exascale Software Project roadmap.

[BibT_eX]

[DOI]

Bertrand Braunschweig

Int. J. High Perform. Comput. Appl., 2011

Adaptive Mesh Refinement for Astrophysics Applications with ParalleX

[BibT_eX]

[DOI]

Hartmut Kaiser

CoRR, 2011

An Application Driven Analysis of the ParalleX Execution Model

[BibT_eX]

[DOI]

CoRR, 2011

Improving the scalability of parallel N-body applications with an event driven constraint based execution model

[BibT_eX]

[DOI]

CoRR, 2011

2010

Advanced Architectures and Execution Models to Support Green Computing.

[BibT_eX]

[DOI]

Richard Murphy

Chirag Dekate

Comput. Sci. Eng., 2010

HPC in Phase Change: Towards a New Execution Model.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

Beowulf bootcamp: teaching local high schools about HPC.

[BibT_eX]

[DOI]

Proceedings of the 2010 TeraGrid Conference, 2010

2009

The Biggest Need: a New Model of Computation.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2009

Models of Computation - Enabling Exascale.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2009

A High-Performance Computing Forecast: Partly Cloudy.

[BibT_eX]

[DOI]

Dylan Stark

Comput. Sci. Eng., 2009

ParalleX.

[BibT_eX]

[DOI]

Hartmut Kaiser

Proceedings of the ICPPW 2009, 2009

Distance Learning with High-definition Low Latency Video and Multi-media Infrastructure.

[BibT_eX]

Proceedings of the CSEDU 2009 - Proceedings of the First International Conference on Computer Supported Education, Lisboa, Portugal, March 23-26, 2009, 2009

2008

Productivity in High Performance Computing.

[BibT_eX]

Chirag Dekate

Adv. Comput., 2008

Improving code compression using clustered modalities.

[BibT_eX]

[DOI]

Chris J. Michael

Proceedings of the 46th Annual Southeast Regional Conference, 2008

2007

HCW Keynote Address ParalleX: An Asynchronous Execution Model for Scalable Heterogeneous Computing.

[BibT_eX]

[DOI]

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

ParalleX: A Study of A New Parallel Computation Model.

[BibT_eX]

[DOI]

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Societies of Cores and Their Computing Culture.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2007 Workshops: Parallel Processing, 2007

2006

Multi-core issues - Multi-Core for HPC: breakthrough or breakdown?

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Computer science - architecture - Beyond the beyond and the extremes of computing.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Exotic technologies I - HPC computational systems of 2020.

[BibT_eX]

[DOI]

Erik DeBenedictis

Fernand (Doc) Bedard

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Hierarchical multithreading: programming model and system software.

[BibT_eX]

[DOI]

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005

High-performance computing: clusters, constellations, MPPs, and future directions.

[BibT_eX]

[DOI]

Comput. Sci. Eng., 2005

Towards a Technology and Architecture Hybrid?

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Challenges to Evaluating Petaflops Systems.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on the Quantitative Evaluaiton of Systems (QEST 2005), 2005

2004

Productivity Metrics and Models for High Performance Computing.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2004

Analysis and Modeling of Advanced PIM Architecture Design Tradeoffs.

[BibT_eX]

[DOI]

Ed T. Upchurch

Jay B. Brockman

Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004

Towards Memory Oriented Scalable Computer Architecture and High Efficiency Petaflops Computing.

[BibT_eX]

[DOI]

Proceedings of the Network and Parallel Computing, IFIP International Conference, 2004

The "MIND" scalable PIM architecture.

[BibT_eX]

[DOI]

Proceedings of the Grid Computing: The New Frontier of High Performance Computing [post-proceedings of the High Performance Computing Workshop, 2004

2003

InfiniBand: The "De Facto" Future Standard for System and Local Area Networks or Just a Scalable Replacement for PCI Buses?

[BibT_eX]

[DOI]

Timothy Mark Pinkston

Clust. Comput., 2003

An Executable Analytical Performance Evaluation Approach for Early Performance Prediction.

[BibT_eX]

[DOI]

Ramaswamy Govindarajan

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Programming Models and System Software for Future High-End Computing Systems: Work-in-Progress.

[BibT_eX]

[DOI]

Guang R. Gao

Kevin B. Theobald

Ramaswamy Govindarajan

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

2002

Gilgamesh: a multithreaded processor-in-memory architecture for petaflops computing.

[BibT_eX]

[DOI]

Hans P. Zima

Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

The Gilgamesh MIND Processor-in-Memory Architecture for Petaflops-Scale Computing.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 4th International Symposium, 2002

Next Generation System Software for Future High-End Computing Systems.

[BibT_eX]

[DOI]

Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Launching into the Future of Commodity Cluster Computing.

[BibT_eX]

[DOI]

Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002

2001

What Are the Top Ten Most Influential Parallel and Distributed Processing Concepts of the Past Millenium?

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2001

High Performance Computing Systems for Autonomous Spaceborne Missions.

[BibT_eX]

[DOI]

Daniel S. Katz

Larry A. Bergman

Int. J. High Perform. Comput. Appl., 2001

An Introduction to PC Clusters for High Performance Computing.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2001

Continuum computer architecture for exaflops computation.

[BibT_eX]

[DOI]

Commun. ACM, 2001

An Introduction to the Gilgamesh PIM Architecture.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2001: Parallel Processing, 2001

2000

Characterizing and representing workloads for parallel computer architectures.

[BibT_eX]

[DOI]

Abdullah I. AlMojel

Tarek A. El-Ghazawi

J. Syst. Archit., 2000

Symbolic Computing with Beowulf-Class PC Clusters.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2000

Macroservers: An Object-Based Programming and Execution Model for Processor-in-Memory Arrays.

[BibT_eX]

[DOI]

Hans P. Zima

Proceedings of the High Performance Computing, Third International Symposium, 2000

Support for Irregular Computations in Massively Parallel PIM Arrays, Using an Object-Based Execution Model.

[BibT_eX]

[DOI]

Hans P. Zima

Proceedings of the Parallel and Distributed Processing, 2000

PC Clusters - The Convergence of High Performance Systems Architecture.

[BibT_eX]

[DOI]

Proceedings of the 2000 IEEE International Conference on Cluster Computing (CLUSTER 2000), November 28th, 2000

1999

A Vector-space Model for Parallel Workload Characterization.

[BibT_eX]

[DOI]

Abdullah I. Meajil

Tarek A. El-Ghazawi

J. King Saud Univ. Comput. Inf. Sci., 1999

From Toys to Teraflops: Bridging the Beowulf Gap.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 1999

A design analysis of a hybrid technology multithreaded architecture for petaflops scale computation3.

[BibT_eX]

[DOI]

Larry A. Bergman

Proceedings of the 13th international conference on Supercomputing, 1999

Microservers: a new memory semantics for massively parallel computing.

[BibT_eX]

[DOI]

Proceedings of the 13th international conference on Supercomputing, 1999

A Coming of Age for Beowulf-Class Computing.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

1998

Scaling of Beowulf-class Distributed Systems.

[BibT_eX]

[DOI]

John K. Salmon

Christopher Stein

Proceedings of the ACM/IEEE Conference on Supercomputing, 1998

Beowulf PC clusters: breaking the cost barrier to high end application computing (keynote address - abstract).

[BibT_eX]

[DOI]

Proceedings of the 1998 ACM symposium on Applied Computing, 1998

Convergence Points on Commercial Parallel Systems: Do We Have the Node Architecture? Do We Have the Network? Do We Have the Programming Paradigm?

[BibT_eX]

[DOI]

Proceedings of the 1998 International Conference on Parallel Processing (ICPP '98), 1998

Scaling of Beowulf-Class Distributed Systems.

[BibT_eX]

[DOI]

Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing, 1998

1997

Halo World: Tools for Parallel Cluster Finding in Astrophysical N-body Simulations.

[BibT_eX]

[DOI]

David W. Pfitzner

John K. Salmon

Data Min. Knowl. Discov., 1997

Pentium Pro Inside: I. A Treecode at 430 Gigaflops on ASCI Red, II. Price/Performance of $50/Mflop on Loki and Hyglac.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on Supercomputing, 1997

Parallel Supercomputing with Commodity Components.

[BibT_eX]

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1997

1996

An Empirical Evaluation of the Convex SPP-1000 Hierarchical Shared Memory System.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1996

Improving Application Performance on the HP/Convex Exemplar.

[BibT_eX]

[DOI]

Phillip Merkey

Computer, 1996

The Scientific Workstation of the Future May Be a Pile of PCs.

[BibT_eX]

[DOI]

Commun. ACM, 1996

A Quantitative Approach for Architecture-Invariant Parallel Workload Characterization.

[BibT_eX]

[DOI]

Abdullah I. Meajil

Tarek A. El-Ghazawi

Proceedings of the Applied Parallel Computing, 1996

Achieving a Balanced Low-Cost Architecture for Mass Storage Management through Multiple Fast Ethernet Channels on the Beowulf Parallel Workstation.

[BibT_eX]

[DOI]

Proceedings of IPPS '96, 1996

A Design Study of Alternative Network Topologies for the Beowulf Parallel Workstation.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on High Performance Distributed Computing (HPDC '96), 1996

1995

An initial evaluation of the Convex SPP-1000 for earth and space science applications.

[BibT_eX]

[DOI]

Thomas Lawrence Sterling

Phillip R. Merkey

Jeffrey P. Gardner

Future Gener. Comput. Syst., 1995

A Performance Evaluation of the Convex SPP-1000 Scalable Shared Memory Parallel Computer.

[BibT_eX]

[DOI]

Proceedings of the Proceedings Supercomputing '95, San Diego, CA, USA, December 4-8, 1995, 1995

System Software and Tools for High-Performance Computing Environments.

[BibT_eX]

[DOI]

Paul Messina

James C. T. Pool

Proceedings of the Proceedings Supercomputing '95, San Diego, CA, USA, December 4-8, 1995, 1995

BEOWULF: A Parallel Workstation for Scientific Computation.

[BibT_eX]

Proceedings of the 1995 International Conference on Parallel Processing, 1995

Communication Overhead for Space Science Applications on the Beowulf Parallel Workstation.

[BibT_eX]

[DOI]

Proceedings of the 4th International Symposium on High Performance Distributed Computing (HPDC '95), 1995

An Initial Evaluation of the Convex SPP-1000 for Earth and Space Science Application.

[BibT_eX]

[DOI]

Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture (HPCA 1995), 1995

1994

Findings of the joint NSF-NASA initiative in evaluation.

[BibT_eX]

[DOI]

Steven Hotovy

Wayne Pfeiffer

Proceedings of the Proceedings Supercomputing '94, 1994

JNNIE: Evaluation of Scalable Parallel Processing Architectures.

[BibT_eX]

[DOI]

Steven Hotovy

Proceedings of the High-Performance Computing and Networking, 1994

1993

Fine Grain Dataflow Computation without Tokens for Balanced Execution.

[BibT_eX]

[DOI]

Jeffrey M. Arnold

J. Parallel Distributed Comput., 1993

The Realities of Parallel Processing and Dataflow's Role in It: Lessons from the NASA HPCC Program.

[BibT_eX]

[DOI]

Michael J. MacDonald

Proceedings of the IFIP WG10.3. Working Conference on Architectures and Compilation Techniques for Fine and Medium Grain Parallelism, 1993

1992

The Effectiveness of Random Mapping on Fine-Grain MIMD Architectures.

[BibT_eX]

Proceedings of the 1992 International Conference on Parallel Processing, 1992

1988

Tokenless static data flow using associative templates.

[BibT_eX]

[DOI]

D. Scott Wills

Ellery Y. Chan

Proceedings of the Proceedings Supercomputing '88, Orlando, FL, USA, November 12-17, 1988, 1988

Multiprocessor Performance Measurement Using Embedded Instrumentation.

[BibT_eX]

Albert J. Musciano

Donald J. Becker

Proceedings of the International Conference on Parallel Processing, 1988

A Practical Static Data Flow Computer Based on Associative Methods.

[BibT_eX]

Ellery Y. Chan

Proceedings of the International Conference on Parallel Processing, 1988

Efficient Dynamic Scheduling of Medium-Grained Tasks for General Purpose Parallel Processing.

[BibT_eX]

Albert J. Musciano

Proceedings of the International Conference on Parallel Processing, 1988

1987

Effective Implementation of a Parallel Language on a Multiprocessor.

[BibT_eX]

[DOI]

IEEE Micro, 1987

1986

Concert: Design of a Multiprocessor Development System.

[BibT_eX]

[DOI]

Robert H. Halstead Jr.

Thomas L. Anderson

Randy B. Osborne