2025
COMPSO: Optimizing Gradient Compression for Distributed Training with Second-Order Optimizers.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025
2024
Acoustic fingerprints in nature: A self-supervised learning approach for ecosystem activity monitoring.
Ecol. Informatics, 2024
XaaS: Acceleration as a Service to Enable Productive High-Performance Cloud Computing.
Comput. Sci. Eng., 2024
PAISE 2024 Preface and Committees.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
2023
Adversarial Predictions of Data Distributions Across Federated Internet-of-Things Devices.
Proceedings of the 9th IEEE World Forum on Internet of Things, 2023
Hardware Specialization: Estimating Monte Carlo Cross-Section Lookup Kernel Performance and Area.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Benchmarking and In-depth Performance Study of Large Language Models on Habana Gaudi Processors.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
2022
Goal-driven scheduling model in edge computing for smart city applications.
J. Parallel Distributed Comput., 2022
Hands-On Computer Science: The Array of Things Experimental Urban Instrument.
Comput. Sci. Eng., 2022
SOLAR: A Highly Optimized Data Loading Framework for Distributed Training of CNN-based Scientific Surrogates.
CoRR, 2022
Workshop on Resource Arbitration for Dynamic Runtimes (RADR).
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022
2021
Narrowing the Search Space of Applications Mapping on Hierarchical Topologies.
Proceedings of the 2021 International Workshop on Performance Modeling, 2021
2020
Measuring Cities with Software-Defined Sensors.
J. Soc. Comput., 2020
Workshop 17: PAISE Parallel AI and Systems for the Edge.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020
2019
Improving the scalability of neutron cross-section lookup codes on multicore NUMA system.
CoRR, 2019
Explicit Data Layout Management for Autotuning Exploration on Complex Memory Topologies.
Proceedings of the 2019 IEEE/ACM Workshop on Memory Centric High Performance Computing, 2019
Understanding the Impact of Dynamic Power Capping on Application Progress.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019
Introduction to PAISE 2019.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
Introduction to RADR 2019.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
HPBDC 2019 Keynote Speaker.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
Spatiotemporal Real-Time Anomaly Detection for Supercomputing Systems.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019
Proceedings of the Operating Systems for Supercomputers and High Performance Computing, 2019
Proceedings of the Operating Systems for Supercomputers and High Performance Computing, 2019
2018
Machine Learning-Based Temperature Prediction for Runtime Thermal Management Across System Components.
IEEE Trans. Parallel Distributed Syst., 2018
Argobots: A Lightweight Low-Level Threading and Tasking Framework.
IEEE Trans. Parallel Distributed Syst., 2018
Big data and extreme-scale computing.
Int. J. High Perform. Comput. Appl., 2018
Toward a smart data transfer node.
Future Gener. Comput. Syst., 2018
Minimizing Thermal Variation in Heterogeneous HPC Systems with FPGA Nodes.
Proceedings of the 36th IEEE International Conference on Computer Design, 2018
Towards Autonomic Science Infrastructure: Architecture, Limitations, and Open Issues.
Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science, 2018
2017
In Situ Workflows at Exascale: System Software to the Rescue.
Proceedings of the In Situ Infrastructures on Enabling Extreme-Scale Analysis and Visualization, 2017
Argo NodeOS: Toward Unified Resource Management for Exascale.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017
Array of things: a scientific research instrument in the public way: platform design and early lessons learned.
Proceedings of the 2nd International Workshop on Science of Smart City Operations and Platforms Engineering, 2017
2016
Systemwide Power Management with Argo.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016
Waggle: An open sensor platform for edge computing.
Proceedings of the 2016 IEEE SENSORS, Orlando, FL, USA, October 30 - November 3, 2016, 2016
Exploring Data Migration for Future Deep-Memory Many-Core Systems.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016
2015
Minimizing Thermal Variation Across System Components.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
Versioned Distributed Arrays for Resilience in Scientific Applications: Global View Resilience.
Proceedings of the International Conference on Computational Science, 2015
Distributed Monitoring and Management of Exascale Systems in the Argo Project.
Proceedings of the Distributed Applications and Interoperable Systems, 2015
2014
Improved cache performance in Monte Carlo transport calculations using energy banding.
Comput. Phys. Commun., 2014
CINET 2.0: A CyberInfrastructure for Network Science.
Proceedings of the 10th IEEE International Conference on e-Science, 2014
2012
Exascale System Software for the Year of the Dragon.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
CINET: A cyberinfrastructure for network science.
Proceedings of the 8th IEEE International Conference on E-Science, 2012
Evaluating Power-Monitoring Capabilities on IBM Blue Gene/P and Blue Gene/Q.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012
2011
Performance and Scalability Evaluation of 'Big Memory' on Blue Gene Linux.
Int. J. High Perform. Comput. Appl., 2011
Understanding Checkpointing Overheads on Massive-Scale Systems: Analysis of the IBM Blue Gene/P System.
Int. J. High Perform. Comput. Appl., 2011
The International Exascale Software Project roadmap.
Int. J. High Perform. Comput. Appl., 2011
Mapping communication layouts to network hardware characteristics on massive-scale blue gene systems.
Comput. Sci. Res. Dev., 2011
2010
Middleware support for many-task computing.
Clust. Comput., 2010
A practical failure prediction with location and lead time for Blue Gene/P.
Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W 2010), Chicago, Illinois, USA, June 28, 2010
2009
The International Exascale Software Project: a Call To Cooperative Action By the Global High-Performance Community.
Int. J. High Perform. Comput. Appl., 2009
Parallel Scripting for Applications at the Petascale and Beyond.
Computer, 2009
Robust data placement in urgent computing environments.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009
Characterizing the Performance of .
Proceedings of the ICPPW 2009, 2009
Analyzing Checkpointing Trends for Applications on the IBM Blue Gene/P System.
Proceedings of the ICPPW 2009, 2009
CIFTS: A Coordinated Infrastructure for Fault-Tolerant Systems.
Proceedings of the ICPP 2009, 2009
2008
Towards Loosely-Coupled Programming on Petascale Systems.
CoRR, 2008
Benchmarking the effects of operating system interference on extreme-scale parallel machines.
Clust. Comput., 2008
Toward loosely coupled programming on petascale systems.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008
ZOID: I/O-forwarding infrastructure for petascale architectures.
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008
Looking toward Exascale Computing.
Proceedings of the Ninth International Conference on Parallel and Distributed Computing, 2008
Empirical-based probabilistic upper bounds for urgent computing applications.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008
2007
The ghost in the machine: observing the effects of kernel operation on parallel application performance.
Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007
2006
Operating system issues for petascale systems.
ACM SIGOPS Oper. Syst. Rev., 2006
Multi-core issues - Multi-Core for HPC: breakthrough or breakdown?
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006
SPRUCE: A System for Supporting Urgent High-Performance Computing.
Proceedings of the Grid-Based Problem Solving Environments, 2006
TeraGrid: Analysis of Organization, System Architecture, and Middleware Enabling New Types of Applications.
Proceedings of the High Performance Computing and Grids in Action, 2006
Building an Infrastructure for Urgent Computing.
Proceedings of the High Performance Computing and Grids in Action, 2006
The Influence of Operating Systems on the Performance of Collective Operations at Extreme Scale.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006
2004
The Inca Test Harness and Reporting Framework.
Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004
2001
HPC++ and the HPC++Lib Toolkit.
Proceedings of the Compiler Optimizations for Scalable Parallel Systems Languages, 2001
2000
Ligature: Component Architecture for High Performance Applications.
Int. J. High Perform. Comput. Appl., 2000
Workshop on Run-Time Systems for Parallel Programming (RTSPP).
Proceedings of the Parallel and Distributed Processing, 2000
Clusters, Servers, Thin Clients, and On-line Communities.
Proceedings of the Distributed Communities on the Web, Third International Workshop, 2000
1999
Linux on the Move - Guest Editors' Introduction.
IEEE Softw., 1999
A Programming Model for Clusters of SMPs.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999
Implementation and Evaluation of MPI on an SMP Cluster.
Proceedings of the Parallel and Distributed Processing, 1999
SMARTS: exploiting temporal locality and parallelism through vertical execution.
Proceedings of the 13th international conference on Supercomputing, 1999
1998
Portable profiling and tracing for parallel, scientific applications using C++.
Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998
An IL converter and program database for analysis tools.
Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998
Efficient Coupling of Parallel Applications Using PAWS.
Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing, 1998
1996
Galaxies Collide On the I-Way: an Example of Heterogeneous Wide-Area Collaborative Supercomputing.
Int. J. High Perform. Comput. Appl., 1996
Tulip: A Portable Run-Time System for Object-Parallel Systems.
Proceedings of IPPS '96, 1996
Portable Parallel Programming in HPC++.
Proceedings of the 1996 International Conference on Parallel Processing Workshop, 1996
1994
Performance Analysis of pC++: A Portable Data-Parallel Programming System for Scalable Parallel Computers.
Proceedings of the 8th International Symposium on Parallel Processing, 1994
1993
Distributed pC++ Basic Ideas for an Object Parallel Language.
Sci. Program., 1993
Implementing a parallel C++ runtime system for scalable parallel systems.
Proceedings of Supercomputing '93, 1993