Darren J. Kerbyson

According to our database, Darren J. Kerbyson authored at least 129 papers between 1989 and 2017.

Representative paths analysis.
Proceedings of the International Conference for High Performance Computing, 2017

Towards Efficient Resource Allocation for Distributed Workflows Under Demand Uncertainties.
Proceedings of the Job Scheduling Strategies for Parallel Processing, 2017

Scaling Deep Learning Workloads: NVIDIA DGX-1/Pascal and Intel Knights Landing.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Generating Performance Models for Irregular Applications.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Performance and power for highly parallel systems.
Concurr. Comput. Pract. Exp., 2016

Assessing Advanced Technology in CENATE.
Proceedings of the IEEE International Conference on Networking, 2016

Modeling the Impact of Silicon Photonics on Graph Analytics.
Proceedings of the IEEE International Conference on Networking, 2016

Fault Modeling of Extreme Scale Applications Using Machine Learning.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Algorithm and Architecture Independent Benchmarking with SEAK.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Modeling the Performance and Energy Impact of Dynamic Power Steering.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

New-Sum: A Novel Online ABFT Scheme For General Iterative Methods.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

Leveraging large sensor streams for robust cloud control.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Combating the Reliability Challenge of GPU Register File at Low Supply Voltage.
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

Scaling Support Vector Machines on modern HPC platforms.
J. Parallel Distributed Comput., 2015

A case for application-oblivious energy-efficient MPI runtime.
Proceedings of the International Conference for High Performance Computing, 2015

Towards efficient scheduling of data intensive high energy physics workflows.
Proceedings of the 10th Workshop on Workflows in Support of Large-Scale Science, 2015

Towards an application-specific thermal energy model of current processors.
Proceedings of the 3rd International Workshop on Energy Efficient Supercomputing, 2015

Diagnosing the causes and severity of one-sided message contention.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

Investigating the Interplay between Energy Efficiency and Resilience in High Performance Computing.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

HIPS-LSPP Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Fast and Accurate Support Vector Machines on Large Scale Systems.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Power and performance trade-offs for Space Time Adaptive Processing.
Proceedings of the 26th IEEE International Conference on Application-specific Systems, 2015

Guest Editors' Note: Special Issue on Large-Scale Parallel Processing.
Parallel Process. Lett., 2014

Online Monitoring Systems for Performance Fault Detection.
Parallel Process. Lett., 2014

A performance comparison of current HPC systems: Blue Gene/Q, Cray XE6 and InfiniBand systems.
Future Gener. Comput. Syst., 2014

Evaluating performance and power efficiency of scientific applications on multi-threaded systems.
Proceedings of the 2nd International Workshop on Energy Efficient Supercomputing, 2014

On the feasibility of dynamic power steering.
Proceedings of the 2nd International Workshop on Energy Efficient Supercomputing, 2014

Cross-Layer Self-Adaptive/Self-Aware System Software for Exascale Systems.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

LSPP Introduction and Committees.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Online Monitoring System for Performance Fault Detection.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

An adaptive cross-architecture combination method for graph traversal.
Proceedings of the 2014 International Conference on Supercomputing, 2014

On the suitability of MPI as a PGAS runtime.
Proceedings of the 21st International Conference on High Performance Computing, 2014

Designing energy efficient communication runtime systems: a view from PGAS models.
J. Supercomput., 2013

Guest Editors' note: Large-Scale Parallel Processing.
Parallel Process. Lett., 2013

A Performance Analysis of Three Generations of Blue Gene.
Parallel Process. Lett., 2013

Tracking the Performance Evolution of Blue Gene Systems.
Proceedings of the Supercomputing - 28th International Supercomputing Conference, 2013

Unified performance and power modeling of scientific workloads.
Proceedings of the 1st International Workshop on Energy Efficient Supercomputing, 2013

Building Scalable PGAS Communication Subsystem on Blue Gene/Q.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Quantifying the energy cost of data movement in scientific applications.
Proceedings of the IEEE International Symposium on Workload Characterization, 2013

Enabling accurate power profiling of HPC applications on exascale systems.
Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, 2013

LSPP Introduction.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Comparing the Performance of Blue Gene/Q with Leading Cray XE6 and InfiniBand Systems.
Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012

Guest Editor's Note: Large-Scale Parallel Processing.
Parallel Process. Lett., 2011

Modeling the Performance of Direct Numerical Simulation on Parallel Systems.
Parallel Process. Lett., 2011

Adapting wave-front algorithms to efficiently utilize systems with deep communication hierarchies.
Parallel Comput., 2011

Codesign Challenges for Exascale Systems: Performance, Power, and Reliability.
Computer, 2011

An early performance analysis of POWER7-IH HPC systems.
Proceedings of the Conference on High Performance Computing Networking, 2011

A Performance Model of Direct Numerical Simulation for Analyzing Large-Scale Systems.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

DCPM Introduction.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Energy Templates: Exploiting Application Information to Save Energy.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

Analyzing the Performance Bottlenecks of the POWER7-IH Network.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

Guest Editor's Note: Large-Scale Parallel Processing.
Parallel Process. Lett., 2010

On the Performance and Technological Impact of Adding Memory Controllers in Multi-Core Processors.
Parallel Process. Lett., 2010

Optimized InfiniBand™ fat-tree routing for shift all-to-all communication patterns.
Concurr. Comput. Pract. Exp., 2010

Analyzing the trade-off between multiple memory controllers and memory channels on multi-core processor performance.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models.
Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications, 2010

Characterizing the Impact of Using Spare-Cores on Application Performance.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Performance Prediction and Evaluation.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

An MPI Performance Monitoring Interface for Cell Based Compute Nodes.
Parallel Process. Lett., 2009

Performance Prediction via Modeling: a Case Study of the ORNL Cray XT4 Upgrade.
Parallel Process. Lett., 2009

The reverse-acceleration model for programming petascale hybrid systems.
IBM J. Res. Dev., 2009

Optimizing multiple conjugate gradient solvers for large-scale systems.
Concurr. Comput. Pract. Exp., 2009

Using Performance Modeling to Design Large-Scale Systems.
Computer, 2009

Application profiling on Cell-based clusters.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Performance modeling in action: Performance prediction of a Cray XT4 system during upgrade.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Dynamic Load Balancing of Matrix-Vector Multiplications on Roadrunner Compute Nodes.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

Infiniband Routing Table Optimizations for Scientific Applications.
Parallel Process. Lett., 2008

A Performance Evaluation of the Nehalem Quad-Core Processor for Scientific Computing.
Parallel Process. Lett., 2008

0.374 Pflop/s trillion-particle kinetic modeling of laser plasma interaction on Roadrunner.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Entering the petaflop era: the architecture and performance of Roadrunner.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Analysis of double buffering on two different multicore architectures: Quad-core Opteron and the Cell-BE.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Optimization of infiniband for scientific applications.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Experiences in scaling scientific applications on current-generation quad-core processors.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Improving the Performance of Multiple Conjugate Gradient Solvers by Exploiting Overlap.
Proceedings of the Euro-Par 2008, 2008

Analysis of the Weather Research and Forecasting (WRF) Model on Large-Scale Systems.
Proceedings of the Parallel Computing: Architectures, 2007

Performance Analysis of an Optical Circuit Switched Network for Peta-Scale Systems.
Proceedings of the Euro-Par 2007, 2007

Efficient offloading of collective communications in large-scale systems.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Performance feature identification by comparative trace analysis.
Future Gener. Comput. Syst., 2006

A performance model of non-deterministic particle transport on large-scale systems.
Future Gener. Comput. Syst., 2006

Special section: Large-scale system performance modeling and analysis.
Future Gener. Comput. Syst., 2006

MPI tools and performance studies - Quantifying the potential benefit of overlapping communication and computation in large-scale scientific applications.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

S05 - A practical approach to performance analysis and modeling of large-scale systems.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Architecture - A performance comparison through benchmarking and modeling of three leading supercomputers: Blue Gene/L, Red Storm, and Purple.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Performance Modeling of the Blue Gene Architecture.
Proceedings of the 2006 IEEE John Vincent Atanasoff International Symposium on Modern Computing (JVA2006), 2006

Dynamic performance prediction of an adaptive mesh application.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

A look at application performance sensitivity to the bandwidth and latency of InfiniBand networks.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

A Performance Model of the Krak Hydrodynamics Application.
Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006

A General Performance Model of Structured and Unstructured Mesh Particle Transport Computations.
J. Supercomput., 2005

Use of Predictive Performance Modeling during Large-scale System Installation.
Parallel Process. Lett., 2005

A Performance Model of the Parallel Ocean Program.
Int. J. High Perform. Comput. Appl., 2005

A performance comparison between the Earth Simulator and other terascale systems on a characteristic ASCI workload.
Concurr. Pract. Exp., 2005

On the Feasibility of Optical Circuit Switching for High Performance Computing Systems.
Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

A Performance Model and Scalability Analysis of the HYCOM Ocean Simulation Application.
Proceedings of the International Conference on Parallel and Distributed Computing Systems, 2005

Automatic Identification of Application Communication Patterns via Templates.
Proceedings of the ISCA 18th International Conference on Parallel and Distributed Computing Systems, 2005

A Performance Evaluation of an Alpha EV7 Processing Node.
Int. J. High Perform. Comput. Appl., 2004

A Performance and Scalability Analysis of the BlueGene/L Architecture.
Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004

Performance Modeling of Unstructured Mesh Particle Transport Computations.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

An empirical performance analysis of commodity memories in commodity servers.
Proceedings of the 2004 workshop on Memory System Performance, 2004

Modelling the performance of large-scale systems.
IEE Proc. Softw., 2003

The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q.
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

A Comparison between the Earth Simulator and AlphaServer Systems Using Predictive Application Performance Models.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Identification of Performance Characteristics from Multi-view Trace Analysis.
Proceedings of the Computational Science - ICCS 2003, 2003

Performance Prediction Technology for Agent-Based Resource Management in Grid Environments.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Agent-Based Resource Management for Grid Computing.
Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2002), 2002

Realistic image synthesis of plant structures for genetic analysis.
Image Vis. Comput., 2001

High Performance Service Discovery in Large-Scale Multi-Agent and Mobile-Agent Systems.
Int. J. Softw. Eng. Knowl. Eng., 2001

Optimisation of application execution on dynamic systems.
Future Gener. Comput. Syst., 2001

Predictive performance and scalability modeling of a large-scale application.
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001

Dynamic Instrumentation and Performance Prediction of Application Execution.
Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

Use of Agent-Based Service Discovery for Resource Management in Metacomputing Environment.
Proceedings of the Euro-Par 2001: Parallel Processing, 2001

Performance Evaluation of an Agent-Based Resource Management Infrastructure for Grid Computing.
Proceedings of the First IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2001), 2001

Performance optimization of financial option calculations.
Parallel Comput., 2000

PACE - A Toolset for the Performance Prediction of Parallel and Distributed Systems.
Int. J. High Perform. Comput. Appl., 2000

Run-Time Optimization Using Dynamic Performance Prediction.
Proceedings of the High-Performance Computing and Networking, 8th International Conference, 2000

Use of Performance Technology for the Management of Distributed Systems.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

Analytical Modeling of Set-Associative Cache Behavior.
IEEE Trans. Computers, 1999

Size invariant circle detection.
Image Vis. Comput., 1999

Efficient Analytical Modelling of Multi-Level Set-Associative Caches.
Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999

A performance analysis environment for life.
Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998

Application Execution Steering using On-the-Fly Performance Prediction.
Proceedings of the High-Performance Computing and Networking, 1998

A Layered Approach to Parallel Software Performance Prediction: A Case Study.
Proceedings of the Massively Parallel Processing Applications and Development, 1994

The Coherent Circle Hough Transform.
Proceedings of the British Machine Vision Conference, 1993

A multiple-SIMD architecture for image and tracking analysis.
PhD thesis, 1992

Hierarchical multiple-SIMD architecture for image analysis.
Mach. Vis. Appl., 1992

An heterogeneous M-SIMD architecture for Kalman filter controlled processing of image sequences.
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1992

Passive Estimation of Range to Objects from Image Sequences.
Proceedings of the British Machine Vision Conference, 1991

A hierarchical multiple-SIMD architecture for image analysis.
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990

Performance evaluation of the hierarchical Hough transform on an associative M-SIMD architecture.
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990

A Generalised Parallel Architecture for Image Based Algorithms.
Proceedings of the Advances in Computer Graphics Hardware IV (Eurographics'89 Workshop), 1989
