Xingfu Wu

Orcid: 0000-0001-8150-5171

Affiliations:
  • Texas A&M University, College Station, Texas, USA


According to our database1, Xingfu Wu authored at least 58 papers between 1996 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Integrating ytopt and libEnsemble to Autotune OpenMC.
CoRR, 2024

2023
ytopt: Autotuning Scientific Applications for Energy Efficiency at Large Scales.
CoRR, 2023

Performance and power modeling and prediction using MuMMI and 10 machine learning methods.
Concurr. Comput. Pract. Exp., 2023

Utilizing ensemble learning for performance and power modeling and improvement of parallel cancer deep learning CANDLE benchmarks.
Concurr. Comput. Pract. Exp., 2023

Autotuning Apache TVM-based Scientific Applications Using Bayesian Optimization.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Transfer-learning-based Autotuning using Gaussian Copula.
Proceedings of the 37th International Conference on Supercomputing, 2023

2022
Autotuning PolyBench benchmarks with LLVM Clang/Polly loop optimization pragmas using Bayesian optimization.
Concurr. Comput. Pract. Exp., 2022

Performance Debugging and Tuning of Flash-X with Data Analysis Tools.
Proceedings of the IEEE/ACM Workshop on Programming and Performance Visualization Tools, 2022

2021
Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization (extended version).
CoRR, 2021

Customized Monte Carlo Tree Search for LLVM/Polly's Composable Loop Optimization Transformations.
Proceedings of the 2021 International Workshop on Performance Modeling, 2021

Performance and Energy Improvement of ECP Proxy App SW4lite under Various Workloads.
Proceedings of the IEEE/ACM Workshop on Memory Centric High Performance Computing, 2021

A Dynamic Power Capping Library for HPC Applications.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Performance and Power Modeling and Prediction Using MuMMI and Ten Machine Learning Methods.
CoRR, 2020

Autotuning Search Space for Loop Transformations.
CoRR, 2020

Toward an End-to-End Auto-tuning Framework in HPC PowerStack.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019
Performance, Energy, and Scalability Analysis and Improvement of Parallel Cancer Deep Learning CANDLE Benchmarks.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2017
Performance and Power Characteristics and Optimizations of Hybrid MPI/OpenMP LULESH Miniapps under Various Workloads.
Proceedings of the 5th International Workshop on Energy Efficient Supercomputing, 2017

An Energy Efficient Demand-Response Model for High Performance Computing Systems.
Proceedings of the 25th IEEE International Symposium on Modeling, 2017

2016
Using Performance-Power Modeling to Improve Energy Efficiency of HPC Applications.
Computer, 2016

Utilizing Hardware Performance Counters to Model and Optimize the Energy and Performance of Large Scale Scientific Applications on Power-Aware Supercomputers.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015
Power and performance characteristics of CORAL Scalable Science Benchmarks on BlueGene/Q Mira.
Proceedings of the Sixth International Green and Sustainable Computing Conference, 2015

2014
E-AMOM: an energy-aware modeling and optimization methodology for scientific applications.
Comput. Sci. Res. Dev., 2014

Parallel Optical Flow Processing of 4D Cardiac CT Data on Multicore Clusters.
Proceedings of the 17th IEEE International Conference on Computational Science and Engineering, 2014

SKOPE: a framework for modeling and exploring workload behavior.
Proceedings of the Computing Frontiers Conference, CF'14, 2014

2013
Performance modeling of hybrid MPI/OpenMP scientific applications on large-scale multicore supercomputers.
J. Comput. Syst. Sci., 2013

Performance Characteristics of Hybrid MPI/OpenMP Scientific Applications on a Largescale Multithreaded BlueGene/Q Supercomputer.
Int. J. Networked Distributed Comput., 2013

MuMMI: multiple metrics modeling infrastructure for exploring performance and power modeling.
Proceedings of the Extreme Science and Engineering Discovery Environment: Gateway to Discovery, 2013

Performance Characteristics of Hybrid MPI/OpenMP Scientific Applications on a Large-Scale Multithreaded BlueGene/Q Supercomputer.
Proceedings of the 14th ACIS International Conference on Software Engineering, 2013

MuMMI: Multiple Metrics Modeling Infrastructure.
Proceedings of the 14th ACIS International Conference on Software Engineering, 2013

MuMMI: Multiple Metrics Modeling Infrastructure.
Proceedings of the Tools for High Performance Computing 2013, 2013

2012
Power-aware predictive models of hybrid (MPI/OpenMP) scientific applications on multicore systems.
Comput. Sci. Res. Dev., 2012

Performance Characteristics of Hybrid MPI/OpenMP Implementations of NAS Parallel Benchmarks SP and BT on Large-Scale Multicore Clusters.
Comput. J., 2012

SWAPP: A Framework for Performance Projections of HPC Applications Using Benchmarks.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

2011
Performance characteristics of hybrid MPI/OpenMP implementations of NAS parallel benchmarks SP and BT on large-scale multicore supercomputers.
SIGMETRICS Perform. Evaluation Rev., 2011

Energy and performance characteristics of different parallel implementations of scientific applications on multicore systems.
Int. J. High Perform. Comput. Appl., 2011

Performance Modeling of Hybrid MPI/OpenMP Scientific Applications on Large-scale Multicore Cluster Systems.
Proceedings of the 14th IEEE International Conference on Computational Science and Engineering, 2011

2009
Performance Analysis and Optimization of Parallel Scientific Applications on CMP Clusters.
Scalable Comput. Pract. Exp., 2009

Performance projection of HPC applications using SPEC CFP2006 benchmarks.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

An OpenMP Approach to Modeling Dynamic Earthquake Rupture Along Geometrically Complex Faults on CMP Systems.
Proceedings of the ICPPW 2009, 2009

2008
Performance Analysis and Optimization of Parallel Scientific Applications on CMP Cluster Systems.
Proceedings of the 37th International Conference on Parallel Processing, 2008

Performance Analysis of Parallel Visualization Applications and Scientific Applications on an Optical Grid.
Proceedings of the International Conference on Cyberworlds 2008, 2008

2006
Performance Analysis, Modeling and Prediction of a Parallel Multiblock Lattice Boltzmann Application Using Prophesy System.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

Approaches to Architecture-Aware Parallel Scientific Computation.
Proceedings of the Parallel Processing for Scientific Computing, 2006

2005
Performance Prediction-based versus Load-based Site Selection: Quantifying the Difference.
Proceedings of the ISCA 18th International Conference on Parallel and Distributed Computing Systems, 2005

2004
Guest Editors' Introduction.
IEEE Distributed Syst. Online, 2004

Isocoupling: Reusing Kernel Coupling Values to Predict the Performance of Parallel Applications.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

2003
Prophesy: an infrastructure for performance analysis and modeling of parallel and grid applications.
SIGMETRICS Perform. Evaluation Rev., 2003

Using Kernel Coupling to Improve the Performance of Multithreaded Applications.
Proceedings of the ISCA 16th International Conference on Parallel and Distributed Computing Systems, 2003

2002
Design and Development of a Scalable Distributed Debugger for Cluster Computing.
Clust. Comput., 2002

Using Kernel Couplings to Predict Parallel Application Performance.
Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing (HPDC-11 2002), 2002

2001
Design and Development of the Prophesy Performance Database for Distributed Scientific Applications.
Proceedings of the Tenth SIAM Conference on Parallel Processing for Scientific Computing, 2001

Prophesy: Automating the Modeling Process.
Proceedings of the 3rd Annual International Workshop on Active Middleware Services (AMS 2001), 2001

2000
PDRS: A Performance Data Representation System.
Proceedings of the Parallel and Distributed Processing, 2000

Prophesy: An Infrastructure for Analyzing and Modeling the Performance of Parallel and Distributed Applications.
Proceedings of the Ninth IEEE International Symposium on High Performance Distributed Computing, 2000

1999
A Java-based Distributed Debbuger Supporting MPI and PVM.
Scalable Comput. Pract. Exp., 1999

1998
Performance models for scalable cluster computing.
J. Syst. Archit., 1998

1997
An Approach to Scalability of Parallel Matrix Multiplication Algorithms.
Proceedings of the Computing and Combinatorics, Third Annual International Conference, 1997

1996
Scalability of Parallel Algorithm Implementation.
Proceedings of the 1996 International Symposium on Parallel Architectures, 1996


  Loading...