We stand with Ukraine

We stand with Ukraine

Hongzhang Shan

According to our database¹, Hongzhang Shan authored at least 40 papers between 1997 and 2019.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2019

Accelerating the Performance of Modal Aerosol Module of E3SM Using OpenACC.

[BibT_eX]

[DOI]

,

,

Proceedings of the Accelerator Programming Using Directives - 6th International Workshop, 2019

2018

A Novel Multi-level Integrated Roofline Model Approach for Performance Characterization.

[BibT_eX]

[DOI]

,

,

,

Adetokunbo Adedoyin

,

,

Philippe Thierry

,

,

Rahulkumar Gayatri

,

,

,

,

,

Samuel Williams

Proceedings of the High Performance Computing - 33rd International Conference, 2018

Improving MPI Reduction Performance for Manycore Architectures with OpenMP and Data Compression.

[BibT_eX]

[DOI]

,

Samuel Williams

,

Calvin W. Johnson

Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

2017

Performance analysis and optimization of the RAMPAGE metal alloy potential generation software.

[BibT_eX]

[DOI]

,

,

,

Nikolas Antolin

,

Sarat Sreepathi

,

,

Samuel Williams

,

,

Proceedings of the 4th ACM SIGPLAN International Workshop on Software Engineering for Parallel Systems, 2017

A Locality-Based Threading Algorithm for the Configuration-Interaction Method.

[BibT_eX]

[DOI]

,

Samuel Williams

,

Calvin W. Johnson

,

Kenneth S. McElvain

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016

Experiences of Applying One-Sided Communication to Nearest-Neighbor Communication.

[BibT_eX]

[DOI]

,

Samuel Williams

,

,

,

,

Stéphane Ethier

,

Proceedings of the 2016 PGAS Applications Workshop, 2016

MPI usage at NERSC: Present and Future.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

2015

Parallel implementation and performance optimization of the configuration-interaction method.

[BibT_eX]

[DOI]

,

Samuel Williams

,

Calvin W. Johnson

,

Kenneth S. McElvain

,

W. Erich Ormand

Proceedings of the International Conference for High Performance Computing, 2015

Thread-level parallelization and optimization of NWChem for the Intel MIC architecture.

[BibT_eX]

[DOI]

,

Samuel Williams

,

,

Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

2014

Evaluation of PGAS Communication Paradigms with Geometric Multigrid.

[BibT_eX]

[DOI]

,

,

Samuel Williams

,

,

Katherine A. Yelick

Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

UPC++: A PGAS Extension for C++.

[BibT_eX]

[DOI]

,

,

Michael B. Driscoll

,

,

Katherine A. Yelick

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

2013

Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms.

[BibT_eX]

[DOI]

,

,

,

,

Nicholas J. Wright

,

Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

2012

A preliminary evaluation of the hardware acceleration of the Cray Gemini interconnect for PGAS languages and comparison with MPI.

[BibT_eX]

[DOI]

,

Nicholas J. Wright

,

,

Katherine A. Yelick

,

,

Nathan Wichmann

SIGMETRICS Perform. Evaluation Rev., 2012

Optimizing the Advanced Accelerator Simulation Framework Synergia Using OpenMP.

[BibT_eX]

[DOI]

,

Erich Strohmaier

,

James F. Amundson

,

Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

2010

A programming model performance study using the NAS parallel benchmarks.

[BibT_eX]

[DOI]

,

Filip Blagojevic

,

,

,

,

Karl Fürlinger

,

,

Nicholas J. Wright

Sci. Program., 2010

Developing a Parameterized Performance Proxy for Sequential Scientific Kernels.

[BibT_eX]

[DOI]

,

Erich Strohmaier

Proceedings of the 12th IEEE International Conference on High Performance Computing and Communications, 2010

2009

HPC global file system performance analysis using a scientific-application derived benchmark.

[BibT_eX]

[DOI]

,

,

,

,

Parallel Comput., 2009

2008

Performance Analysis of Leading HPC Architectures With Beambeam3D.

[BibT_eX]

[DOI]

,

Erich Strohmaier

,

Int. J. High Perform. Comput. Appl., 2008

Linearly scaling 3D fragment method for large-scale electronic structure calculations.

[BibT_eX]

[DOI]

,

,

,

,

,

Erich Strohmaier

,

David H. Bailey

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Characterizing and predicting the I/O performance of HPC applications using a parameterized synthetic benchmark.

[BibT_eX]

[DOI]

,

,

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

2007

APEX-Map: a parameterized scalable memory access probe for high-performance computing systems.

[BibT_eX]

[DOI]

Erich Strohmaier

,

Concurr. Comput. Pract. Exp., 2007

Investigation of leading HPC I/O performance using a scientific-application derived benchmark.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

Scientific Application Performance on Candidate PetaScale Platforms.

[BibT_eX]

[DOI]

,

,

Jonathan Carter

,

,

Michael Lijewski

,

,

,

,

Erich Strohmaier

,

Stéphane Ethier

,

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006

Particles and contiuum - Performance modeling and optimization of a high energy colliding beam simulation code.

[BibT_eX]

[DOI]

,

Erich Strohmaier

,

,

David H. Bailey

,

Katherine A. Yelick

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Performance Analysis of a High Energy Colliding Beam Simulation Code on Four HPC Architectures.

[BibT_eX]

[DOI]

,

,

Erich Strohmaier

,

Katherine A. Yelick

Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006

2005

Apex-Map: A Global Data Access Benchmark to Analyze HPC Systems and Parallel Programming Paradigms.

[BibT_eX]

[DOI]

Erich Strohmaier

,

Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Apex-Map: A Synthetic Scalable Benchmark Probe to Explore Data Access Performance on Highly Parallel Systems.

[BibT_eX]

[DOI]

Erich Strohmaier

,

Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

2004

A Performance Evaluation of the Cray X1 for Scientific Applications.

[BibT_eX]

[DOI]

,

,

,

,

Jonathan Carter

,

M. Jahed Djomehri

,

,

Proceedings of the High Performance Computing for Computational Science, 2004

Architecture Independent Performance Characterization and Benchmarking for Scientific Applications.

[BibT_eX]

[DOI]

Erich Strohmaier

,

Proceedings of the 12th International Workshop on Modeling, 2004

Performance characteristics of the Cray X1 and their implications for application performance tuning.

[BibT_eX]

[DOI]

,

Erich Strohmaier

Proceedings of the 18th Annual International Conference on Supercomputing, 2004

2003

Message passing and shared address space parallelism on an SMP cluster.

[BibT_eX]

[DOI]

,

Jaswinder Pal Singh

,

,

Parallel Comput., 2003

Job Superscheduler Architecture and Performance in Computational Grid Environments.

[BibT_eX]

[DOI]

,

,

Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

2002

A Comparison of Three Programming Models for Adaptive Applications on the Origin2000.

[BibT_eX]

[DOI]

,

Jaswinder Pal Singh

,

,

J. Parallel Distributed Comput., 2002

2001

A Comparison of MPI, SHMEM and Cache-Coherent Shared Address Space Programming Models on a Tightly-Coupled Multiprocessors.

[BibT_eX]

[DOI]

,

Jaswinder Pal Singh

Int. J. Parallel Program., 2001

Design Strategies for Irregularly Adapting Parallel Applications.

[BibT_eX]

,

,

,

Jaswinder Pal Singh

Proceedings of the Tenth SIAM Conference on Parallel Processing for Scientific Computing, 2001

Message Passing Vs. Shared Address Space on a Clusters of SMPs.

[BibT_eX]

[DOI]

,

Jaswinder Pal Singh

,

,

Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

1999

Parallel Sorting on Cache-coherent DSM Multiprocessors.

[BibT_eX]

[DOI]

,

Jaswinder Pal Singh

Proceedings of the ACM/IEEE Conference on Supercomputing, 1999

A comparison of MPI, SHMEM and cache-coherent shared address space programming models on the SGI Origin2000.

[BibT_eX]

[DOI]

,

Jaswinder Pal Singh

Proceedings of the 13th international conference on Supercomputing, 1999

1998

Parallel Tree Building on a Range of Shared Address Space Multiprocessors: Algorithms and Application Performance.

[BibT_eX]

[DOI]

,

Jaswinder Pal Singh

Proceedings of the 12th International Parallel Processing Symposium / 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP '98), March 30, 1998

1997

Application Restructuring and Performance Portability on Shared Virtual Memory and Hardware-Coherent Multiprocessors.

[BibT_eX]

[DOI]

,

,

Jaswinder Pal Singh

Proceedings of the Sixth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1997

Loading...