Yonggang Che

Orcid: 0000-0001-6906-4940

According to our database1, Yonggang Che authored at least 43 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Extending OP2 framework to support portable parallel programming of complex applications.
CCF Trans. High Perform. Comput., June, 2024

Evaluating performance portability of five shared-memory programming models using a high-order unstructured CFD solver.
J. Parallel Distributed Comput., May, 2024

Improving CUDA performance of an unstructured high-order CFD application under OP2 framework.
J. Supercomput., March, 2024

Towards Scalable Unstructured Mesh Computations on Shared Memory Many-Cores.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

Optimizing General Matrix Multiplications on Modern Multi-core DSPs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Optimizing Stencil Computation on Multi-core DSPs.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

2023
Multi-scale Recurrent LSTM and Transformer Network for Depth Completion.
CoRR, 2023

Developing a proxy application for an industrial unstructured CFD software: preliminary results.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

PowerDis: Fine-Grained Power Monitoring Through Power Disaggregation Model.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2023

Evaluating Performance Portability of SYCL and Kokkos: A Case Study on LBM Simulations.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023

2022
GPU Parallelization and Optimization of a Combustion Simulation Application.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

2020
Memory Access Optimization of High-Order CFD Stencil Computations on GPU.
Proceedings of the Parallel and Distributed Computing, Applications and Technologies, 2020

GPU Acceleration of a High-Order CFD Program.
Proceedings of the HP3C 2020: 4th International Conference on High Performance Compilation, 2020

Parallelization and Optimization of a Combustion Simulation Application on GPU Platform.
Proceedings of the HP3C 2020: 4th International Conference on High Performance Compilation, 2020

Load Balancing a Multi-Block Grids-based Application on Heterogeneous Platform.
Proceedings of the 23rd IEEE International Conference on Computational Science and Engineering, 2020

2019
Collaborating CPUs and MICs for Large-Scale LBM Multiphase Flow Simulations.
Proceedings of the Network and Parallel Computing, 2019

OpenMP4.5-Enabled Large-Scale Heterogeneous Lattice Boltzmann Multiphase Flow Simulations.
Proceedings of the 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2019

2018
Petascale scramjet combustion simulation on the Tianhe-2 heterogeneous supercomputer.
Parallel Comput., 2018

2017
Improved Algorithm for Reconstructing Singular Connection in Multi-Block CFD Applications.
CoRR, 2017

2016
Benchmarking the Powering Computations for Application Tuning.
Proceedings of the International Conference on Software Analysis, Testing and Evolution, 2016

2015
Realistic Performance Characterization of CFD Applications on Intel Many Integrated Core Architecture.
Comput. J., 2015

2014
Microarchitectural performance comparison of Intel Knights Corner and Intel Sandy Bridge with CFD applications.
J. Supercomput., 2014

Collaborating CPU and GPU for large-scale high-order CFD simulations with complex grids on the TianHe-1A supercomputer.
J. Comput. Phys., 2014

Optimization of a Parallel CFD Code and Its Performance Evaluation on Tianhe-1A.
Comput. Informatics, 2014

Test-driving Intel Xeon Phi.
Proceedings of the ACM/SPEC International Conference on Performance Engineering, 2014

Balancing CPU-GPU Collaborative High-Order CFD Simulations on the Tianhe-1A Supercomputer.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Performance Optimization of a CFD Application on Intel Multicore and Manycore Architectures.
Proceedings of the Advanced Computer Architecture - 10th Annual Conference, 2014

2013
An Empirical Study of Intel Xeon Phi.
CoRR, 2013

Parallelizing a High-Order CFD Software for 3D, Multi-block, Structural Grids on the TianHe-1A Supercomputer.
Proceedings of the Supercomputing - 28th International Supercomputing Conference, 2013

Performance Evaluation and Scalability Analysis of NPB-MZ on Intel Xeon Phi Coprocessor.
Proceedings of the Computer Engineering and Technology - 17th CCF Conference, 2013

2012
Simulation-based evaluation of the Imagine stream processor with scientific programs.
Int. J. High Perform. Comput. Netw., 2012

2011
PIT: A Framework for Effectively Composing High-Level Loop Transformations.
Comput. Informatics, 2011

2010
Optimizing Adaptive Synchronization in Parallel Simulators for Large-scale Parallel Systems and Applications.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

Evaluating the Performance and Accuracy Impact of Trace Generation to the BigSim Emulator.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009
Combining Model and Iterative Compilation for Program Performance Optimization.
J. Softw., 2009

A Framework for Effective Memory Optimization of High Performance Computing Applications.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

MPTD: A Scalable and Flexible Performance Prediction Framework for Parallel Systems.
Proceedings of the Advanced Parallel Processing Technologies, 8th International Symposium, 2009

2008
Analyzing the Efficiency and Bottleneck of Scientific Programs on Imagine Stream Processor by Simulation.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2008

Evaluating the Data Access Efficiency of Imagine Stream Processor with Scientific Applications.
Proceedings of the 9th International Conference for Young Computer Scientists, 2008

An Effective Iterative Compilation Search Algorithm for High Performance Computing Applications.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008

2006
A Lightweight Iterative Compilation Approach for Optimization Parameter Selection.
Proceedings of the Interdisciplinary and Multidisciplinary Research in Computer Science, 2006

2004
Locality Optimizations for Jacobi Iteration on Distributed Parallel Systems.
Proceedings of the Parallel and Distributed Processing and Applications, 2004

2003
Optimization Parameter Selection by Means of Limited Execution and Genetic Algorithms.
Proceedings of the Advanced Parallel Programming Technologies, 5th International Workshop, 2003


  Loading...