Rengan Xu

Pei Yang

Proceedings of the High Performance Computing - 34th International Conference, 2019

2018

The OpenACC data model: Preliminary study on its major challenges and implementations.

Parallel Comput., 2018

Deep Learning at Scale on NVIDIA V100 Accelerators.

Frank Han

Quy Ta

Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

2017

Implementing the OpenACC Data Model.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016

Compiler transformation of nested loops for general purpose GPUs.

Yonghong Yan

Deepak Eachempati

Concurr. Comput. Pract. Exp., 2016

An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling.

Proceedings of the High Performance Computing - 31st International Conference, 2016

Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations.

Proceedings of the 45th International Conference on Parallel Processing, 2016

2015

Multi-GPU Support on Single Node Using Directive-Based Programming Model.

Sci. Program., 2015

2014

Accelerating Kirchhoff migration on GPU using directives.

Maxime R. Hugues

Henri Calandra

Proceedings of the First Workshop on Accelerator Programming using Directives, 2014

SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance.

Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

Reduction Operations in Parallel Loops for GPGPUs.

Yonghong Yan

Proceedings of the 2014 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2014

NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model.

Yonghong Yan

Proceedings of the Languages and Compilers for Parallel Computing, 2014

A Validation Testsuite for OpenACC 1.0.

Cheng Wang

Oscar R. Hernandez

Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013

Compiling a High-Level Directive-Based Programming Model for GPGPUs.

Proceedings of the Languages and Compilers for Parallel Computing, 2013

Exploring Programming Multi-GPUs Using OpenMP and OpenACC-Based Hybrid Model.

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Filesystem Aware Scalable I/O Framework for Data-Intensive Parallel Applications.

Mauricio Araya-Polo