Hongbo Rong

Proceedings of the IWOCL'22: International Workshop on OpenCL, Bristol, United Kingdom, May 10, 2022

2021

Programming and Synthesis for Software-defined FPGA Acceleration: Status and Future Prospects.

ACM Trans. Reconfigurable Technol. Syst., 2021

2020

Mapping Stencils on Coarse-grained Reconfigurable Spatial Architecture.

CoRR, 2020

Systolic Computing on GPUs for Productive Performance.

CoRR, 2020

Building Application-Specific Overlays on FPGAs with High-Level Customizable IPs.

CoRR, 2020

SuSy: A Programming Model for Productive Construction of High-Performance Systolic Arrays on FPGAs.

Christopher J. Hughes

Pradeep Dubey

Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

Tensaurus: A Versatile Accelerator for Mixed Sparse-Dense Tensor Computations.

Nitish Kumar Srivastava

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020

2019

T2S-Tensor: Productively Generating High-Performance Spatial Hardware for Dense Tensor Computations.

Nitish Kumar Srivastava

Christopher J. Hughes

Timothy G. Mattson

Pradeep Dubey

Proceedings of the 27th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2019

2018

Expressing Sparse Matrix Computations for Productive Performance on Spatial Architectures.

CoRR, 2018

Productively Expressing High-performance Spatial Designs of Givens Rotation-based QR Decomposition Algorithm.

CoRR, 2018

2017

Programmatic Control of a Compiler for Generating High-performance Spatial Hardware.

CoRR, 2017

Mozart: Efficient Composition of Library Functions for Heterogeneous Execution.

Proceedings of the Languages and Compilers for Parallel Computing, 2017

2016

Automating wavefront parallelization for sparse matrix computations.

Anand Venkat

Mahdi Soltan Mohammadi

Jongsoo Park

Rajkishore Barik

Michelle Mills Strout

Mary W. Hall

Proceedings of the International Conference for High Performance Computing, 2016

Sparso: Context-driven Optimizations of Sparse Linear Algebra.

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015

ProductiveC: enabling high productivity in C-family languages.

Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015

2014

Just-In-Time Software Pipelining.

Proceedings of the 12th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2014

2013

Allocating rotating registers by scheduling.

Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture, 2013

2012

SMARQ: Software-Managed Alias Register Queue for Dynamic Optimizations.

Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012

2009

Tree register allocation.

Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

2008

ACM Trans. Program. Lang. Syst., 2008

2007

Single-dimension software pipelining for multidimensional loops.

Zhizhong Tang

Ramaswamy Govindarajan

ACM Trans. Archit. Code Optim., 2007

Advances in Software Pipelining.

R. Govindarajan

The Compiler Design Handbook: Optimizations and Machine Code Generation, 2007

2006

Multi-dimensional Kernel Generation for Loop Nest Software Pipelining.

Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

2005

Proceedings of the ACM SIGPLAN 2005 Conference on Programming Language Design and Implementation, 2005

2004

Single-Dimension Software Pipelining for Multi-Dimensional Loops.

Zhizhong Tang

Ramaswamy Govindarajan

Proceedings of the 2nd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2004), 2004

Code Generation for Single-Dimension Software Pipelining of Multi-Dimensional Loops.

Ramaswamy Govindarajan