2022
Scaling Poisson Solvers on Many Cores via MMEwald.
IEEE Trans. Parallel Distributed Syst., 2022

2021
Accelerating all-electron <i>ab initio</i> simulation of raman spectra for biological systems.
Proceedings of the International Conference for High Performance Computing, 2021

2020
Bandwidth-Aware Loop Tiling for DMA-Supported Scratchpad Memory.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020

2019
PPOpenCL: a performance-portable OpenCL compiler with host and kernel thread code fusion.
Proceedings of the 28th International Conference on Compiler Construction, 2019