Scaling Poisson Solvers on Many Cores via MMEwald.
IEEE Trans. Parallel Distributed Syst., 2022
Accelerating all-electron ab initio simulation of Raman spectra for biological systems.
Proceedings of the International Conference for High Performance Computing, 2021
Bandwidth-Aware Loop Tiling for DMA-Supported Scratchpad Memory.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020
PPOpenCL: a performance-portable OpenCL compiler with host and kernel thread code fusion.
Proceedings of the 28th International Conference on Compiler Construction, 2019