Keigo Nitadori

Orcid: 0000-0001-7374-4236

According to our database1, Keigo Nitadori authored at least 23 papers between 2009 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
102 PFLOPS lattice QCD quark solver on Fugaku.
Comput. Phys. Commun., 2023

Wilson matrix kernel for lattice QCD on A64FX architecture.
Proceedings of the HPC Asia 2023 Workshops, 2023

2020
Implementation and performance of Barnes-hut n-body algorithm on extreme-scale heterogeneous many-core architectures.
Int. J. High Perform. Comput. Appl., 2020

Implementation and Numerical Techniques for One EFlop/s HPL-AI Benchmark on Fugaku.
Proceedings of the 11th IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2020

Prompt Report on Exa-Scale HPL-AI Benchmark.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2018
Fortran interface layer of the framework for developing particle simulator FDPS.
CoRR, 2018

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems.
Proceedings of the 4th International Workshop on Extreme Scale Programming Models and Middleware, 2018

Global Simulation of Planetary Rings on Sunway TaihuLight.
Proceedings of the Computational Science - ICCS 2018, 2018

2016
Implementation and Evaluation of Data-Compression Algorithms for Irregular-Grid Iterative Methods on the PEZY-SC Processor.
Proceedings of the 6th Workshop on Irregular Applications: Architecture and Algorithms, 2016

Simulations of below-ground dynamics of fungi: 1.184 pflops attained by automated generation and autotuning of temporal blocking codes.
Proceedings of the International Conference for High Performance Computing, 2016

Automatic generation of efficient codes from mathematical descriptions of stencil computation.
Proceedings of the 5th International Workshop on Functional High-Performance Computing, 2016

2015
FDPS: a novel framework for developing high-performance particle simulation codes for distributed-memory systems.
Proceedings of the 5th International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, 2015

2014
24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs.
Proceedings of the International Conference for High Performance Computing, 2014

2013
Up to 700k GPU Cores, Kepler, and the Exascale Future for Simulations of Star Clusters Around Black Holes.
Proceedings of the Supercomputing - 28th International Supercomputing Conference, 2013

2012
4.45 Pflops astrophysical <i>N</i>-body simulation on K computer: the gravitational trillion-body problem.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

2011
Astrophysical particle simulations with large custom GPU clusters on three continents.
Comput. Sci. Res. Dev., 2011

2010
Simulating the universe on an intercontinental grid of supercomputers
CoRR, 2010

Simulating the Universe on an Intercontinental Grid.
Computer, 2010

190 TFlops Astrophysical N-body Simulation on a Cluster of GPUs.
Proceedings of the Conference on High Performance Computing Networking, 2010

Astrophysical Particle Simulations with Custom GPU Clusters.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009
A novel multiple-walk parallel algorithm for the Barnes-Hut treecode on GPUs - towards cost effective, high performance N-body simulation.
Comput. Sci. Res. Dev., 2009

42 TFlops hierarchical <i>N</i>-body simulations on GPUs with applications in both astrophysics and turbulence.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

A Comparative Study on ASIC, FPGAs, GPUs and General Purpose Processors in the O(N^2) Gravitational N-body Simulation.
Proceedings of the NASA/ESA Conference on Adaptive Hardware and Systems, 2009


  Loading...