Lossy Compressed Collective Inter-FPGA Communications.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2025
Direct Quantized Training of Language Models with Stochastic Rounding.
CoRR, 2024
Estimation of Shor's Circuit for 2048-bit Integers based on Quantum Simulator.
IACR Cryptol. ePrint Arch., 2023
Optimizing the number of scan layers in multi-layered scanning methods for detecting void sizes in concrete building tiles using electromagnetic waves.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2023
Big Data Assimilation: Real-time 30-second-refresh Heavy Rain Forecast Using Fugaku During Tokyo Olympics and Paralympics.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the International Conference for High Performance Computing, 2023
mpiQulacs: A Scalable Distributed Quantum Computer Simulator for ARM-based Clusters.
Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023
Experiments and Resource Analysis of Shor's Factorization Using a Quantum Simulator.
Proceedings of the Information Security and Cryptology - ICISC 2023, 2023
Accelerating Hybrid DFT Simulations Using Performance Modeling on Supercomputers.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023
A Binary Translator to Accelerate Development of Deep Learning Processing Library for AArch64 CPU.
IEICE Trans. Electron., 2022
mpiQulacs: A Distributed Quantum Computer Simulator for A64FX-based Cluster Systems.
CoRR, 2022
MLPerf™ HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2021
Efficient and Large Scale Pre-training Techniques for Japanese Natural Language Processing.
Proceedings of the Ninth International Symposium on Computing and Networking, 2021
The 16, 384-node Parallelism of 3D-CNN Training on An Arm CPU based Supercomputer.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021
An Efficient Technique for Large Mini-batch Challenge of DNNs Training on Large Scale Cluster.
Proceedings of the HPDC '20: The 29th International Symposium on High-Performance Parallel and Distributed Computing, 2020
Visualization of Voids Between Tile and Concrete by Multi-Layered Scanning Method with Electromagnetic Waves.
J. Robotics Mechatronics, 2019
Yet Another Accelerated SGD: ResNet-50 Training on ImageNet in 74.7 seconds.
CoRR, 2019
An Optimal Parallel Algorithm for Computing the Summed Area Table on the GPU.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018
An Efficient GPU Implementation of Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices.
Int. J. Netw. Comput., 2017
GPU-accelerated Exhaustive Verification of the Collatz Conjecture.
Int. J. Netw. Comput., 2017
Accelerating digital halftoning using the local exhaustive search on the GPU.
Concurr. Comput. Pract. Exp., 2017
Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations.
Proceedings of the 46th International Conference on Parallel Processing, 2017
GPU-Accelerated Bulk Execution of Multiple-Length Multiplication with Warp-Synchronous Programming Technique.
IEICE Trans. Inf. Syst., 2016
GPU-Accelerated Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices.
Proceedings of the Fourth International Symposium on Computing and Networking, 2016
A Warp-Synchronous Implementation for Multiple-Length Multiplication on the GPU.
Proceedings of the Third International Symposium on Computing and Networking, 2015
GPU-Accelerated Verification of the Collatz Conjecture.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2014
Digital Railway Museum: An Approach to Introduction of Digital Exhibition Systems at the Railway Museum.
Proceedings of the Human Interface and the Management of Information. Information and Interaction for Learning, Culture, Collaboration and Business,, 2013