Takahiro Katagiri

ORCID: 0000-0001-7193-9304

According to our database, Takahiro Katagiri authored at least 63 papers between 1999 and 2024.

Bibliography

2024
ppOpen-AT: A Directive-base Auto-tuning Language.
CoRR, 2024

RAO-SS: A Prototype of Run-time Auto-tuning Facility for Sparse Direct Solvers.
CoRR, 2024

An Auto-tuning Method for Run-time Data Transformation for Sparse Matrix-Vector Multiplication.
CoRR, 2024

Adaptation of XAI to Auto-tuning for Numerical Libraries.
CoRR, 2024

Xabclib: A Fully Auto-tuned Sparse Iterative Solver.
CoRR, 2024

A Communication Avoiding and Reducing Algorithm for Symmetric Eigenproblem for Very Small Matrices.
CoRR, 2024

Performance Evaluation of CMOS Annealing with Support Vector Machine.
CoRR, 2024

Implementing Fast Modal Filtering of SCALE-DG.
Proceedings of the IEEE International Conference on Cluster Computing, 2024

2023
Autotuning by Changing Directives and Number of Threads in OpenMP using ppOpen-AT.
CoRR, 2023

Implementation of Radio Wave Propagation using RT Cores and Consideration of Programming Models.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Parallelization of Automatic Tuning for Hyperparameter Optimization of Pedestrian Route Prediction Applications using Machine Learning.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2023

Auto-tuning Mixed-precision Computation by Specifying Multiple Regions.
Proceedings of the Eleventh International Symposium on Computing and Networking, CANDAR 2023, Matsue, Japan, November 28, 2023

2022
Autotuning Power Consumption and Computation Accuracy using ppOpen-AT.
Proceedings of the 15th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2022

2021
Parallelization of GKV benchmark using OpenACC.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

2020
Performance Evaluation of Accurate Matrix-Matrix Multiplication on GPU Using Sparse Matrix Multiplications.
Proceedings of the Eighth International Symposium on Computing and Networking Workshops, 2020

2019
Performance Evaluation of the MODYLAS Application on Modern Multi-core and Many-Core Environments.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

Performance Improvement of High-Speed File Transfer Over JHPCN.
Proceedings of the 2019 IEEE Intl Conf on Dependable, 2019

Application of Techniques for High-Performance Computing.
Proceedings of the Art of High Performance Computing for Computational Science, 2019

Hybrid Parallelization Techniques.
Proceedings of the Art of High Performance Computing for Computational Science, 2019

Basics of OpenMP Programming.
Proceedings of the Art of High Performance Computing for Computational Science, 2019

Basics of MPI Programming.
Proceedings of the Art of High Performance Computing for Computational Science, 2019

High-Performance Computing Basics.
Proceedings of the Art of High Performance Computing for Computational Science, 2019

2018
A thread-level parallelization of pairwise additive potential and force calculations suitable for current many-core architectures.
J. Supercomput., 2018

Japanese Autotuning Research: Autotuning Languages and FFT.
Proc. IEEE, 2018

Auto-Tuning for the Era of Relatively High Bandwidth Memory Architectures: A Discussion Based on an FDM Application.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Threaded Accurate Matrix-Matrix Multiplications with Sparse Matrix-Vector Multiplications.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Preconditioner Auto-Tuning Using Deep Learning for Sparse Iterative Algorithms.
Proceedings of the Sixth International Symposium on Computing and Networking, 2018

Optimizing Forward Computation in Adjoint Method via Multi-level Blocking.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2018

2017
D-Spline Performance Tuning Method Flexibly Responsive to Execution Time Perturbation.
Proceedings of the Parallel Processing and Applied Mathematics, 2017

Auto-Tuning on NUMA and Many-Core Environments with an FDM Code.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016
Utilization and Expansion of ppOpen-AT for OpenACC.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Auto-Tuning of Hybrid MPI/OpenMP Execution with Code Selection by ppOpen-AT.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

From FLOPS to BYTES: disruptive change in high-performance computing towards the post-Moore era.
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2015
Enhancement of Incremental Performance Parameter Estimation on ppOpen-AT.
Proceedings of the IEEE 9th International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2015

Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

2014
Implementation of d-Spline-based incremental performance parameter estimation method with ppOpen-AT.
Sci. Program., 2014

Performance Optimization of SpMV Using CRS Format by Considering OpenMP Scheduling on CPUs and MIC.
Proceedings of the IEEE 8th International Symposium on Embedded Multicore/Manycore SoCs, 2014

Auto-tuning of Computation Kernels from an FDM Code with ppOpen-AT.
Proceedings of the IEEE 8th International Symposium on Embedded Multicore/Manycore SoCs, 2014

Implementation and Evaluation of an AMR Framework for FDM Applications.
Proceedings of the International Conference on Computational Science, 2014

2013
A Mathematical Method for Online Autotuning of Power and Energy Consumption with Corrected Temperature Effects.
Proceedings of the International Conference on Computational Science, 2013

A Sparse Matrix Library with Automatic Selection of Iterative Solvers and Preconditioners.
Proceedings of the International Conference on Computational Science, 2013

2012
Implementation and Evaluation of 3D Finite Element Method Application for CUDA.
Proceedings of the High Performance Computing for Computational Science, 2012

Control Formats for Unsymmetric and Symmetric Sparse Matrix-Vector Multiplications on OpenMP Implementations.
Proceedings of the High Performance Computing for Computational Science, 2012

A Smart Tuning Strategy for Restart Frequency of GMRES(m) with Hierarchical Cache Sizes.
Proceedings of the High Performance Computing for Computational Science, 2012

SSG-AT: An Auto-tuning Method of Sparse Matrix-vector Multiplication for Semi-structured Grids - An Adaptation to OpenFOAM.
Proceedings of the IEEE 6th International Symposium on Embedded Multicore/Manycore SoCs, 2012

2011
The Sixth International Workshop on Automatic Performance Tuning (iWAPT2011).
Proceedings of the International Conference on Computational Science, 2011

2010
A Massively Parallel Dense Symmetric Eigensolver with Communication Splitting Multicasting Algorithm.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

ABCLibScript: A Computer Language for Automatic Performance Tuning.
Proceedings of the Software Automatic Tuning, From Concepts to State-of-the-Art Results, 2010

2006
ABCLib_DRSSED: A parallel eigensolver with an auto-tuning facility.
Parallel Comput., 2006

ABCLibScript: a directive to support specification of an auto-tuning facility for numerical software.
Parallel Comput., 2006

Parallel Processing of Matrix Multiplication in a CPU and GPU Heterogeneous Environment.
Proceedings of the High Performance Computing for Computational Science, 2006

d-Spline Based Incremental Parameter Estimation in Automatic Performance Tuning.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Automatic Performance Tuning for the Multi-section with Multiple Eigenvalues Method for Symmetric Tridiagonal Eigenproblems.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

2005
A time-to-live based reservation algorithm on fully decentralized resource discovery in Grid computing.
Parallel Comput., 2005

Evaluation of the Acknowledgment Reduction in a Software-DSM System.
Proceedings of the Parallel Processing and Applied Mathematics, 2005

2004
The SimCore/Alpha Functional Simulator.
Proceedings of the 2004 workshop on Computer architecture education, 2004

Effect of auto-tuning with user's knowledge for numerical software.
Proceedings of the First Conference on Computing Frontiers, 2004

2003
FIBER: A Generalized Framework for Auto-tuning Software.
Proceedings of the High Performance Computing, 5th International Symposium, 2003

2002
Performance Evaluation of Parallel Gram-Schmidt Re-orthogonalization Methods.
Proceedings of the High Performance Computing for Computational Science, 2002

Knowledge Discovery in Auto-tuning Parallel Numerical Library.
Proceedings of the Progress in Discovery Science, 2002

2001
An efficient implementation of parallel eigenvalue computation for massively parallel processing.
Parallel Comput., 2001

1999
A Parallel Implementation of Eigensolver and Its Performance.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

