Yusaku Yamamoto

Orcid: 0000-0001-5682-3434

According to our database, Yusaku Yamamoto authored at least 56 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
A fast and efficient computation method for reflective diffraction simulations.
Comput. Phys. Commun., 2024

Automatic performance tuning using the ATMathCoreLib tool: Two experimental studies related to dense symmetric eigensolvers.
Concurr. Comput. Pract. Exp., 2024

A Cholesky QR type algorithm for computing tall-skinny QR factorization with column pivoting.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Approximate Block Diagonalization of Symmetric Matrices Using Quantum Annealing.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2024

2023
Roundoff error analysis of the double exponential formula-based method for the matrix sign function.
CoRR, 2023

A fast and accurate computation method for reflective diffraction simulations.
CoRR, 2023

2022
Convergence to Singular Triplets in the Two-Sided Block-Jacobi SVD Algorithm with Dynamic Ordering.
SIAM J. Matrix Anal. Appl., September, 2022

Error analysis of the truncated Taylor series expansion method for computing matrix exponential.
JSIAM Lett., 2022

Performance prediction of massively parallel computation by Bayesian inference.
JSIAM Lett., 2022

Automatic Code Selection for the Dense Symmetric Generalized Eigenvalue Problem Using ATMathCoreLib.
Proceedings of the Parallel Processing and Applied Mathematics, 2022

2021
Block red-black MILU(0) preconditioner with relaxation on GPU.
Parallel Comput., 2021

Combinatorial preconditioning for accelerating the convergence of the parallel block Jacobi method for the symmetric eigenvalue problem.
JSIAM Lett., 2021

2020
Shifted Cholesky QR for Computing the QR Factorization of Ill-Conditioned Matrices.
SIAM J. Sci. Comput., 2020

Fixed-point analysis of Ogita-Aishima's symmetric eigendecomposition refinement algorithm for multiple eigenvalues.
JSIAM Lett., 2020

A Parallelizable Energy-Preserving Integrator MB4 and Its Application to Quantum-Mechanical Wavepacket Dynamics.
CoRR, 2020

Error Analysis of the Cholesky QR-Based Block Orthogonalization Process for the One-Sided Block Jacobi SVD Algorithm.
Comput. Informatics, 2020

2019
Asymptotic Quadratic Convergence of the Two-Sided Serial and Parallel Block-Jacobi SVD Algorithm.
SIAM J. Matrix Anal. Appl., 2019

On using the shifted minimal residual method for quantum-mechanical wave packet simulation.
JSIAM Lett., 2019

High-Performance Algorithms for Numerical Linear Algebra.
Proceedings of the Art of High Performance Computing for Computational Science, 2019

2018
Performance of the parallel block Jacobi method with dynamic ordering for the symmetric eigenvalue problem.
JSIAM Lett., 2018

A Case Study on Modeling the Performance of Dense Matrix Computation: Tridiagonalization in the EigenExa Eigensolver on the K Computer.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

2017
Asymptotic quadratic convergence of the serial block-Jacobi EVD algorithm for Hermitian matrices.
Numerische Mathematik, 2017

Probabilistic analysis of an estimator for the Frobenius norm of a matrix product.
JSIAM Lett., 2017

On the optimality and sharpness of Laguerre's lower bound on the smallest eigenvalue of a symmetric positive definite matrix.
CoRR, 2017

Performance analysis and optimization of the parallel one-sided block Jacobi SVD algorithm with dynamic ordering and variable blocking.
Concurr. Comput. Pract. Exp., 2017

On Using the Cholesky QR Method in the Full-Blocked One-Sided Jacobi Algorithm.
Proceedings of the Parallel Processing and Applied Mathematics, 2017

2016
Roundoff error analysis of the CholeskyQR2 algorithm in an oblique inner product.
JSIAM Lett., 2016

On Constructing Cost Models for Online Automatic Tuning Using ATMathCoreLib: Case Studies through the SVD Computation on a Multicore Processor.
Proceedings of the 10th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2016

2015
A new subtraction-free formula for lower bounds of the minimal singular value of an upper bidiagonal matrix.
Numer. Algorithms, 2015

Implementation details of an extended oqds algorithm for singular values.
JSIAM Lett., 2015

Performance of the Parallel One-Sided Block Jacobi SVD Algorithm on a Modern Distributed-Memory Parallel Computer.
Proceedings of the Parallel Processing and Applied Mathematics, 2015

iWAPT Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

2014
Convergence analysis of the parallel classical block Jacobi method for the symmetric eigenvalue problem.
JSIAM Lett., 2014

Performance Analysis of the Householder-Type Parallel Tall-Skinny QR Factorizations Toward Automatic Algorithm Selection.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30, 2014

CholeskyQR2: a simple and communication-avoiding algorithm for computing a tall-skinny QR factorization on a large-scale parallel system.
Proceedings of the 5th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2014

2012
Error analysis for matrix eigenvalue algorithm based on the discrete hungry Toda equation.
Numer. Algorithms, 2012

2011
A parallel algorithm for incremental orthogonalization based on the compact WY representation.
JSIAM Lett., 2011

Cache optimization of a non-orthogonal joint diagonalization method.
JSIAM Lett., 2011

Acceleration of Hessenberg Reduction for Nonsymmetric Eigenvalue Problems in a Hybrid CPU-GPU Computing Environment.
Int. J. Netw. Comput., 2011

2010
Differential qd algorithm for totally nonnegative Hessenberg matrices: introduction of origin shifts and relationship with the discrete hungry Lotka-Volterra system.
JSIAM Lett., 2010

Performance Modeling of Multishift QR Algorithms for the Parallel Solution of Symmetric Tridiagonal Eigenvalue Problems.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2010

Acceleration of Hessenberg Reduction for Nonsymmetric Eigenvalue Problems Using GPU.
Proceedings of the First International Conference on Networking and Computing, 2010

Dynamic Programming Approaches to Optimizing the Blocking Strategy for Basic Matrix Decompositions.
Proceedings of the Software Automatic Tuning, From Concepts to State-of-the-Art Results, 2010

2009
Differential qd algorithm for totally nonnegative band matrices: convergence properties and error analysis.
JSIAM Lett., 2009

A Fully Pipelined Multishift QR Algorithm for Parallel Solution of Symmetric Tridiagonal Eigenproblems.
Inf. Media Technol., 2009

2008
Equality conditions for lower bounds on the smallest singular value of a bidiagonal matrix.
Appl. Math. Comput., 2008

A dynamic programming approach to optimizing the blocking strategy for the Householder QR decomposition.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

A large-grained parallel algorithm for nonlinear eigenvalue problems and its implementation using OmniRPC.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

2007
Accelerating the Singular Value Decomposition of Rectangular Matrices with the CSX600 and the Integrable SVD.
Proceedings of the Parallel Computing Technologies, 2007

2006
Performance Modeling and Optimal Block Size Selection for the Small-Bulge Multishift QR Algorithm.
Proceedings of the Parallel and Distributed Processing and Applications, 2006

Efficient parallel implementation of a weather derivatives pricing algorithm based on the fast Gauss transform.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005
An Efficient and Easily Parallelizable Algorithm for Pricing Weather Derivatives.
Proceedings of the Large-Scale Scientific Computing, 5th International Conference, 2005

2003
Application of the Fast Gauss Transform to Option Pricing.
Manag. Sci., 2003

A Vector-Parallel FFT with a User-Specificable Data Distribution Scheme.
Proceedings of the Parallel and Distributed Processing and Applications, 2003

2000
A Multi-color Inverse Iteration for a High Performance Real Symmetric Eigensolver (Research Note).
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999
A New M-Sequence Based Parallel Random Number Generator with Reduced Correlation.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

