Keita Teranishi

Orcid: 0000-0001-6647-2690

According to our database1, Keita Teranishi authored at least 58 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Large language model evaluation for high-performance computing software development.
Concurr. Comput. Pract. Exp., November, 2024

MatRIS: Addressing the Challenges for Portability and Heterogeneity Using Tasking for Matrix Decomposition (Cholesky).
Proceedings of the Asynchronous Many-Task Systems and Applications, 2024

Asynchrony and Failure Masking via Pseudo-Local Process Recovery in MPI Applications.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

eCC++ : A Compiler Construction Framework for Embedded Domain-Specific Languages.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2024

CHARM-SYCL & IRIS: A Tool Chain for Performance Portability on Extremely Heterogeneous Systems.
Proceedings of the 20th IEEE International Conference on e-Science, 2024

2023
Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation.
CoRR, 2023

Analyzing the Performance Portability of Tensor Decomposition.
CoRR, 2023

MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation.
Proceedings of the 52nd International Conference on Parallel Processing Workshops, 2023

Scalable Incremental Checkpointing using GPU-Accelerated De-Duplication.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

Optimizing Data Movement for GPU-Based In-Situ Workflow Using GPUDirect RDMA.
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023

2022
Resiliency in numerical algorithm design for extreme scale simulations.
Int. J. High Perform. Comput. Appl., 2022

Integrating process, control-flow, and data resiliency layers using a hybrid Fenix/Kokkos approach.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

Assembling Portable In-Situ Workflow from Heterogeneous Components using Data Reorganization.
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

2021
Resilience and fault tolerance in high-performance computing for numerical weather and climate prediction.
Int. J. High Perform. Comput. Appl., 2021

Towards High Performance Resilience Using Performance Portable Abstractions.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021

2020
CoREC: Scalable and Resilient In-memory Data Staging for In-situ Workflows.
ACM Trans. Parallel Comput., 2020

Fault tolerance of MPI applications in exascale systems: The ULFM solution.
Future Gener. Comput. Syst., 2020

Parameter Sensitivity Analysis of the SparTen High Performance Sparse Tensor Decomposition Software: Extended Analysis.
CoRR, 2020

Integrating Inter-Node Communication with a Resilient Asynchronous Many-Task Runtime System.
Proceedings of the Workshop on Exascale MPI, 2020

Improving Scalability of Silent-Error Resilience for Message-Passing Solvers via Local Recovery and Asynchrony.
Proceedings of the 10th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, 2020

SparTen: Leveraging Kokkos for On-node Parallelism in a Second-Order Method for Fitting Canonical Polyadic Tensor Models to Poisson Data.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

Parameter Sensitivity Analysis of the SparTen High Performance Sparse Tensor Decomposition Software.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

2019
Enabling Resilience in Asynchronous Many-Task Programming Models.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019

2018
RedThreads: An Interface for Application-Level Fault Detection/Correction Through Adaptive Redundant Multithreading.
Int. J. Parallel Program., 2018

Introduction to PDSEC 2018 and Keynotes.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Scalable Data Resilience for In-memory Data Staging.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

2017
Modeling and Simulating Multiple Failure Masking Enabled by Local Recovery for Stencil-Based Applications at Extreme Scales.
IEEE Trans. Parallel Distributed Syst., 2017

Scalable Failure Masking for Stencil Computations using Ghost Region Expansion and Cell to Rank Remapping.
SIAM J. Sci. Comput., 2017

Exploring versioned distributed arrays for resilience in scientific applications.
Int. J. High Perform. Comput. Appl., 2017

Introduction to PDSEC Workshop.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016
PDSEC Introduction and Committees.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Evaluating Online Global Recovery with Fenix Using Application-Aware In-Memory Checkpointing Techniques.
Proceedings of the 45th International Conference on Parallel Processing Workshops, 2016

2015
Practical scalable consensus for pseudo-synchronous distributed systems.
Proceedings of the International Conference for High Performance Computing, 2015

Local recovery and failure masking for stencil-based applications at extreme scales.
Proceedings of the International Conference for High Performance Computing, 2015

PDSEC Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Versioned Distributed Arrays for Resilience in Scientific Applications: Global View Resilience.
Proceedings of the International Conference on Computational Science, 2015

Evolving the Message Passing Programming Model via a Fault-Tolerant, Object-oriented Transport Layer.
Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale, 2015

Exploring Failure Recovery for Stencil-based Applications at Extreme Scales.
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

2014
Fault Tolerance in an Inner-Outer Solver: A GVR-Enabled Case Study.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30, 2014

Toward Local Failure Local Recovery Resilience Model using MPI-ULFM.
Proceedings of the 21st European MPI Users' Group Meeting, 2014

PDSEC Introduction and Committees.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

An evaluation of lazy fault detection based on Adaptive Redundant Multithreading.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2014

Opportunistic application-level fault detection through adaptive redundant multithreading.
Proceedings of the International Conference on High Performance Computing & Simulation, 2014

Extreme-Scale Viability of Collective Communication for Resilient Task Scheduling and Work Stealing.
Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2014

2011
A convergence improvement of the BSAIC preconditioner by deflation.
JSIAM Lett., 2011

2010
Parallel Hybrid Preconditioning: Incomplete Factorization with Selective Sparse Approximate Inversion.
SIAM J. Sci. Comput., 2010

A block sparse approximate inverse with cutoff preconditioner for semi-sparse linear systems derived from Molecular Orbital calculations.
JSIAM Lett., 2010

Software Automatic Tuning: Concepts and State-of-the-Art Results.
Proceedings of the Software Automatic Tuning, From Concepts to State-of-the-Art Results, 2010

2004
Parallel Hybrid Sparse Solvers Through Flexible Incomplete Cholesky Preconditioning.
Proceedings of the Applied Parallel Computing, 2004

Towards a Grid enabled system for multicomponent materials design.
Proceedings of the 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2004), 2004

2003
A latency tolerant hybrid sparse solver using incomplete Cholesky factorization.
Numer. Linear Algebra Appl., 2003

Time-Memory Trade-Offs Using Sparse Matrix Methods for Large-Scale Eigenvalue Problems.
Proceedings of the Computational Science and Its Applications, 2003

2002
A new data-mapping scheme for latency-tolerant distributed sparse triangular solution.
Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

A Combinatorial Scheme for Developing Efficient Composite Solvers.
Proceedings of the Computational Science - ICCS 2002, 2002

2001
Scalable Preconditioning Using Incomplete Factors.
Proceedings of the Tenth SIAM Conference on Parallel Processing for Scientific Computing, 2001


  Loading...