Kenichi Hagihara

According to our database1, Kenichi Hagihara authored at least 113 papers between 1979 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2019
PACC: a directive-based programming framework for out-of-core stencil computation on accelerators.
Int. J. High Perform. Comput. Netw., 2019

GPU-based branch-and-bound method to solve large 0-1 knapsack problems with data-centric strategies.
Concurr. Comput. Pract. Exp., 2019

2018
Transparent Avoidance of Redundant Data Transfer on GPU-enabled Apache Spark.
Proceedings of the 11th Workshop on General Purpose Processing using GPUs, 2018

2017
Parallelizing Exact and Approximate String Matching via Inclusive Scan on a GPU.
IEEE Trans. Parallel Distributed Syst., 2017

Cache-Aware, In-Place Rotation Method for Texture-Based Volume Rendering.
IEICE Trans. Inf. Syst., 2017

An Out-of-Core Branch and Bound Method for Solving the 0-1 Knapsack Problem on a GPU.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2017

2016
Reducing memory usage by the lifting-based discrete wavelet transform with a unified buffer on a GPU.
J. Parallel Distributed Comput., 2016

Cache-Aware GPU Optimization for Out-of-Core Cone Beam CT Reconstruction of High-Resolution Volumes.
IEICE Trans. Inf. Syst., 2016

An Extension of OpenACC Directives for Out-of-Core Stencil Computation with Temporal Blocking.
Proceedings of the Third Workshop on Accelerator Programming Using Directives, 2016

An OpenACC Optimizer for Accelerating Histogram Computation on a GPU.
Proceedings of the 24th Euromicro International Conference on Parallel, 2016

Towards Automating Multi-dimensional Data Decomposition for Executing a Single-GPU Code on a Multi-GPU System.
Proceedings of the Fourth International Symposium on Computing and Networking, 2016

2015
A bit-parallel algorithm for searching multiple patterns with various lengths.
J. Parallel Distributed Comput., 2015

Enumerating Joint Weight of a Binary Linear Code Using Parallel Architectures: multi-core CPUs and GPUs.
Int. J. Netw. Comput., 2015

Accelerating the Smith-Waterman algorithm with interpair pruning and band optimization for the all-pairs comparison of base sequences.
BMC Bioinform., 2015

2014
Accelerating ODE-Based Simulation of General and Heterogeneous Biophysical Models Using a GPU.
IEEE Trans. Parallel Distributed Syst., 2014

Efficient Acceleration of Mutual Information Computation for Nonrigid Registration Using CUDA.
IEEE J. Biomed. Health Informatics, 2014

Improving cache locality for GPU-based volume rendering.
Parallel Comput., 2014

A Fine Grained Cycle Sharing System with Cooperative Multitasking on GPUs.
Int. J. Netw. Comput., 2014

A parallel scheme for accelerating parameter sweep applications on a GPU.
Concurr. Comput. Pract. Exp., 2014

A Parallel Algorithm for Enumerating Joint Weight of a Binary Linear Code in Network Coding.
Proceedings of the Second International Symposium on Computing and Networking, 2014

2013
GPU-Chariot: A Programming Framework for Stream Applications Running on Multi-GPU Systems.
IEICE Trans. Inf. Syst., 2013

A versatile platform for multilevel modeling of physiological systems: Template/instance framework for large-scale modeling and simulation.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

2012
Sequence Homology Search Using Fine Grained Cycle Sharing of Idle GPUs.
IEEE Trans. Parallel Distributed Syst., 2012

A task parallel algorithm for finding all-pairs shortest paths using the GPU.
Int. J. High Perform. Comput. Netw., 2012

Cooperative multitasking for GPU-accelerated grid systems.
Concurr. Comput. Pract. Exp., 2012

Multilevel Modeling of Physiological Systems and Simulation Platform: PhysioDesigner, Flint and Flint K3 Service.
Proceedings of the 12th IEEE/IPSJ International Symposium on Applications and the Internet, 2012

Acceleration of variance of color differences-based demosaicing using CUDA.
Proceedings of the 2012 International Conference on High Performance Computing & Simulation, 2012

Improving Cache Locality for Ray Casting with CUDA.
Proceedings of the ARCS 2012 Workshops, 28. Februar - 2. März 2012, München, Germany, 2012

2011
An Open Platform toward Large-Scale Multilevel Modeling and Simulation of Physiological Systems.
Proceedings of the 11th Annual International Symposium on Applications and the Internet, 2011

Accelerating Parameter Sweep Applications Using CUDA.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011

2010
High-performance cone beam reconstruction using CUDA compatible GPUs.
Parallel Comput., 2010

A middleware for efficient stream processing in CUDA.
Comput. Sci. Res. Dev., 2010

Accelerating Smith-Waterman Algorithm for Biological Database Search on CUDA-Compatible GPUs.
IEICE Trans. Inf. Syst., 2010

insilicoSim: an extendable engine for parallel heterogeneous biophysical simulations.
Proceedings of the 3rd International Conference on Simulation Tools and Techniques, 2010

A Multi-GPU Spectrometer System for Real-Time Wide Bandwidth Radio Signal Analysis.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2010

Out-of-core cone beam reconstruction using multiple GPUS.
Proceedings of the 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2010

2009
Harnessing the Power of Idle GPUs for Acceleration of Biological Sequence Alignment.
Parallel Process. Lett., 2009

Optimization Techniques for Parallel Biophysical Simulations Generated by <i>insilico</i>IDE.
Inf. Media Technol., 2009

Computing Low Latency Batches with Unreliable Workers in Volunteer Computing Environments.
J. Grid Comput., 2009

Harnessing the power of idle GPUs for acceleration of biological sequence alignment.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

PyMW - A Python module for desktop grid and volunteer computing.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

2008
A Resource Selection System for Cycle Stealing in GPU Grids.
J. Grid Comput., 2008

A decompression pipeline for accelerating out-of-core volume rendering of time-varying data.
Comput. Graph., 2008

Static Load Distribution for Communication Intensive Parallel Computing in Multiclusters.
Proceedings of the 16th Euromicro International Conference on Parallel, 2008

A Task Parallel Algorithm for Computing the Costs of All-Pairs Shortest Paths on the CUDA-Compatible GPU.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2008

Computing low latency batches with unreliable workers in volunteer computing environments.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Accelerating Cone Beam Reconstruction Using the CUDA-Enabled GPU.
Proceedings of the High Performance Computing, 2008

Design and implementation of the Smith-Waterman algorithm on the CUDA-compatible GPU.
Proceedings of the 8th IEEE International Conference on Bioinformatics and Bioengineering, 2008

2007
Parallel Adaptive Estimation of Hip Range of Motion for Total Hip Replacement Surgery.
IEICE Trans. Inf. Syst., 2007

Priority Control to Avoid Job Overtaking in Multiple Job Scheduling for a Desktop Grid.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2007

Application of Grid Task Scheduling Algorithm RR to Medium-Grained Evolution Strategies.
Proceedings of the Third International Conference on Natural Computation, 2007

Real-time rendering of time-varying volume data using a single cots computer.
Proceedings of the GRAPP 2007, 2007

Grid task scheduling algorithm R3Q for evolution strategies.
Proceedings of the IEEE Congress on Evolutionary Computation, 2007

2006
Trace reduction for performance improvement assessment of message passing parallel programs.
Syst. Comput. Jpn., 2006

A parallel implementation of 2-D/3-D image registration for computer-assisted surgery.
Int. J. Bioinform. Res. Appl., 2006

Grid Resource Monitoring and Selection for Rapid Turnaround Applications.
IEICE Trans. Inf. Syst., 2006

Developing a Web Crawler for Massive Mobile Search Services.
Proceedings of the 7th International Conference on Mobile Data Management (MDM 2006), 2006

A Resource Selection Method for Cycle Stealing in the GPU Grid.
Proceedings of the Frontiers of High Performance Computing and Networking, 2006

A GPGPU Approach for Accelerating 2-D/3-D Rigid Registration of Medical Images.
Proceedings of the Parallel and Distributed Processing and Applications, 2006

Minimizing Data Size for Efficient Data Reuse in Grid-Enabled Medical Applications.
Proceedings of the Biological and Medical Data Analysis, 7th International Symposium, 2006

A code motion technique for accelerating general-purpose computation on the GPU.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Two-stage compression for fast volume rendering of time-varying scalar data.
Proceedings of the 4th International Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia 2006, Kuala Lumpur, Malaysia, November 29, 2006

A 2-Approximation Algorithm for Scheduling Independent Tasks onto a Uniform Parallel Machine and its Extension to a Computational Grid.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

2005
A data distributed parallel algorithm for nonrigid image registration.
Parallel Comput., 2005

Prediction-Aware Experimental Evaluation of Dynamic Task Scheduling Algorithms for Parametric Study on a Desktop Grid.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2005

Performance Study of Nonrigid Registration Algorithm for Investigating Lung Disease on Clusters.
Proceedings of the Sixth International Conference on Parallel and Distributed Computing, 2005

Performance Study of LU Decomposition on the Programmable GPU.
Proceedings of the High Performance Computing, 2005

2004
High-performance computing service over the Internet for intraoperative image processing.
IEEE Trans. Inf. Technol. Biomed., 2004

Evaluation of a compiler with user-selectable execution strategies for parallel recursion.
Syst. Comput. Jpn., 2004

Evaluation of Performance Prediction Method for Master/Slave Parallel Programs.
IEICE Trans. Inf. Syst., 2004

PerWiz: A What-If Prediction Tool for Tuning Message Passing Programs.
Proceedings of the High Performance Computing for Computational Science, 2004

A Comparison among Grid Scheduling Algorithms for Independent Coarse-Grained Tasks.
Proceedings of the 2004 Symposium on Applications and the Internet Workshops (SAINT 2004 Workshops), 2004

Real-Time Estimation of Hip Range of Motion for Total Hip Replacement Surgery.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention -- MICCAI 2004, 2004

Parallel Volume Rendering with Early Ray Termination for Visualizing Large-Scale Datasets.
Proceedings of the Parallel and Distributed Processing and Applications, 2004

A Performance Analysis Tool for Performance Debugging of Message Passing Parallel Programs.
Proceedings of the 33rd International Conference on Parallel Processing Workshops (ICPP 2004 Workshops), 2004

2003
On Approximation of the Bulk Synchronous Task Scheduling Problem.
IEEE Trans. Parallel Distributed Syst., 2003

An improved binary-swap compositing for sort-last parallel rendering on distributed memory multiprocessors.
Parallel Comput., 2003

Debugging Tool for Localizing Faulty Processes in Message Passing Programs
CoRR, 2003

An Improvement on Binary-Swap Compositing for Sort-Last Parallel Rendering.
Proceedings of the 2003 ACM Symposium on Applied Computing (SAC), 2003

Design and Implementation of Parallel Nonrigid Image Registration Using Off-the-Shelf Supercomputers.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2003

Near-Optimal Dynamic Task Scheduling of Precedence Constrained Coarse-Grained Tasks onto a Computational Grid.
Proceedings of the 2nd International Symposium on Parallel and Distributed Computing (ISPDC 2003), 2003

A Divided-Screenwise Hierarchical Compositing for Sort-Last Parallel Volume Rendering.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Near-Optimal Dynamic Task Scheduling of Independent Coarse-Grained Tasks onto a Computational Grid.
Proceedings of the 32nd International Conference on Parallel Processing (ICPP 2003), 2003

A High Performance Computing System for Medical Imaging in the Remote Operating Room.
Proceedings of the High Performance Computing - HiPC 2003, 10th International Conference, 2003

An Emulation System for Predicting Master/Slave Program Performance.
Proceedings of the Euro-Par 2003. Parallel Processing, 2003

A high-performance computing service over the Internet for nonrigid image registration.
Proceedings of the CARS 2003. Computer Assisted Radiology and Surgery. Proceedings of the 17th International Congress and Exhibition, 2003

2002
Non-approximability of the Bulk Synchronous Task Scheduling Problem.
Proceedings of the Euro-Par 2002, 2002

2001
On Message Packaging in Task Scheduling for Distributed Memory Parallel Machines.
Int. J. Found. Comput. Sci., 2001

LogGPS: a parallel computational model for synchronization analysis.
Proceedings of the 2001 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPOPP'01), 2001

Optimal Task Scheduling of a Complete K-Ary Tree with Communication Delays.
Proceedings of the Parallel Processing and Applied Mathematics, 2001

2000
NP-Completeness of the Bulk Synchronous Task Scheduling Problem and Its Approximation Algorithm.
Proceedings of the 5th International Symposium on Parallel Architectures, 2000

1999
A Task Scheduling Algorithm to Package Messages on Distributed Memory Parallel Machines.
Proceedings of the 1999 International Symposium on Parallel Architectures, 1999

1992
Efficient distributed algorithm to solve updating minimum spanning tree problem.
Syst. Comput. Jpn., 1992

1991
Efficient distributed algorithms solving problems about the connectivity of network.
Syst. Comput. Jpn., 1991

A fault-tolerant algorithm for election in complete networks with a sense of direction.
Syst. Comput. Jpn., 1991

1990
Distributed Algorithms for Reconstructing MST after Topology Change.
Proceedings of the Distributed Algorithms, 4th International Workshop, 1990

1989
An efficient distributed algorithm for constructing a breadth-first search tree.
Syst. Comput. Jpn., 1989

Page-number of hypercubes and cube-connected cycles.
Syst. Comput. Jpn., 1989

Optimal Fault-Tolerant Distributed Algorithms for Election in Complete Networks with a Global Sense of Direction.
Proceedings of the Distributed Algorithms, 1989

1988
Distributed algorithms for fault diagnosis of processors.
Syst. Comput. Jpn., 1988

1987
Distributed algorithms tolerant of link failures.
Syst. Comput. Jpn., 1987

An optimal time algorithm for the k-vertex-connectivity unweighted augmentation problem for rooted directed trees.
Discret. Appl. Math., 1987

1986
Area-time complexity on a vlsi model with boundary layout assumption.
Syst. Comput. Jpn., 1986

Embedding area of d-way shuffle graph on a VLSI model.
Syst. Comput. Jpn., 1986

Complexity to determine containment among inequality tableau queries.
Syst. Comput. Jpn., 1986

Optimal-Time Algorithm for the k-Node-Connectivity Augmentation Problem for Ternary Trees.
Syst. Comput. Jpn., 1986

Minimum separation layout for cmos circuits realizing tree-shape monotone decreasing logic circuits.
Syst. Comput. Jpn., 1986

1985
Vulnerability of a communication network with a satellite.
Syst. Comput. Jpn., 1985

1984
Area-Time Optimal Fast Implementation of Several Functions in a VLSI Model.
IEEE Trans. Computers, 1984

1982
Effect of Practical Assumption in Area Complexity of VLSI Computation.
Proceedings of the RIMS Symposium on Software Science and Engineering, 1982

An Editor for Documentation in pi-System to Support Software Development and Maintenance.
Proceedings of the Proceedings, 1982

1980
Specification of schedulers with algebraic specification techniques.
Proceedings of the Operating Systems Engineering: Proceedings of the 14th IBM Computer SCience Symposium, 1980

1979
Decision Problems for Multivalued Dependencies in Relational Databases.
SIAM J. Comput., 1979


  Loading...