Hiroaki Kobayashi
Orcid: 0000-0002-3350-1413
According to our database1,
Hiroaki Kobayashi
authored at least 215 papers
between 1978 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
An Asymptotic Parallel Linear Solver and Its Application to Direct Numerical Simulation for Compressible Turbulence.
Proceedings of the Computational Science - ICCS 2024, 2024
File I/O Cache Performance of Supercomputer Fugaku Using an Out-of-Core Direct Numerical Simulation Code of Turbulence.
Proceedings of the Computational Science - ICCS 2024, 2024
Adaptive Parallelization based on Frame-level and Tile-level Parallelisms for VVC Encoding.
Proceedings of the Twelfth International Symposium on Computing and Networking, 2024
A Graph-based Molecular Structure Identification Method via Feature Extraction for Three-dimensional Electron Diffraction Data.
Proceedings of the Twelfth International Symposium on Computing and Networking, CANDAR 2024, 2024
2023
Door Opening and Closing Considering Forces Using a Mobile Manipulator with an Admittance Controlled Arm.
J. Robotics Mechatronics, December, 2023
An Efficient Reference Image Sharing Method for the Image-Division Parallel Video Encoding Architecture.
IEICE Trans. Electron., June, 2023
ICT Express, February, 2023
Concurr. Comput. Pract. Exp., 2023
Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023
Proceedings of the 16th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2023
Proceedings of the 16th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2023
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
iWAPT2023 Keynote Speaker QC & HPC hybrid computing for simulation & data-analysis hybrid applications.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Proceedings of the International Conference on Machine Learning and Applications, 2023
Performance Evaluation of Tsunami Evacuation Route Planning on Multiple Annealing Machines.
Proceedings of the 20th ACM International Conference on Computing Frontiers, 2023
2022
IEICE Trans. Electron., 2022
Page-Address Coalescing of Vector Gather Instructions for Efficient Address Translation.
Proceedings of the 12th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2022
Proceedings of the Parallel and Distributed Computing, Applications and Technologies, 2022
Proceedings of the Parallel and Distributed Computing, Applications and Technologies, 2022
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022
An Efficient Reference Image Sharing Method for the Parallel Video Encoding Architecture.
Proceedings of the IEEE Symposium in Low-Power and High-Speed Chips, 2022
2021
VGL: a high-performance graph processing framework for the NEC SX-Aurora TSUBASA vector architecture.
J. Supercomput., 2021
Optimizing Load Balance in a Parallel CFD Code for a Large-scale Turbine Simulation on a Vector Supercomputer.
Supercomput. Front. Innov., 2021
Supercomput. Front. Innov., 2021
Distributed Graph Algorithms for Multiple Vector Engines of NEC SX-Aurora TSUBASA Systems.
Supercomput. Front. Innov., 2021
Int. J. Netw. Comput., 2021
An External Definition of the One-Hot Constraint and Fast QUBO Generation for High-Performance Combinatorial Clustering.
Int. J. Netw. Comput., 2021
Proceedings of the 33rd IEEE International Symposium on Computer Architecture and High Performance Computing, 2021
Optimizations of a Linear Matrix Solver in a Composite Simulation for a Vector Computer.
Proceedings of the 12th International Symposium on Parallel Architectures, 2021
Proceedings of the 14th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2021
A Processor Selection Method based on Execution Time Estimation for Machine Learning Programs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021
Proceedings of the Ninth International Symposium on Computing and Networking, 2021
2020
Effects of Using a Memory Stalled Core for Handling MPI Communication Overlapping in the SOR Solver on SX-ACE and SX-Aurora TSUBASA.
Supercomput. Front. Innov., 2020
Proceedings of the 23rd International Symposium on Wireless Personal Multimedia Communications, 2020
Proceedings of the Parallel and Distributed Computing, Applications and Technologies, 2020
Proceedings of the Parallel Architectures, Algorithms and Programming, 2020
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020
An Efficient Skinny Matrix-Matrix Multiplication Method by Folding Input Matrices into Tensor Core Operations.
Proceedings of the Eighth International Symposium on Computing and Networking Workshops, 2020
Proceedings of the Eighth International Symposium on Computing and Networking, 2020
Proceedings of the 2020 IEEE Symposium in Low-Power and High-Speed Chips, 2020
Proceedings of the Benchmarking, Measuring, and Optimizing, 2020
2019
Performance Evaluation of Different Implementation Schemes of an Iterative Flow Solver on Modern Vector Machines.
Supercomput. Front. Innov., 2019
Supercomput. Front. Innov., 2019
Supercomput. Front. Innov., 2019
Optimizing Memory Layout of Hyperplane Ordering for Vector Supercomputer SX-Aurora TSUBASA.
Proceedings of the 2019 IEEE/ACM Workshop on Memory Centric High Performance Computing, 2019
Proceedings of the 9th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2019
Analysis of Relationship Between SIMD-Processing Features Used in NVIDIA GPUs and NEC SX-Aurora TSUBASA Vector Processors.
Proceedings of the Parallel Computing Technologies, 2019
An Appropriate Computing System and Its System Parameters Selection Based on Bottleneck Prediction of Applications.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
Proceedings of the Computational Science - ICCS 2019, 2019
Proceedings of the 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, 2019
Proceedings of the IEEE Symposium in Low-Power and High-Speed Chips, 2019
2018
IEEE Trans. Multi Scale Comput. Syst., 2018
Real-time tsunami inundation forecast system for tsunami disaster prevention and mitigation.
J. Supercomput., 2018
Developing Efficient Implementations of Bellman-Ford and Forward-Backward Graph Algorithms for NEC SX-ACE.
Supercomput. Front. Innov., 2018
A Machine Learning-Based Approach for Selecting SpMV Kernels and Matrix Storage Formats.
IEICE Trans. Inf. Syst., 2018
Proceedings of the International Conference for High Performance Computing, 2018
Proposal of Detour Path Suppression Method in PS Reinforcement Learning and Its Application to Altruistic Multi-agent Environment.
Proceedings of the PRIMA 2018: Principles and Practice of Multi-Agent Systems - 21st International Conference, Tokyo, Japan, October 29, 2018
Search Space Reduction for Parameter Tuning of a Tsunami Simulation on the Intel Knights Landing Processor.
Proceedings of the 12th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2018
Proposal and Evaluation of an Indirect Reward Assignment Method for Reinforcement Learning by Profit Sharing Method.
Proceedings of the Intelligent Systems and Applications, 2018
Proceedings of the 2018 IEEE Symposium in Low-Power and High-Speed Chips, 2018
2017
Potential of a modern vector supercomputer for practical applications: performance evaluation of SX-ACE.
J. Supercomput., 2017
J. Adv. Comput. Intell. Intell. Informatics, 2017
Int. J. Netw. Comput., 2017
A Directive Generation Approach to High Code-Maintainability for Various HPC Systems.
Int. J. Netw. Comput., 2017
IEICE Trans. Inf. Syst., 2017
Proceedings of the 2017 IEEE Symposium in Low-Power and High-Speed Chips, 2017
Proceedings of the 2017 IEEE Symposium in Low-Power and High-Speed Chips, 2017
Proceedings of the 2017 IEEE Symposium in Low-Power and High-Speed Chips, 2017
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
2016
Supercomput. Front. Innov., 2016
Effects of Stacking Granularity on 3-D Stacked Floating-point Fused Multiply Add Units.
SIGARCH Comput. Archit. News, 2016
Int. J. Netw. Comput., 2016
Translation of Large-Scale Simulation Codes for an OpenACC Platform Using the Xevolver Framework.
Int. J. Netw. Comput., 2016
Proceedings of the 10th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2016
Proposal and Evaluation of an Action Selection Strategy with Expected Failure Probability in Multi-agent Learning.
Proceedings of the IEEE International Conference on Agents, 2016
The Importance of Dynamic Load Balancing among OpenMP Thread Teams for Irregular Workloads.
Proceedings of the Fourth International Symposium on Computing and Networking, 2016
Proceedings of the Fourth International Symposium on Computing and Networking, 2016
A User-Defined Code Transformation Approach to Overlapping MPI Communication with Computation.
Proceedings of the Fourth International Symposium on Computing and Networking, 2016
Proposal of an Action Selection Strategy with Expected Failure Probability and Its Evaluation in Multi-agent Reinforcement Learning.
Proceedings of the Multi-Agent Systems and Agreement Technologies, 2016
Proceedings of the 2016 IEEE Symposium in Low-Power and High-Speed Chips, 2016
Proceedings of the 2016 IEEE International 3D Systems Integration Conference, 2016
2015
Sci. Program., 2015
Identification and Elimination of Platform-Specific Code Smells in High Performance Computing Applications.
Int. J. Netw. Comput., 2015
IEICE Trans. Electron., 2015
IEICE Trans. Inf. Syst., 2015
A Visualization Technique to Support Searching and Comparing Features of Multivariate Datasets.
Proceedings of the 19th International Conference on Information Visualisation, 2015
Design of tendon-driven mechanisms for fault tolerance from tendon-breaking by using centroid vectors.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015
Proceedings of the Third International Symposium on Computing and Networking, 2015
A Case Study of Memory Optimization for Migration of a Plasmonics Simulation Application to SX-ACE.
Proceedings of the Third International Symposium on Computing and Networking, 2015
Migration of an Atmospheric Simulation Code to an OpenACC Platform Using the Xevolver Framework.
Proceedings of the Third International Symposium on Computing and Networking, 2015
Proceedings of the Third International Symposium on Computing and Networking, 2015
Proceedings of the 2015 IEEE Symposium in Low-Power and High-Speed Chips, 2015
Proceedings of the 2015 International 3D Systems Integration Conference, 2015
2014
IEEE Trans. Robotics, 2014
MVP-Cache: A Multi-Banked Cache Memory for Energy-Efficient Vector Processing of Multimedia Applications.
IEICE Trans. Inf. Syst., 2014
Design and control of a three-fingered tendon-driven robotic hand with active and passive tendons.
Auton. Robots, 2014
Proceedings of the High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30, 2014
A Compiler-Assisted OpenMP Migration Method Based on Automatic Parallelizing Information.
Proceedings of the Supercomputing - 29th International Conference, 2014
An Approach to Customization of Compiler Directives for Application-Specific Code Transformations.
Proceedings of the IEEE 8th International Symposium on Embedded Multicore/Manycore SoCs, 2014
Proceedings of the 18th International Conference on Information Visualisation, 2014
A Platform-Specific Code Smell Alert System for High Performance Computing Applications.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014
Xevolver: An XML-based code translation framework for supporting HPC application migration.
Proceedings of the 21st International Conference on High Performance Computing, 2014
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014
Proceedings of the 2014 IEEE Symposium on Low-Power and High-Speed Chips, 2014
Design and control methodology for fine grain power gating based on energy characterization and code profiling of microprocessors.
Proceedings of the 19th Asia and South Pacific Design Automation Conference, 2014
Proceedings of the 2014 International 3D Systems Integration Conference, 2014
Proceedings of the 2014 International 3D Systems Integration Conference, 2014
2013
A Capacity-Aware Thread Scheduling Method Combined with Cache Partitioning to Reduce Inter-Thread Cache Conflicts.
IEICE Trans. Inf. Syst., 2013
Proceedings of the 17th International Conference on Information Visualisation, 2013
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013
Design and evaluation of a media-oriented vector processor with a multi-banked cache memory.
Proceedings of the 11th IEEE Symposium on Embedded Systems for Real-time Multimedia, 2013
Proceedings of the 2013 IEEE Symposium on Low-Power and High-Speed Chips, 2013
Proceedings of the 2013 IEEE Symposium on Low-Power and High-Speed Chips, 2013
Proceedings of the 2013 IEEE International 3D Systems Integration Conference (3DIC), 2013
Proceedings of the 2013 IEEE International 3D Systems Integration Conference (3DIC), 2013
2012
Introduction of Fixed Mode States into Online Reinforcement Learning with Penalties and Rewards and its Application to Biped Robot Waist Trajectory Generation.
J. Adv. Comput. Intell. Intell. Informatics, 2012
Poster: Exploring Design Space of a 3D Stacked Vector Cache - Designing a 3D Stacked Vector Cache using Conventional EDA Tools.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012
Proceedings of the 2012 SC Companion: High Performance Computing, 2012
Proceedings of the 7th IEEE Conference on Visual Analytics Science and Technology, 2012
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Proceedings of the 2012 IEEE Symposium on Low-Power and High-Speed Chips, 2012
Proceedings of the Computing Frontiers Conference, CF'12, 2012
Proceedings of the Computing Frontiers Conference, CF'12, 2012
Evaluation of the Improved Penalty Avoiding Rational Policy Making Algorithm in Real World Environment.
Proceedings of the Intelligent Information and Database Systems - 4th Asian Conference, 2012
2011
Trans. High Perform. Embed. Archit. Compil., 2011
A Self-Organized Overlay Network Management Mechanism for Heterogeneous Environments.
J. Inf. Process., 2011
IEICE Trans. Inf. Syst., 2011
A History-Based Performance Prediction Model with Profile Data Classification for Automatic Task Allocation in Heterogeneous Computing Systems.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2011
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011
A middle-grain circuit partitioning strategy for 3-D integrated floating-point multipliers.
Proceedings of the 2011 IEEE International 3D Systems Integration Conference (3DIC), Osaka, Japan, January 31, 2011
Proceedings of the 2011 IEEE International 3D Systems Integration Conference (3DIC), Osaka, Japan, January 31, 2011
2010
A Fast Ray-Tracing Using Bounding Spheres and Frustum Rays for Dynamic Scene Rendering.
IEICE Trans. Inf. Syst., 2010
Proceedings of the Tenth Annual International Symposium on Applications and the Internet, 2010
Proceedings of the Tenth Annual International Symposium on Applications and the Internet, 2010
Proceedings of the 28th International Conference on Computer Design, 2010
Proceedings of the Facing the Multicore-Challenge, 2010
Proceedings of the 13th Euromicro Conference on Digital System Design, 2010
A block-parallel signal processing system for CMOS image sensor with three-dimensional structure.
Proceedings of the IEEE International Conference on 3D System Integration, 2010
Proceedings of the IEEE International Conference on 3D System Integration, 2010
Proceedings of the IEEE International Conference on 3D System Integration, 2010
Proceedings of the Software Automatic Tuning, From Concepts to State-of-the-Art Results, 2010
2009
Object exploration and manipulation using a robotic finger equipped with an optical three-axis tactile sensor.
Robotica, 2009
A New Improved Penalty Avoiding Rational Policy Making Algorithm for Keepaway with Continuous State Spaces.
J. Adv. Comput. Intell. Intell. Informatics, 2009
Int. J. Grid High Perform. Comput., 2009
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009
Proceedings of the 2009 International Conference on Parallel and Distributed Computing, 2009
Performance tuning and analysis of future vector processors based on the roofline model.
Proceedings of the 10th workshop on MEmory performance, 2009
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009
Proceedings of the IEEE International Conference on 3D System Integration, 2009
Proceedings of the IEEE International Conference on 3D System Integration, 2009
2008
Proceedings of the 2008 International Symposium on Applications and the Internet, 2008
Consideration of Resource Access History for Optimizing Overlay Networks in P2P-Based Resource Discovery.
Proceedings of the 2008 International Symposium on Applications and the Internet, 2008
Proceedings of the 9th workshop on MEmory performance, 2008
Proceedings of the 9th workshop on MEmory performance, 2008
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2008
Effects of MSHR and Prefetch Mechanisms on an On-Chip Cache of the Vector Architecture.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2008
Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008
Implementation and evaluation of a distributed and cooperative load-balancing mechanism for dependable volunteer computing.
Proceedings of the 38th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2008
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008
Proceedings of the High Performance Computing on Vector Systems 2008, 2008
2007
Neural Networks, 2007
Proceedings of the 2007 workshop on MEmory performance, 2007
A power-aware shared cache mechanism based on locality assessment of memory reference for CMPs.
Proceedings of the 2007 workshop on MEmory performance, 2007
Proceedings of the 25th International Conference on Computer Design, 2007
2006
Hierarchical parallel processing of large scale data clustering on a PC cluster with GPU co-processing.
J. Supercomput., 2006
Preparation and Evaluation of Aligned Naphthacene Thin Films Using Surface Plasmon Excitation.
IEICE Trans. Electron., 2006
Evaluating Computational Performance of Backpropagation Learning on Graphics Hardware.
Proceedings of the Irish Conference on the Mathematical Foundations of Computer Science and Information Technology, 2006
Proceedings of the 2006 International Symposium on Applications and the Internet Workshops (SAINT 2006 Workshops), 2006
Design and Implementation of an Efficient Search Mechanism Based on the Hybrid P2P Model for Ubiquitous Computing Systems.
Proceedings of the 2006 International Symposium on Applications and the Internet (SAINT 2006), 2006
Proceedings of the 15th IEEE International Symposium on Robot and Human Interactive Communication, 2006
Implications of Memory Performance for Highly Efficient Supercomputing of Scientific Applications.
Proceedings of the Parallel and Distributed Processing and Applications, 2006
An Efficient Text Capture Method for Moving Robots Using DCT Feature and Text Tracking.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
Proceedings of the 5th Annual IEEE/ACIS International Conference on Computer and Information Science (ICIS 2006) and 1st IEEE/ACIS International Workshop on Component-Based Software Engineering, 2006
2005
SIGARCH Comput. Archit. News, 2005
A Self-Organizing Overlay Network to Exploit the Locality of Interests for Effective Resource Discovery in P2P Systems.
Proceedings of the 2005 IEEE/IPSJ International Symposium on Applications and the Internet (SAINT 2005), 31 January, 2005
Proceedings of the Parallel and Distributed Processing and Applications, 2005
Sensing characteristics of an optical three-axis tactile sensor mounted on a multi-fingered robotic hand.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005
Text Detection in Color Scene Images based on Unsupervised Clustering of Multi-channel Wavelet Features.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005
An Incremental Photon-Mapping Algorithm for Fast Walk-Through Animations.
Proceedings of the Eighth IASTED International Conference on Computer Graphics and Imaging, 2005
2004
Parallel Comput., 2004
Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'04), 2004
Multi-grain Parallel Processing of Data-Clustering on Programmable Graphics Hardware.
Proceedings of the Parallel and Distributed Processing and Applications, 2004
2003
Autom., 2003
A new impedance control concept for elastic joint robots -a case of a 1 DOF robot with programmable linear passive impedance.
Proceedings of the 2003 IEEE International Conference on Robotics and Automation, 2003
A Comparison Study of Vector Quantization Codebook Design Algorithms based on the Equidistortion Principle.
Proceedings of the 21st IASTED International Multi-Conference on Applied Informatics (AI 2003), 2003
2002
Parallel Algorithm for the Law-of-the-Jungle Learning to the Fast Design of Optimal Codebooks.
Proceedings of the International Conference on Parallel and Distributed Computing Systems, 2002
2001
Proceedings of the 19th International Conference on Computer Design (ICCD 2001), 2001
2000
Developing a practical parallel multi-pass renderer in Java and C++: toward a Grande application in Java.
Proceedings of the ACM 2000 Java Grande Conference, San Francisco, CA, USA, 2000
1999
Syst. Comput. Jpn., 1999
A scheduling method for instruction-level parallel processing of vectorand scalar instructions.
Syst. Comput. Jpn., 1999
A self-organizing network system forming memory from nonstationary probability distributions.
Proceedings of the International Joint Conference Neural Networks, 1999
1998
Int. J. Robotics Res., 1998
Proceedings of the ASP-DAC '98, 1998
1997
Parallel processing of the shear-warp factorization with the binary-swap method on a distributed-memory multiprocessor system.
Proceedings of the IEEE Symposium on Parallel Rendering, 1997
Proceedings of the Computer Graphics International Conference, 1997
1996
Proceedings of IPPS '96, 1996
1994
Proceedings of the International Symposium on Parallel Architectures, 1994
1993
Proceedings of the Proceedings IEEE INFOCOM '93, The Conference on Computer Communications, Twelfth Annual Joint Conference of the IEEE Computer and Communications Societies, Networking: Foundation for the Future, San Francisco, CA, USA, March 28, 1993
Proceedings of the Robotics, Mechatronics and Manufacturing Systems, 1993
1988
Load balancing strategies for a parallel ray-tracing system based on constant subdivision.
Vis. Comput., 1988
1987
Vis. Comput., 1987
Numerical study for increasing high-temperature regions in hyperthermia with ferromagnetic seed implants.
Syst. Comput. Jpn., 1987
1986
Proceedings of the 1986 IEEE International Conference on Robotics and Automation, 1986
1984
A Language Processor of an Intelligent Link System.
Proceedings of the IEEE International Conference on Communications: Links for the Future, 1984
1978
Autom., 1978