Mohamed Wahib
Orcid: 0000-0002-7165-2095
Affiliations:
- RIKEN Center for Computational Science, Kobe, Japan
According to our database, Mohamed Wahib authored at least 81 papers between 2007 and 2025.
Bibliography
2025
CG-Kit: Code Generation Toolkit for performant and maintainable variants of source code applied to Flash-X hydrodynamics simulations.
Future Gener. Comput. Syst., 2025
2024
J. Supercomput., June, 2024
J. Adv. Comput. Intell. Intell. Informatics, January, 2024
CoRR, 2024
A Pairwise Comparison Relation-assisted Multi-objective Evolutionary Neural Architecture Search Method with Multi-population Mechanism.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Appl. Soft Comput., 2024
Proceedings of the International Joint Conference on Neural Networks, 2024
Proceedings of the International Joint Conference on Neural Networks, 2024
Proceedings of the 38th ACM International Conference on Supercomputing, 2024
Surrogate-Assisted Evolutionary Neural Architecture Search with Isomorphic Training and Prediction.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024
Proceedings of the IEEE International Conference on Cluster Computing, 2024
Proceedings of the IEEE International Conference on Cluster Computing, 2024
Proceedings of the IEEE International Conference on Cluster Computing, 2024
Proceedings of the IEEE International Conference on Cluster Computing, 2024
2023
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads.
ACM Trans. Archit. Code Optim., December, 2023
IEEE Trans. Parallel Distributed Syst., October, 2023
Int. J. High Perform. Comput. Appl., July, 2023
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 37th International Conference on Supercomputing, 2023
PERKS: a Locality-Optimized Execution Model for Iterative Memory-bound GPU Applications.
Proceedings of the 37th International Conference on Supercomputing, 2023
Proceedings of the 37th International Conference on Supercomputing, 2023
Exploiting Scratchpad Memory for Deep Temporal Blocking: A case study for 2D Jacobian 5-point iterative stencil kernel (j2d5pt).
Proceedings of the 15th Workshop on General Purpose Processing Using GPU, 2023
2022
Automatic Generation of High-Performance Convolution Kernels on ARM CPUs for Deep Learning.
IEEE Trans. Parallel Distributed Syst., 2022
At the Locus of Performance: A Case Study in Enhancing CPUs with Copious 3D-Stacked Cache.
CoRR, 2022
Proceedings of the Joint 12th International Conference on Soft Computing and Intelligent Systems and 23rd International Symposium on Advanced Intelligent Systems, 2022
Image Gradient Decomposition for Parallel and Memory-Efficient Ptychographic Reconstruction.
Proceedings of the SC22: International Conference for High Performance Computing, 2022
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
2021
Parallel Comput., 2021
Structured Adaptive Mesh Refinement Adaptations to Retain Performance Portability With Increasing Heterogeneity.
Comput. Sci. Eng., 2021
MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems.
CoRR, 2021
Concurr. Comput. Pract. Exp., 2021
Proceedings of the International Conference for High Performance Computing, 2021
MLPerf™ HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems.
Proceedings of the IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2021
Matrix Engines for High Performance Computing: A Paragon of Performance or Grasping at Straws?
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021
Performance portable back-projection algorithms on CPUs: agnostic data locality and vectorization optimizations.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
An Oracle for Guiding Large-Scale Model/Hybrid Parallel Training of Convolutional Neural Networks.
Proceedings of the HPDC '21: The 30th International Symposium on High-Performance Parallel and Distributed Computing, 2021
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021
An Allreduce Algorithm and Network Co-design for Large-Scale Training of Distributed Deep Learning.
Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021
2020
Proceedings of the Software for Exascale Computing - SPPEXA 2016-2019, 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020
Proceedings of the CGO '20: 18th ACM/IEEE International Symposium on Code Generation and Optimization, 2020
2019
Proceedings of the International Conference for High Performance Computing, 2019
Proceedings of the International Conference for High Performance Computing, 2019
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019
Proceedings of the 38th IEEE International Performance Computing and Communications Conference, 2019
2018
Hierarchical Distributed-Memory Multi-Leader MPI-Allreduce for Deep Learning Workloads.
Proceedings of the Sixth International Symposium on Computing and Networking, 2018
Proceedings of the IEEE International Conference on Cluster Computing, 2018
2017
Proceedings of the Applications of Evolutionary Computation - 20th European Conference, 2017
2016
Proceedings of the International Conference for High Performance Computing, 2016
2015
Proceedings of the 5th Workshop on Irregular Applications - Architectures and Algorithms, 2015
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015
2014
Proceedings of the International Conference for High Performance Computing, 2014
2013
arGA: Adaptive Resolution Micro-genetic Algorithm with Tabu Search to Solve MINLP Problems Using GPU.
Proceedings of the Massively Parallel Evolutionary Computation on GPGPUs, 2013
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013
2011
Proceedings of the World Congress on Services, 2011
Solving Extremely Difficult MINLP Problems Using Adaptive Resolution Micro-GA with Tabu Search.
Proceedings of the Learning and Intelligent Optimization - 5th International Conference, 2011
Proceedings of the IEEE Congress on Evolutionary Computation, 2011
Proceedings of the IEEE Congress on Evolutionary Computation, 2011
2010
The design, usage, and performance of GridUFO: A Grid based Unified Framework for Optimization.
Future Gener. Comput. Syst., 2010
A Light Framework for the Unified Representation and Execution of Variant Tasks in a Grid Based Environment.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2010
A Bayesian Optimization Algorithm for De Novo ligand design based docking running over GPU.
Proceedings of the IEEE Congress on Evolutionary Computation, 2010
2009
Hybrid of genetic algorithm and local search to solve MAX-SAT problem using nVidia CUDA framework.
Genet. Program. Evolvable Mach., 2009
Theoretical and Empirical Analysis of a GPU Based Parallel Bayesian Optimization Algorithm.
Proceedings of the 2009 International Conference on Parallel and Distributed Computing, 2009
2008
Proceedings of the Linkage in Evolutionary Computation, 2008
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008
SOAG: Service Oriented Architectured Grids and adoption of application specific QoS attributes.
Proceedings of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan, September 29, 2008
Model for dynamic grain sizing through compound parallelization for an optimization problem solving grid application.
Proceedings of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan, September 29, 2008
A General Service-Oriented Grid Computing Framework for Global Optimization Problem Solving.
Proceedings of the 2008 IEEE International Conference on Services Computing (SCC 2008), 2008
2007
MHGrid: Towards an Ideal Optimization Environment for Global Optimization Problems Using Grid Computing.
Proceedings of the Eighth International Conference on Parallel and Distributed Computing, 2007