Georgios I. Goumas
Orcid: 0000-0001-7811-4831Affiliations:
- National Technical University of Athens (NTUA), Greece
According to our database1,
Georgios I. Goumas
authored at least 99 papers
between 2000 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
Proceedings of the IEEE International Conference on Cluster Computing, 2024
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2024
2023
PARALiA: A Performance Aware Runtime for Auto-tuning Linear Algebra on Heterogeneous Systems.
ACM Trans. Archit. Code Optim., December, 2023
J. Supercomput., April, 2023
DaeMon: Architectural Support for Efficient Data Movement in Fully Disaggregated Systems.
Proc. ACM Meas. Anal. Comput. Syst., March, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the Abstract Proceedings of the 2023 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2023
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Invited paper: An Artificial Matrix Generator for Multi-platform SpMV Performance Analysis.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
2022
SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures.
Proc. ACM Meas. Anal. Comput. Syst., 2022
Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems.
CoRR, 2022
SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems.
CoRR, 2022
Proceedings of the SYSTOR '22: The 15th ACM International Systems and Storage Conference, Haifa, Israel, June 13, 2022
Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures.
Proceedings of the SIGMETRICS/PERFORMANCE '22: ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, Mumbai, India, June 6, 2022
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022
SparseP: Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2022
DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines.
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022
2021
RCU-HTM: A generic synchronization technique for highly efficient concurrent search trees.
Concurr. Comput. Pract. Exp., 2021
Modeling the Scalability of the EuroExa Reconfigurable Accelerators - Preliminary Results - Invited Paper.
Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2021
CoCoPeLia: Communication-Computation Overlap Prediction for Efficient Linear Algebra on GPUs.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021
2020
Comput., 2020
Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020
Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020
2019
J. Syst. Softw., 2019
Proceedings of the 38th Symposium on Reliable Distributed Systems, 2019
Conflict-free symmetric sparse matrix-vector multiplication on multicore architectures.
Proceedings of the International Conference for High Performance Computing, 2019
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019
On the Performance and Energy Efficiency of Sparse Matrix-Vector Multiplication on FPGAs.
Proceedings of the Parallel Computing: Technology Trends, 2019
Proceedings of the 20th International Middleware Conference Demos and Posters, 2019
Proceedings of the 48th International Conference on Parallel Processing, 2019
Proceedings of the 16th ACM International Conference on Computing Frontiers, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019
2018
SparseX: A Library for High-Performance Sparse Matrix-Vector Multiplication on Multicore Platforms.
ACM Trans. Math. Softw., 2018
Future Gener. Comput. Syst., 2018
Proceedings of the High Performance Computing - 33rd International Conference, 2018
Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, 2018
Proceedings of the 2018 IEEE International Conference on Cloud Computing Technology and Science, 2018
Proceedings of the 2018 IEEE International Conference on Cloud Computing Technology and Science, 2018
2017
Proceedings of the IEEE International Symposium on Circuits and Systems, 2017
Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Intel Xeon Phi.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017
Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors.
Proceedings of the 46th International Conference on Parallel Processing, 2017
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017
Improving QoS and Utilisation in modern multi-core servers with Dynamic Cache Partitioning.
Proceedings of the Joined Workshops COSH 2017 and VisorHPC 2017, 2017
BONSEYES: Platform for Open Development of Systems of Artificial Intelligence: Invited paper.
Proceedings of the Computing Frontiers Conference, 2017
RCU-HTM: Combining RCU with HTM to Implement Highly Efficient Concurrent Binary Search Trees.
Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017
2016
Improving virtual host efficiency through resource and interference aware scheduling.
CoRR, 2016
Proceedings of the 24th Euromicro International Conference on Parallel, 2016
Proceedings of the Co-Scheduling of HPC Applications [extended versions of all papers from COSH@HiPEAC 2016, 2016
Proceedings of the 1st COSH Workshop on Co-Scheduling of HPC Applications, 2016
2015
CoRR, 2015
A Machine-Learning Approach for Communication Prediction of Large-Scale Applications.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
CIRANO: An Integrated Programming Environment for Multi-tier Cloud Based Applications.
Proceedings of the 1st International Conference on Cloud Forward: From Distributed to Complete Computing, 2015
2014
Proceedings of the International Conference on Parallel Architectures and Compilation, 2014
2013
An Extended Compression Format for the Optimization of Sparse Matrix-Vector Multiplication.
IEEE Trans. Parallel Distributed Syst., 2013
Improving the Performance of the Symmetric Sparse Matrix-Vector Multiplication in Multicore.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013
2012
User Adaptation in a Hybrid MT System - Feeding User Corrections into Synchronous Grammars and System Dictionaries.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012
Using State-of-the-Art Sparse Matrix Optimizations for Accelerating the Performance of Multiphysics Simulations.
Proceedings of the Applied Parallel and Scientific Computing, 2012
2011
Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011
2010
Exploiting compression opportunities to improve SpMxV performance on shared memory systems.
ACM Trans. Archit. Code Optim., 2010
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010
Exploring I/O Virtualization Data Paths for MPI Applications in a Cluster of VMs: A Networking Perspective.
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010
2009
Performance evaluation of the sparse matrix-vector multiplication on modern architectures.
J. Supercomput., 2009
Nucleic Acids Res., 2009
BMC Bioinform., 2009
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009
Proceedings of the ICPP 2009, 2009
Proceedings of the ICPP 2009, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
A Comparative Study of Blocking Storage Methods for Sparse Matrices on Multicore Architectures.
Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009
Overlapping computation and communication in SMT clusters with commodity interconnects.
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009
2008
Proceedings of the 16th Euromicro International Conference on Parallel, 2008
Evaluation of dynamic scheduling methods in simulations of storm-time ion acceleration.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
Improving the Performance of Multithreaded Sparse Matrix-Vector Multiplication Using Index and Value Compression.
Proceedings of the 2008 International Conference on Parallel Processing, 2008
Proceedings of the 5th Conference on Computing Frontiers, 2008
2007
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
2006
Parallel Comput., 2006
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
2004
Proceedings of the 2004 ACM Symposium on Applied Computing (SAC), 2004
2003
IEEE Trans. Parallel Distributed Syst., 2003
A pipelined schedule to minimize completion time for loop tiling with computation and communication overlapping.
J. Parallel Distributed Comput., 2003
Delivering High Performance to Parallel Applications Using Advanced Scheduling.
Proceedings of the Parallel Computing: Software Technology, 2003
2002
Automatic code generation for executing tiled nested loops onto parallel architectures.
Proceedings of the 2002 ACM Symposium on Applied Computing (SAC), 2002
Data Parallel Code Generation for Arbitrarily Tiled Loop Nests.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002
Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002
2001
Minimizing Completion Time for Loop Tiling with Computation and Communication Overlapping.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001
2000
Proceedings of the 2000 International Conference on Parallel Processing, 2000