Georgios I. Goumas

Dionisios N. Pnevmatikatos

Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

2021

RCU-HTM: A generic synchronization technique for highly efficient concurrent search trees.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2021

Modeling the Scalability of the EuroExa Reconfigurable Accelerators - Preliminary Results - Invited Paper.

[BibT_eX]

[DOI]

Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2021

CoCoPeLia: Communication-Computation Overlap Prediction for Efficient Linear Algebra on GPUs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021

Online Weight Pruning Via Adaptive Sparsity Loss.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

SynCron: Efficient Synchronization Support for Near-Data-Processing Architectures.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

2020

Weight Pruning via Adaptive Sparsity Loss.

[BibT_eX]

[DOI]

CoRR, 2020

Leveraging Blockchain Technology to Break the Cloud Computing Market Monopoly.

[BibT_eX]

[DOI]

Comput., 2020

Efficient Concurrent Range Queries in B+-trees using RCU-HTM.

[BibT_eX]

[DOI]

Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020

Enhancing and Exploiting Contiguity for Fast Memory Virtualization.

[BibT_eX]

[DOI]

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

2019

Efficient accelerator sharing in virtualized environments: A Xeon Phi use-case.

[BibT_eX]

[DOI]

Stefanos Gerangelos

J. Syst. Softw., 2019

Building Ad-Hoc Clouds with CloudAgora.

[BibT_eX]

[DOI]

Proceedings of the 38th Symposium on Reliable Distributed Systems, 2019

Conflict-free symmetric sparse matrix-vector multiplication on multicore architectures.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2019

BASMAT: bottleneck-aware sparse matrix-vector multiplication auto-tuning on GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

On the Performance and Energy Efficiency of Sparse Matrix-Vector Multiplication on FPGAs.

[BibT_eX]

[DOI]

Proceedings of the Parallel Computing: Technology Trends, 2019

ACTiManager: An end-to-end interference-aware cloud resource manager.

[BibT_eX]

[DOI]

Proceedings of the 20th International Middleware Conference Demos and Posters, 2019

DICER: Diligent Cache Partitioning for Efficient Workload Consolidation.

[BibT_eX]

[DOI]

Proceedings of the 48th International Conference on Parallel Processing, 2019

CloudAgora: Democratizing the Cloud.

[BibT_eX]

[DOI]

Proceedings of the Blockchain - ICBC 2019, 2019

An adaptive concurrent priority queue for NUMA architectures.

[BibT_eX]

[DOI]

Proceedings of the 16th ACM International Conference on Computing Frontiers, 2019

RecNets: Channel-wise Recurrent Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018

SparseX: A Library for High-Performance Sparse Matrix-Vector Multiplication on Multicore Platforms.

[BibT_eX]

[DOI]

Theodoros Gkountouvas

ACM Trans. Math. Softw., 2018

A distributed modular platform for the development of cloud based applications.

[BibT_eX]

[DOI]

Aristidis Sotiropoulos

Ilias Maglogiannis

Future Gener. Comput. Syst., 2018

Combining HTM with RCU to Speed Up Graph Coloring on Multicore Platforms.

[BibT_eX]

[DOI]

Christina Giannoula

Proceedings of the High Performance Computing - 33rd International Conference, 2018

Efficient resource management for data centers: the ACTiCLOUD approach.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, 2018

RACCEX: Towards Remote Accelerated Computing Environments.

[BibT_eX]

[DOI]

Konstantinos Fertakis

Stefanos Gerangelos

Proceedings of the 2018 IEEE International Conference on Cloud Computing Technology and Science, 2018

Performance Prediction of NUMA Placement: A Machine-Learning Approach.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Cloud Computing Technology and Science, 2018

2017

Predictive communication modeling for HPC applications.

[BibT_eX]

[DOI]

Nikela Papadopoulou

Clust. Comput., 2017

An efficient and fair scheduling policy for multiprocessor platforms.

[BibT_eX]

[DOI]

Theodoros Marinakis

Konstantinos Nikas

Iraklis Anagnostopoulos

Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Intel Xeon Phi.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors.

[BibT_eX]

[DOI]

Maria del Milagro Fernández-Carrobles

Proceedings of the 46th International Conference on Parallel Processing, 2017

ACTiCLOUD: Enabling the Next Generation of Cloud Applications.

[BibT_eX]

[DOI]

Proceedings of the 37th IEEE International Conference on Distributed Computing Systems, 2017

Improving QoS and Utilisation in modern multi-core servers with Dynamic Cache Partitioning.

[BibT_eX]

[DOI]

Proceedings of the Joined Workshops COSH 2017 and VisorHPC 2017, 2017

BONSEYES: Platform for Open Development of Systems of Artificial Intelligence: Invited paper.

[BibT_eX]

[DOI]

Tim Llewellynn

Proceedings of the Computing Frontiers Conference, 2017

RCU-HTM: Combining RCU with HTM to Implement Highly Efficient Concurrent Binary Search Trees.

[BibT_eX]

[DOI]

Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017

2016

Improving virtual host efficiency through resource and interference aware scheduling.

[BibT_eX]

[DOI]

CoRR, 2016

Massively Concurrent Red-Black Trees with Hardware Transactional Memory.

[BibT_eX]

[DOI]

Proceedings of the 24th Euromicro International Conference on Parallel, 2016

Contention-Aware Scheduling Policies for Fairness and Throughput.

[BibT_eX]

[DOI]

Proceedings of the Co-Scheduling of HPC Applications [extended versions of all papers from COSH@HiPEAC 2016, 2016

A resource-centric Application Classification Approach.

[BibT_eX]

[DOI]

Konstantinos Nikas

Proceedings of the 1st COSH Workshop on Co-Scheduling of HPC Applications, 2016

2015

A lightweight optimization selection method for Sparse Matrix-Vector Multiplication.

[BibT_eX]

[DOI]

CoRR, 2015

A Machine-Learning Approach for Communication Prediction of Large-Scale Applications.

[BibT_eX]

[DOI]

Nikela Papadopoulou

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

CIRANO: An Integrated Programming Environment for Multi-tier Cloud Based Applications.

[BibT_eX]

[DOI]

Dimitrios Athanasiadis

Ilias Maglogiannis

Proceedings of the 1st International Conference on Cloud Forward: From Distributed to Complete Computing, 2015

2014

LCA: a memory link and cache-aware co-scheduling approach for CMPs.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

An Extended Compression Format for the Optimization of Sparse Matrix-Vector Multiplication.

[BibT_eX]

[DOI]

Theodoros Gkountouvas

IEEE Trans. Parallel Distributed Syst., 2013

Improving the Performance of the Symmetric Sparse Matrix-Vector Multiplication in Multicore.

[BibT_eX]

[DOI]

Theodoros Gkountouvas

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

2012

User Adaptation in a Hybrid MT System - Feeding User Corrections into Synchronous Grammars and System Dictionaries.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Using State-of-the-Art Sparse Matrix Optimizations for Accelerating the Performance of Multiphysics Simulations.

[BibT_eX]

[DOI]

Proceedings of the Applied Parallel and Scientific Computing, 2012

2011

CSX: an extended compression format for spmv on shared memory systems.

[BibT_eX]

[DOI]

Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011

2010

Exploiting compression opportunities to improve SpMxV performance on shared memory systems.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2010

Solving the advection PDE on the cell broadband engine.

[BibT_eX]

[DOI]

Georgios Rokos

Gerassimos Peteinatos

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Exploring I/O Virtualization Data Paths for MPI Applications in a Cluster of VMs: A Networking Perspective.

[BibT_eX]

[DOI]

Anastassios Nanos

Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

2009

Communication-Aware Supernode Shape.

[BibT_eX]

[DOI]

Nikolaos Drosinos

IEEE Trans. Parallel Distributed Syst., 2009

Performance evaluation of the sparse matrix-vector multiplication on modern architectures.

[BibT_eX]

[DOI]

J. Supercomput., 2009

DIANA-microT web server: elucidating microRNA functions through target prediction.

[BibT_eX]

[DOI]

Giorgos L. Papadopoulos

Artemis G. Hatzigeorgiou

Nucleic Acids Res., 2009

Accurate microRNA target prediction correlates with protein repression levels.

[BibT_eX]

[DOI]

Manolis Maragkakis

Panagiotis Alexiou

Giorgos L. Papadopoulos

Artemis G. Hatzigeorgiou

BMC Bioinform., 2009

Exploring the effect of block shapes on the performance of sparse kernels.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Early experiences on accelerating Dijkstra's algorithm using transactional memory.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Employing Transactional Memory and Helper Threads to Speedup Dijkstra's Algorithm.

[BibT_eX]

[DOI]

Proceedings of the ICPP 2009, 2009

Perfomance Models for Blocked Sparse Matrix-Vector Multiplication Kernels.

[BibT_eX]

[DOI]

Proceedings of the ICPP 2009, 2009

GridNews: A distributed automatic Greek broadcast transcription system.

[BibT_eX]

[DOI]

Dimitrios Dimitriadis

Proceedings of the IEEE International Conference on Acoustics, 2009

A Comparative Study of Blocking Storage Methods for Sparse Matrices on Multicore Architectures.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

Overlapping computation and communication in SMT clusters with commodity interconnects.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2008

Understanding the Performance of Sparse Matrix-Vector Multiplication.

[BibT_eX]

[DOI]

Proceedings of the 16th Euromicro International Conference on Parallel, 2008

Evaluation of dynamic scheduling methods in simulations of storm-time ion acceleration.

[BibT_eX]

[DOI]

Ioannis Riakiotakis

Fiori-Anastasia Metallinou

Ioannis A. Daglis

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Improving the Performance of Multithreaded Sparse Matrix-Vector Multiplication Using Index and Value Compression.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Parallel Processing, 2008

Optimizing sparse matrix-vector multiplication using index and value compression.

[BibT_eX]

[DOI]

Proceedings of the 5th Conference on Computing Frontiers, 2008

2007

Coarse-grain Parallel Execution for 2-dimensional PDE Problems.

[BibT_eX]

[DOI]

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006

Message-passing code generation for non-rectangular tiling transformations.

[BibT_eX]

[DOI]

Parallel Comput., 2006

Selecting the tile shape to reduce the total communication volume.

[BibT_eX]

[DOI]

Nikolaos Drosinos

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2004

Automatic parallel code generation for tiled nested loops.

[BibT_eX]

[DOI]

Proceedings of the 2004 ACM Symposium on Applied Computing (SAC), 2004

2003

An Efficient Code Generation Technique for Tiled Iteration Spaces.

[BibT_eX]

[DOI]

Maria Athanasaki

IEEE Trans. Parallel Distributed Syst., 2003

A pipelined schedule to minimize completion time for loop tiling with computation and communication overlapping.

[BibT_eX]

[DOI]

Aristidis Sotiropoulos

J. Parallel Distributed Comput., 2003

Delivering High Performance to Parallel Applications Using Advanced Scheduling.

[BibT_eX]

Proceedings of the Parallel Computing: Software Technology, 2003

2002

Code Generation Methods for Tiling Transformations .

[BibT_eX]

[DOI]

Maria Athanasaki

J. Inf. Sci. Eng., 2002

Automatic code generation for executing tiled nested loops onto parallel architectures.

[BibT_eX]

[DOI]

Maria Athanasaki

Proceedings of the 2002 ACM Symposium on Applied Computing (SAC), 2002

Data Parallel Code Generation for Arbitrarily Tiled Loop Nests.

[BibT_eX]

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2002

Compiling Tiled Iteration Spaces for Clusters.

[BibT_eX]

[DOI]

Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002

2001

Minimizing Completion Time for Loop Tiling with Computation and Communication Overlapping.

[BibT_eX]

[DOI]

Aristidis Sotiropoulos