Dimosthenis Masouros

Orcid: 0000-0001-6147-6908

According to our database1, Dimosthenis Masouros authored at least 48 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
AI-Driven QoS-Aware Scheduling for Serverless Video Analytics at the Edge.
Inf., August, 2024

Orchestration Extensions for Interference- and Heterogeneity-Aware Placement for Data-Analytics.
Int. J. Parallel Program., August, 2024

CollectiveHLS: Ultrafast Knowledge-Based HLS Design Optimization.
IEEE Embed. Syst. Lett., June, 2024

SLO-aware GPU Frequency Scaling for Energy Efficient LLM Inference Serving.
CoRR, 2024

Leveraging Core and Uncore Frequency Scaling for Power-Efficient Serverless Workflows.
CoRR, 2024

SLO-Aware GPU DVFS for Energy-Efficient LLM Inference Serving.
IEEE Comput. Archit. Lett., 2024

Disaggregated RDDs: Extending and Analyzing Apache Spark for Memory Disaggregated Infrastructures.
Proceedings of the IEEE International Conference on Cloud Engineering, 2024

Decoupled Access-Execute Enabled DVFS for TinyML Deployments on STM32 Microcontrollers.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

Late Breaking Results: Language-level QoR modeling for High-Level Synthesis.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

Data-driven HLS optimization for reconfigurable accelerators.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

2023
DVFaaS: Leveraging DVFS for FaaS Workflows.
IEEE Comput. Archit. Lett., 2023


COMPSYS 2023 Invited Speaker Towards ML-Driven Resource Orchestration in Disaggregated Memory Systems: Challenges and Opportunities.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

On the Implications of Heterogeneous Memory Tiering on Spark In-Memory Analytics.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023


Adrias: Interference-Aware Memory Orchestration for Disaggregated Cloud Infrastructures.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

Adjacent LSTM-Based Page Scheduling for Hybrid DRAM/NVM Memory Systems.
Proceedings of the 14th Workshop on Parallel Programming and Run-Time Management Techniques for Many-Core Architectures and 12th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms, 2023

RoaD-RuNNer: Collaborative DNN partitioning and offloading on heterogeneous edge systems.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

The SERRANO platform: Stepping towards seamless application development & deployment in the heterogeneous edge-cloud continuum.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

Darly: Deep Reinforcement Learning for QoS-aware scheduling under resource heterogeneity Optimizing serverless video analytics.
Proceedings of the 16th IEEE International Conference on Cloud Computing, 2023

IRIS: Interference and Resource Aware Predictive Orchestration for ML Inference Serving.
Proceedings of the 16th IEEE International Conference on Cloud Computing, 2023

2022
HLF-Kubed: Blockchain-Based Resource Monitoring for Edge Clusters.
Ledger, 2022

HW/SW Acceleration of Multiple Workloads Within the SERRANO's Computing Continuum - Invited Paper.
Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2022

LSTM Acceleration with FPGA and GPU Devices for Edge Computing Applications in B5G MEC.
Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2022

Optimizing Savitzky-Golay Filter on GPU and FPGA Accelerators for Financial Applications.
Proceedings of the 11th International Conference on Modern Circuits and Systems Technologies, 2022


SGRM: Stackelberg Game-Based Resource Management for Edge Computing Systems.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

Towards making the most of NLP-based device mapping optimization for OpenCL kernels.
Proceedings of the IEEE International Conference on Omni-layer Intelligent Systems, 2022

Sequence Clock: A Dynamic Resource Orchestrator for Serverless Architectures.
Proceedings of the IEEE 15th International Conference on Cloud Computing, 2022

2021
Rusty: Runtime Interference-Aware Predictive Monitoring for Modern Multi-Tenant Systems.
IEEE Trans. Parallel Distributed Syst., 2021

FaaS and Curious: Performance Implications of Serverless Functions on Edge Computing Platforms.
Proceedings of the High Performance Computing - ISC High Performance Digital 2021 International Workshops, Frankfurt am Main, Germany, June 24, 2021

Leveraging HW Approximation for Exploiting Performance-Energy Trade-offs Within the Edge-Cloud Computing Continuum.
Proceedings of the High Performance Computing - ISC High Performance Digital 2021 International Workshops, Frankfurt am Main, Germany, June 24, 2021

Interference-Aware Workload Placement for Improving Latency Distribution of Converged HPC/Big Data Cloud Infrastructures.
Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2021

Towards Efficient HW Acceleration in Edge-Cloud Infrastructures: The SERRANO Approach - Invited Paper.
Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2021


Resource Aware GPU Scheduling in Kubernetes Infrastructure.
Proceedings of the 12th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 10th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms, 2021

FPGA acceleration in EVOLVE's Converged Cloud-HPC Infrastructure.
Proceedings of the 31st International Conference on Field-Programmable Logic and Applications, 2021

Performance Analysis and Auto-tuning for SPARK in-memory analytics.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

2020
Fast Operation Mode Selection for Highly Efficient IoT Edge Devices.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Interference-Aware Orchestration in Kubernetes.
Proceedings of the High Performance Computing, 2020

Exploration of GPU sharing policies under GEMM workloads.
Proceedings of the SCOPES '20: 23rd International Workshop on Software and Compilers for Embedded Systems, 2020

2019
Rusty: Runtime System Predictability Leveraging LSTM Neural Networks.
IEEE Comput. Archit. Lett., 2019

Co-design Implications of Cost-effective On-demand Acceleration for Cloud Healthcare Analytics: The AEGLE approach.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

DMRM: Distributed Market-Based Resource Management of Edge Computing Systems.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

2018
A Hierarchical Distributed Runtime Resource Management Scheme for NoC-Based Many-Cores.
ACM Trans. Embed. Comput. Syst., 2018

2017
SoftRM: Self-Organized Fault-Tolerant Resource Management for Failure Detection and Recovery in NoC Based Many-Cores.
ACM Trans. Embed. Comput. Syst., 2017

From edge to cloud: Design and implementation of a healthcare Internet of Things infrastructure.
Proceedings of the 27th International Symposium on Power and Timing Modeling, 2017

AEGLE's Cloud Infrastructure for Resource Monitoring and Containerized Accelerated Analytics.
Proceedings of the 2017 IEEE Computer Society Annual Symposium on VLSI, 2017


  Loading...