Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

2016

DReAM: An Approach to Estimate per-Task DRAM Energy in Multicore Systems.

ACM Trans. Design Autom. Electr. Syst., 2016

Thread Assignment in Multicore/Multithreaded Processors: A Statistical Approach.

IEEE Trans. Computers, 2016

Sensible Energy Accounting with Abstract Metering for Multicore Systems.

ACM Trans. Archit. Code Optim., 2016

PARSECSs: Evaluating the Impact of Task Parallelism in the PARSEC Benchmark Suite.

ACM Trans. Archit. Code Optim., 2016

MUSA: a multi-level simulation approach for next-generation HPC machines.

Proceedings of the International Conference for High Performance Computing, 2016

TaskPoint: Sampled simulation of task-based programs.

Proceedings of the 2016 IEEE International Symposium on Performance Analysis of Systems and Software, 2016

CATA: Criticality Aware Task Acceleration for Multicore Processors.

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Runtime-Guided Mitigation of Manufacturing Variability in Power-Constrained Multi-Socket NUMA Nodes.

Proceedings of the 2016 International Conference on Supercomputing, 2016

POSTER: Exploiting Asymmetric Multi-Core Processors with Flexible System Software.

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

Reducing Cache Coherence Traffic with Hierarchical Directory Cache and NUMA-Aware Runtime Scheduling.

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015

Adaptive and application dependent runtime guided hardware prefetcher reconfiguration on the IBM POWER7.

CoRR, 2015

Exploiting asynchrony from exact forward recovery for DUE in iterative solvers.

Proceedings of the International Conference for High Performance Computing, 2015

Evaluating the Impact of OpenMP 4.0 Extensions on Relevant Parallel Workloads.

Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Coherence protocol for transparent management of scratchpad memories in shared memory manycore architectures.

Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015

Runtime-Aware Architectures.

Proceedings of the Euro-Par 2015: Parallel Processing, 2015

Runtime-Guided Management of Scratchpad Memories in Multicore Architectures.

Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014

Runtime-Aware Architectures: A First Approach.

Supercomput. Front. Innov., 2014

Per-task Energy Accounting in Computing Systems.

IEEE Comput. Archit. Lett., 2014

DReAM: Per-Task DRAM Energy Metering in Multicore Systems.

Proceedings of the Euro-Par 2014 Parallel Processing, 2014

Evaluating Execution Time Predictability of Task-Based Programs on Multi-Core Processors.

Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013

Fair CPU time accounting in CMP+SMT processors.

ACM Trans. Archit. Code Optim., 2013

Hardware support for accurate per-task energy metering in multicore systems.

ACM Trans. Archit. Code Optim., 2013

Task mapping in rectangular twisted tori.

Proceedings of the 2013 Spring Simulation Multiconference, SpringSim '13, 2013

A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness.

Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

On the convergence of mainstream and mission-critical markets.

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

Tessellation: refactoring the OS around explicit resource containers with continuous adaptation.

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

2012

CPU Accounting for Multicore Processors.

IEEE Trans. Computers, 2012

Kernel Partitioning of Streaming Applications: A Statistical Approach to an NP-complete Problem.

Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012

Characterizing thread placement in the IBM POWER7 processor.

Stelios Manousopoulos

Proceedings of the 2012 IEEE International Symposium on Workload Characterization, 2012

Optimal task assignment in multithreaded processors: a statistical approach.

Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, 2012

2011

Dynamic Cache Partitioning Based on the MLP of Cache Misses.

Trans. High Perform. Embed. Archit. Compil., 2011

Simulating Whole Supercomputer Applications.

IEEE Micro, 2011

2010

Improving cache behavior in CMP architectures through cache partitioning techniques.

Miquel Moretó

PhD thesis, 2010

Twisted Torus Topologies for Enhanced Interconnection Networks.

IEEE Trans. Parallel Distributed Syst., 2010

Adapting cache partitioning algorithms to pseudo-LRU replacement policies.

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Load balancing using dynamic cache allocation.

Proceedings of the 7th Conference on Computing Frontiers, 2010

2009

FlexDCP: a QoS framework for CMP architectures.

ACM SIGOPS Oper. Syst. Rev., 2009

CPU Accounting in CMP Processors.

IEEE Comput. Archit. Lett., 2009

ITCA: Inter-task Conflict-Aware CPU Accounting for CMPs.

Proceedings of the PACT 2009, 2009

2008

Modeling Toroidal Networks with the Gaussian Integers.

IEEE Trans. Computers, 2008

Multicore Resource Management.

IEEE Micro, 2008

MLP-Aware Dynamic Cache Partitioning.

Proceedings of the High Performance Embedded Architectures and Compilers, 2008

Architecture Performance Prediction Using Evolutionary Artificial Neural Networks.

Pedro A. Castillo

Antonio Miguel Mora

Juan Julián Merelo Guervós

Juan Luis Jiménez Laredo

Proceedings of the Applications of Evolutionary Computing, 2008

Evolutionary system for prediction and optimization of hardware architecture performance.

Pedro Ángel Castillo Valdivieso

Juan Julián Merelo Guervós

Juan Luis Jiménez Laredo

Sally A. McKee

Proceedings of the IEEE Congress on Evolutionary Computation, 2008

2007

Explaining Dynamic Cache Partitioning Speed Ups.

IEEE Comput. Archit. Lett., 2007

Online Prediction of Applications Cache Utility.

Proceedings of the 2007 International Conference on Embedded Computer Systems: Architectures, 2007

Mixed-radix Twisted Torus Interconnection Networks.

Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006

Dense Gaussian Networks: Suitable Topologies for On-Chip Multiprocessors.

Int. J. Parallel Program., 2006

A Generalization of Perfect Lee Codes over Gaussian Integers.

Proceedings of the 2006 IEEE International Symposium on Information Theory, 2006

Miquel Moretó
