Márcio Castro

Concurr. Comput. Pract. Exp., 2019

A comprehensive performance evaluation of the BinLPT workload-aware loop scheduler.

Antônio Tadeu A. Gomes

Patricia Della Méa Plentz

Henrique C. Freitas

François Broquedis

Concurr. Comput. Pract. Exp., 2019

On the Performance and Isolation of Asymmetric Microkernel Design for Lightweight Manycores.

Alexandre de Limas Santana

João Vicente Souto

Davidson Francis Lima

Proceedings of the IX Brazilian Symposium on Computing Systems Engineering, 2019

Distributed Memory Graph Representation for Load Balancing Data: Accelerating Data Structure Generation for Decentralized Scheduling.

Vinicius Freitas

Alexandre de Limas Santana

Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

2018

Reducing Global Schedulers Complexity through Runtime System Decoupling.

Proceedings of the Symposium on High Performance Computing Systems, 2018

A Batch Task Migration Approach for Decentralized Global Rescheduling.

Vinicius Freitas

Alexandre de Limas Santana

Bruno Marques do Nascimento

Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

Energy Efficient Stencil Computations on the Low-Power Manycore MPPA-256 Processor.

Emmanuel Podestá Jr.

Proceedings of the Euro-Par 2018: Parallel Processing, 2018

2017

CAP Bench: a benchmark suite for performance and energy evaluation of low-power many-core processors.

Matheus Alcântara Souza

Matheus M. Queiroz

Alyson D. Pereira

Henrique C. Freitas

Philippe O. A. Navaux

Pedro Henrique de Mello Morado Penna

Concurr. Comput. Pract. Exp., 2017

Design methodology for workload-aware loop scheduling strategies based on genetic algorithm and simulation.

Concurr. Comput. Pract. Exp., 2017

Using the Nanvix Operating System in Undergraduate Operating System Courses.

Márcio Bastos Castro

Henrique Cota de Freitas

Joao Caram

Proceedings of the VII Brazilian Symposium on Computing Systems Engineering, 2017

Towards the Use of LITMUS^RT as a Testbed for Multiprocessor Scheduling in Energy Harvesting Real-Time Systems.

Lais Borin

Patricia Della Méa Plentz

Proceedings of the VII Brazilian Symposium on Computing Systems Engineering, 2017

Automatic Partitioning of Stencil Computations on Heterogeneous Systems.

Proceedings of the 2017 International Symposium on Computer Architecture and High Performance Computing Workshops, 2017

Extending OpenACC for Efficient Stencil Code Generation and Execution by Skeleton Frameworks.

Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017

Enabling efficient stencil code generation in OpenACC.

Proceedings of the International Conference on Computational Science, 2017

Assessing the Performance of the SRR Loop Scheduler with Irregular Workloads.

Eduardo C. Inacio

Patricia Della Méa Plentz

Henrique C. Freitas

François Broquedis

Proceedings of the International Conference on Computational Science, 2017

Performance Improvement of Stencil Computations for Multi-core Architectures based on Machine Learning.

Víctor Martínez

Fabrice Dupros

Madalena Pereira da Silva

Proceedings of the International Conference on Computational Science, 2017

Provisioning and Delivering Sepsis Data Supported by an Enhanced SDN Environment.

Felipe Volpato

Alexandre Leopoldo Gonçalves

Mário Antônio Ribeiro Dantas

Proceedings of the 30th IEEE International Symposium on Computer-Based Medical Systems, 2017

2016

Seismic wave propagation simulations on low-power and performance-centric manycores.

Philippe O. A. Navaux

Parallel Comput., 2016

Exploiting parallelism to speed up circuit legalization.

Renan Netto

Chrystian Guth

Vinicius S. Livramento

José Luís Güntzel

Proceedings of the 2016 IEEE International Conference on Electronics, Circuits and Systems, 2016

A Low-Cost Energy-Efficient Raspberry Pi Cluster for Data Mining Algorithms.

João Saffran

Gabriel Garcia

Matheus Alcântara Souza

Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

Exploration of Load Balancing Thresholds to Save Energy on Iterative Applications.

Edson L. Padoin

Philippe O. A. Navaux

Proceedings of the High Performance Computing - Third Latin American Conference, 2016

2015

On the energy efficiency and performance of irregular application executions on multicore, NUMA and manycore platforms.

J. Parallel Distributed Comput., 2015

Performance/energy trade-off in scientific computing: the case of ARM big.LITTLE and Intel Sandy Bridge.

Edson L. Padoin

Francieli Zanon Boito

IET Comput. Digit. Tech., 2015

2014

Adaptive thread mapping strategies for transactional memory applications.

J. Parallel Distributed Comput., 2014

Automatic Skeleton-Driven Memory Affinity for Transactional Worklist Applications.

Christiane Pousa Ribeiro

Int. J. Parallel Program., 2014

Energy Efficient Seismic Wave Propagation Simulation on a Low-Power Manycore Processor.

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Evaluating the Impact of Transactional Characteristics on the Performance of Transactional Memory Applications.

Fernando Rui

Dalvan Griebler

Luiz Gustavo Fernandes

Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

Saving energy by exploiting residual imbalances on iterative applications.

Edson L. Padoin

Proceedings of the 21st International Conference on High Performance Computing, 2014

2013

Analysis of computing and energy performance of multicore, NUMA, and manycore platforms for an irregular application.

Proceedings of the 3rd Workshop on Irregular Applications - Architectures and Algorithms, 2013

2012

Optimisation de la performance des applications de mémoire transactionnelle sur des plates-formes multicoeurs : une approche basée sur l'apprentissage automatique. (Improving the Performance of Transactional Memory Applications on Multicores: A Machine Learning-based Approach).

PhD thesis, 2012

Dynamic Thread Mapping Based on Machine Learning for Transactional Memory Applications.

Luiz Gustavo Fernandes