Laércio Lima Pilla

Orcid: 0000-0003-0997-586X

Affiliations:
  • University of Bordeaux, France


According to our database1, Laércio Lima Pilla authored at least 63 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of two.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A 1.25(1+ε )-Approximation Algorithm for Scheduling with Rejection Costs Proportional to Processing Times.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

2023
Scheduling Algorithms for Federated Learning With Minimal Energy Consumption.
IEEE Trans. Parallel Distributed Syst., April, 2023

Optimizing performance and energy across problem sizes through a search space exploration and machine learning.
J. Parallel Distributed Comput., 2023

2022
Algorithm Selection Framework for Legalization Using Deep Convolutional Neural Networks and Transfer Learning.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

SimSGamE : Scheduling simulator for modern game engines.
J. Open Source Softw., 2022

Exploring Scheduling Algorithms for Parallel Task Graphs: A Modern Game Engine Case Study.
Proceedings of the Euro-Par 2022: Parallel Processing, 2022

2021
Online Thread and Data Mapping Using a Sharing-Aware Memory Management Unit.
ACM Trans. Model. Perform. Evaluation Comput. Syst., 2021

ARTful: A model for user-defined schedulers targeting multiple high-performance computing runtime systems.
Softw. Pract. Exp., 2021

PackStealLB: A scalable distributed load balancer based on work stealing and workload discretization.
J. Parallel Distributed Comput., 2021

Optimal Task Assignment for Heterogeneous Federated Learning Devices.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

2020
Optimal Task Assignment to Heterogeneous Federated Learning Devices.
CoRR, 2020

Mapping Matters: Application Process Mapping on 3-D Processor Topologies.
CoRR, 2020

Adaptive Load Balancing based on Machine Learning for Iterative Parallel Applications.
Proceedings of the 28th Euromicro International Conference on Parallel, 2020

2019
EagerMap: A Task Mapping Algorithm to Improve Communication and Load Balancing in Clusters of Multicore Systems.
ACM Trans. Parallel Comput., 2019

How Deep Learning Can Drive Physical Synthesis Towards More Predictable Legalization.
Proceedings of the 2019 International Symposium on Physical Design, 2019

Distributed Memory Graph Representation for Load Balancing Data: Accelerating Data Structure Generation for Decentralized Scheduling.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

On server-side file access pattern matching.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

2018
A Cost Model for IaaS Clouds Based on Virtual Machine Energy Consumption.
J. Grid Comput., 2018

MigPF: Towards on self-organizing process rescheduling of Bulk-Synchronous Parallel applications.
Future Gener. Comput. Syst., 2018

A branch and bound strategy for Fast Trajectory Similarity Measuring.
Data Knowl. Eng., 2018

Reducing Global Schedulers Complexity through Runtime System Decoupling.
Proceedings of the Symposium on High Performance Computing Systems, 2018

Samsara Architecture: Exploring Situation Awareness in Cloud Computing Management.
Proceedings of the Symposium on High Performance Computing Systems, 2018

Enhancing Multi-Threaded Legalization Through k-d Tree Circuit Partitioning.
Proceedings of the 31st Symposium on Integrated Circuits and Systems Design, 2018

A Batch Task Migration Approach for Decentralized Global Rescheduling.
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

Improving Communication and Load Balancing with Thread Mapping in Manycore Systems.
Proceedings of the 26th Euromicro International Conference on Parallel, 2018

2017
Experimental and analytical study of Xeon Phi reliability.
Proceedings of the International Conference for High Performance Computing, 2017

Exploiting cache locality to speedup register clustering.
Proceedings of the 30th Symposium on Integrated Circuits and Systems Design: Chip on the Sands, 2017

How Game Engines Can Inspire EDA Tools Development: A use case for an open-source physical design library.
Proceedings of the 2017 ACM on International Symposium on Physical Design, 2017

Radiation-Induced Error Criticality in Modern HPC Parallel Accelerators.
Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

2016
Evaluation and Mitigation of Radiation-Induced Soft Errors in Graphics Processing Units.
IEEE Trans. Computers, 2016

Hardware-Assisted Thread and Data Mapping in Hierarchical Multicore Architectures.
ACM Trans. Archit. Code Optim., 2016

LAPT: A locality-aware page table for thread and data mapping.
Parallel Comput., 2016

Value Reuse Potential in ARM Architectures.
Proceedings of the 28th International Symposium on Computer Architecture and High Performance Computing, 2016

Exploiting parallelism to speed up circuit legalization.
Proceedings of the 2016 IEEE International Conference on Electronics, Circuits and Systems, 2016

A Sharing-Aware Memory Management Unit for Online Mapping in Multi-core Architectures.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

Exploration of Load Balancing Thresholds to Save Energy on Iterative Applications.
Proceedings of the High Performance Computing - Third Latin American Conference, 2016

2015
Characterizing communication and page usage of parallel applications for thread and data mapping.
Perform. Evaluation, 2015

Performance/energy trade-off in scientific computing: the case of ARM big.LITTLE and Intel Sandy Bridge.
IET Comput. Digit. Tech., 2015

Understanding the Effect of Multiple Factors on a Parallel File System's Performance.
Proceedings of the 24th IEEE International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises, 2015

An Efficient Algorithm for Communication-Based Task Mapping.
Proceedings of the 23rd Euromicro International Conference on Parallel, 2015

The Path to Exascale: Code Optimizations and Hardening Solutions Reliability.
Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale, 2015

2014
Topology-Aware Load Balancing for Performance Portability over Parallel High Performance Systems. (Équilibrage de charge prenant en compte la topologie des plates-formes de calcul parallèle pour la portabilité des performances).
PhD thesis, 2014

A topology-aware load balancing algorithm for clustered hierarchical multi-core machines.
Future Gener. Comput. Syst., 2014

Optimizing Memory Locality Using a Locality-Aware Page Table.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Improving the Performance of Seismic Wave Simulations with Dynamic Load Balancing.
Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

Saving energy by exploiting residual imbalances on iterative applications.
Proceedings of the 21st International Conference on High Performance Computing, 2014

Impact of GPUs Parallelism Management on Safety-Critical and HPC Applications Reliability.
Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2014

Radiation Sensitivity of High Performance Computing Applications on Kepler-Based GPGPUs.
Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2014

GPGPUs ECC efficiency and efficacy.
Proceedings of the 2014 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems, 2014

2013
Evaluating application performance and energy consumption on hybrid CPU+GPU architecture.
Clust. Comput., 2013

Neutron sensitivity and software hardening strategies for matrix multiplication and FFT on graphics processing units.
Proceedings of the 3rd Workshop on Fault-tolerance for HPC at extreme scale, 2013

2012
Atmospheric models hybrid OpenMP/MPI implementation multicore cluster evaluation.
Int. J. Inf. Technol. Commun. Convergence, 2012

A Hierarchical Approach for Load Balancing on Parallel Multi-core Systems.
Proceedings of the 41st International Conference on Parallel Processing, 2012

Asymptotically Optimal Load Balancing for Hierarchical Multi-Core Systems.
Proceedings of the 18th IEEE International Conference on Parallel and Distributed Systems, 2012

2011
Improving Performance on Atmospheric Models through a Hybrid OpenMP/MPI Implementation.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2011

Combining Multiple Metrics to Control BSP Process Rescheduling in Response to Resource and Application Dynamics.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

2010
Observing the Impact of Multiple Metrics and Runtime Adaptations on BSP Process Rescheduling.
Parallel Process. Lett., 2010

Applying Process Migration on a BSP-Based LU Decomposition Application.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

Supporting performance and adaptivity on BSP process rescheduling.
Proceedings of the 15th IEEE Symposium on Computers and Communications, 2010

2009
Applying Processes Rescheduling over Irregular BSP Application.
Proceedings of the Computational Science, 2009

MigBSP: A Novel Migration Model for Bulk-Synchronous Parallel Processes Rescheduling.
Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

2008
Controlling Processes Reassignment in BSP Applications.
Proceedings of the 20th International Symposium on Computer Architecture and High Performance Computing, 2008

ICE: Managing Multiple Clusters Using Web Services.
Proceedings of the 11th IEEE International Conference on Computational Science and Engineering, 2008


  Loading...