Lionel Eyraud-Dubois

Orcid: 0000-0003-2475-3309

Affiliations:
  • ENS Lyon, France


According to our database, Lionel Eyraud-Dubois authored at least 65 papers between 2001 and 2024.

Bibliography

2024
Optimal Re-Materialization Strategies for Heterogeneous Chains: How to Train Deep Neural Networks with Limited Memory.
ACM Trans. Math. Softw., June, 2024

Tightening I/O Lower Bounds through the Hourglass Dependency Pattern.
Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures, 2024

A 1.25(1+ε)-Approximation Algorithm for Scheduling with Rejection Costs Proportional to Processing Times.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

2023
Data Distribution Schemes for Dense Linear Algebra Factorizations on Any Number of Nodes.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM).
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Rockmate: an Efficient, Fast, Automatic and Generic Tool for Re-materialization in PyTorch.
Proceedings of the International Conference on Machine Learning, 2023

2022
Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach.
Microprocess. Microsystems, November, 2022

Survey on Large Scale Neural Network Training.
CoRR, 2022

I/O-Optimal Algorithms for Symmetric Linear Algebra Kernels.
Proceedings of the SPAA '22: 34th ACM Symposium on Parallelism in Algorithms and Architectures, Philadelphia, PA, USA, July 11, 2022

Symmetric Block-Cyclic Distribution: Fewer Communications Leads to Faster Dense Cholesky Factorization.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

MadPipe: Memory Aware Dynamic Programming Algorithm for Pipelined Model Parallelism.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Survey on Efficient Training of Large Neural Networks.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021
Scheduling on Two Types of Resources: A Survey.
ACM Comput. Surv., 2021

Efficient Combination of Rematerialization and Offloading for Training DNNs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Pipelined Model Parallelism: Complexity Results and Memory Considerations.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021


2020
Analysis of a List Scheduling Algorithm for Task Graphs on Two Types of Resources.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Algorithms for Preemptive Co-scheduling of Kernels on GPUs.
Proceedings of the 27th IEEE International Conference on High Performance Computing, 2020

2D Static Resource Allocation for Compressed Linear Algebra and Communication Constraints.
Proceedings of the 27th IEEE International Conference on High Performance Computing, 2020

Optimal GPU-CPU Offloading Strategies for Deep Neural Network Training.
Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019
Recent Advances in Matrix Partitioning for Parallel Computing on Heterogeneous Platforms.
IEEE Trans. Parallel Distributed Syst., 2019

Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory.
CoRR, 2019

Influence of Tasks Duration Variability on Task-Based Runtime Schedulers.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

Sizing and Partitioning Strategies for Burst-Buffers to Reduce IO Contention.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
Fast approximation algorithms for task-based runtime systems.
Concurr. Comput. Pract. Exp., 2018

What Size Should Your Buffers to Disks be?
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Using Static Allocation Algorithms for Matrix Matrix Multiplication on Multicores and GPUs.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016
Analyzing real cluster data for formulating allocation algorithms in cloud platforms.
Parallel Comput., 2016

A New Approximation Algorithm for Matrix Partitioning in Presence of Strongly Heterogeneous Processors.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Are Static Schedules so Bad? A Case Study on Cholesky Factorization.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources.
Proceedings of the 23rd IEEE International Conference on High Performance Computing, 2016

Cuboid Partitioning for Parallel Matrix Multiplication on Heterogeneous Platforms.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

2015
Parallel Scheduling of Task Trees with Limited Memory.
ACM Trans. Parallel Comput., 2015

Comparison of Static and Runtime Resource Allocation Strategies for Matrix Multiplication.
Proceedings of the 27th International Symposium on Computer Architecture and High Performance Computing, 2015

Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Allocating Jobs with Periodic Demand Variations.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

Column Generation Integer Programming for Allocating Jobs with Periodic Demand Variations.
Proceedings of the Algorithmic Aspects of Cloud Computing - First International Workshop, 2015

2014
Broadcasting on Large Scale Heterogeneous Platforms under the Bounded Multi-Port Model.
IEEE Trans. Parallel Distributed Syst., 2014

Point-to-Point and Congestion Bandwidth Estimation: Experimental Evaluation on PlanetLab Data.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Efficient and robust allocation algorithms in clouds under memory constraints.
Proceedings of the 21st International Conference on High Performance Computing, 2014

2013
Heterogeneous Resource Allocation under Degree Constraints.
IEEE Trans. Parallel Distributed Syst., 2013

Efficient and Robust Allocation Algorithms in Clouds under Memory Constraints.
CoRR, 2013

Optimizing Resource allocation while handling SLA violations in Cloud Computing platforms.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Reliable Service Allocation in Clouds.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Reliable Service Allocation in Clouds with Memory and Capacity Constraints.
Proceedings of the Euro-Par 2013: Parallel Processing Workshops, 2013

2012
Minimizing Weighted Mean Completion Time for Malleable Tasks Scheduling.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011
Broadcasting on Large Scale Heterogeneous Platforms with Connectivity Artifacts under the Bounded Multi-port Model.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Using the Last-Mile Model as a Distributed Scheme for Available Bandwidth Prediction.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010
Allocation of Clients to Multiple Servers on Large Scale Heterogeneous Platforms.
Proceedings of the 18th Euromicro Conference on Parallel, 2010

Broadcasting on large scale heterogeneous platforms under the bounded multi-port model.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009
The Influence of Platform Models on Scheduling Techniques.
Proceedings of the Introduction to Scheduling, 2009

2008
A Distributed Algorithm for Resource Clustering in Large Scale Platforms.
Proceedings of the Principles of Distributed Systems, 12th International Conference, 2008

Scheduling divisible workloads on heterogeneous platforms under bounded multi-port model.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2007
Analysis of Scheduling Algorithms with Reservations.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

A First Step Towards Automatically Building Network Representations.
Proceedings of the Euro-Par 2007, 2007

Assessing the Quality of Automatically Built Network Representations.
Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

2006
A Pragmatic Analysis Of Scheduling Environments On New Computing Platforms.
Int. J. High Perform. Comput. Appl., 2006

2005
Scheduling on large scale distributed platforms: from models to implementations.
Int. J. Found. Comput. Sci., 2005

2004
Bi-criteria algorithm for scheduling jobs on cluster platforms.
Proceedings of the SPAA 2004: Proceedings of the Sixteenth Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2004

Models for Scheduling on Large Scale Platforms: Which Policy for which Application?
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

2002
Programming environments for high-performance Grid computing: the Albatross project.
Future Gener. Comput. Syst., 2002

High Performance Computing on Heterogeneous Clusters with the Madeleine II Communication Library.
Clust. Comput., 2002

2001
Efficient Inter-Device Data-Forwarding in the Madeleine Communication Library.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

