Lionel Eyraud-Dubois

Jean-Alexandre Collin

Alberto Riccardo Martinelli

Mathieu Vérité

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM).

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Rockmate: an Efficient, Fast, Automatic and Generic Tool for Re-materialization in PyTorch.

Proceedings of the International Conference on Machine Learning, 2023

2022

Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach.

Pier Stanislao Paolucci

Microprocess. Microsystems, November, 2022

Survey on Large Scale Neural Network Training.

CoRR, 2022

I/O-Optimal Algorithms for Symmetric Linear Algebra Kernels.

Proceedings of the SPAA '22: 34th ACM Symposium on Parallelism in Algorithms and Architectures, Philadelphia, PA, USA, July 11, 2022

Symmetric Block-Cyclic Distribution: Fewer Communications Leads to Faster Dense Cholesky Factorization.

Proceedings of the SC22: International Conference for High Performance Computing, 2022

MadPipe: Memory Aware Dynamic Programming Algorithm for Pipelined Model Parallelism.

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Survey on Efficient Training of Large Neural Networks.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021

Scheduling on Two Types of Resources: A Survey.

ACM Comput. Surv., 2021

Efficient Combination of Rematerialization and Offloading for Training DNNs.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Pipelined Model Parallelism: Complexity Results and Memory Considerations.

Alberto Riccardo Martinelli

Proceedings of the Euro-Par 2021: Parallel Processing, 2021

TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale.

Pier Stanislao Paolucci

Proceedings of the 24th Euromicro Conference on Digital System Design, 2021

2020

Analysis of a List Scheduling Algorithm for Task Graphs on Two Types of Resources.

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Algorithms for Preemptive Co-scheduling of Kernels on GPUs.

Cristiana Bentes

Proceedings of the 27th IEEE International Conference on High Performance Computing, 2020

2D Static Resource Allocation for Compressed Linear Algebra and Communication Constraints.

Mathieu Vérité

Proceedings of the 27th IEEE International Conference on High Performance Computing, 2020

Optimal GPU-CPU Offloading Strategies for Deep Neural Network Training.

Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019

Recent Advances in Matrix Partitioning for Parallel Computing on Heterogeneous Platforms.

Alexey L. Lastovetsky

IEEE Trans. Parallel Distributed Syst., 2019

Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory.

CoRR, 2019

Influence of Tasks Duration Variability on Task-Based Runtime Schedulers.

Yihong Gao

Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

Sizing and Partitioning Strategies for Burst-Buffers to Reduce IO Contention.

Guillaume Aupy

Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels.

Sriram Krishnamoorthy

Proceedings of the 48th International Conference on Parallel Processing, 2019

2018

Fast approximation algorithms for task-based runtime systems.

Concurr. Comput. Pract. Exp., 2018

What Size Should Your Buffers to Disks be?

Guillaume Aupy

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Using Static Allocation Algorithms for Matrix Matrix Multiplication on Multicores and GPUs.

Thomas Lambert

Proceedings of the 47th International Conference on Parallel Processing, 2018

2017

Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016

Analyzing real cluster data for formulating allocation algorithms in cloud platforms.

Juan Ángel Lorenzo del Castillo

Parallel Comput., 2016

A New Approximation Algorithm for Matrix Partitioning in Presence of Strongly Heterogeneous Processors.

Thomas Lambert

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Are Static Schedules so Bad? A Case Study on Cholesky Factorization.

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources.

Proceedings of the 23rd IEEE International Conference on High Performance Computing, 2016

Cuboid Partitioning for Parallel Matrix Multiplication on Heterogeneous Platforms.

Thomas Lambert

Proceedings of the Euro-Par 2016: Parallel Processing, 2016

2015

Parallel Scheduling of Task Trees with Limited Memory.

ACM Trans. Parallel Comput., 2015

Comparison of Static and Runtime Resource Allocation Strategies for Matrix Multiplication.

Proceedings of the 27th International Symposium on Computer Architecture and High Performance Computing, 2015

Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms.

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Allocating Jobs with Periodic Demand Variations.

Ikbel Belaid

Juan Ángel Lorenzo del Castillo

Proceedings of the Euro-Par 2015: Parallel Processing, 2015

Column Generation Integer Programming for Allocating Jobs with Periodic Demand Variations.

Ikbel Belaid

Proceedings of the Algorithmic Aspects of Cloud Computing - First International Workshop, 2015

2014

Broadcasting on Large Scale Heterogeneous Platforms under the Bounded Multi-Port Model.

Shailesh Kumar Agrawal

IEEE Trans. Parallel Distributed Syst., 2014

Point-to-Point and Congestion Bandwidth Estimation: Experimental Evaluation on PlanetLab Data.

Przemyslaw Uznanski

Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Efficient and robust allocation algorithms in clouds under memory constraints.

Proceedings of the 21st International Conference on High Performance Computing, 2014

2013

Heterogeneous Resource Allocation under Degree Constraints.

Christopher Thraves Caro

Hejer Rejeb

IEEE Trans. Parallel Distributed Syst., 2013

Efficient and Robust Allocation Algorithms in Clouds under Memory Constraints.

Paul Renaud-Goud

CoRR, 2013

Optimizing Resource allocation while handling SLA violations in Cloud Computing platforms.

Hubert Larchevêque

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Reliable Service Allocation in Clouds.

Hubert Larchevêque

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Reliable Service Allocation in Clouds with Memory and Capacity Constraints.

Proceedings of the Euro-Par 2013: Parallel Processing Workshops, 2013

2012

Minimizing Weighted Mean Completion Time for Malleable Tasks Scheduling.

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011

Broadcasting on Large Scale Heterogeneous Platforms with Connectivity Artifacts under the Bounded Multi-port Model.

Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Using the Last-Mile Model as a Distributed Scheme for Available Bandwidth Prediction.

Young J. Won

Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010

Allocation of Clients to Multiple Servers on Large Scale Heterogeneous Platforms.

Proceedings of the 18th Euromicro Conference on Parallel, 2010

Broadcasting on large scale heterogeneous platforms under the bounded multi-port model.

Shailesh Kumar Agrawal

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009

The Influence of Platform Models on Scheduling Techniques.

Arnaud Legrand

Introduction to Scheduling, 2009

2008

A Distributed Algorithm for Resource Clustering in Large Scale Platforms.

Proceedings of the Principles of Distributed Systems, 12th International Conference, 2008

Scheduling divisible workloads on heterogeneous platforms under bounded multi-port model.

Nicolas Bonichon

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2007

Analysis of Scheduling Algorithms with Reservations.

Grégory Mounié

Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

A First Step Towards Automatically Building Network Representations.

Proceedings of the Euro-Par 2007, 2007

Assessing the Quality of Automatically Built Network Representations.

Martin Quinson

Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

2006

A Pragmatic Analysis Of Scheduling Environments On New Computing Platforms.

Int. J. High Perform. Comput. Appl., 2006

2005

Scheduling on large scale distributed platforms: from models to implementations.

Pierre-François Dutot

Grégory Mounié

Int. J. Found. Comput. Sci., 2005

2004

Bi-criteria algorithm for scheduling jobs on cluster platforms.

Pierre-François Dutot

Grégory Mounié

Proceedings of the SPAA 2004: Proceedings of the Sixteenth Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2004

Models for Scheduling on Large Scale Platforms: Which Policy for which Application?

Pierre-François Dutot

Grégory Mounié

Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

2002

Programming environments for high-performance Grid computing: the Albatross project.

Future Gener. Comput. Syst., 2002

High Performance Computing on Heterogeneous Clusters with the Madeleine II Communication Library.

Clust. Comput., 2002

2001

Efficient Inter-Device Data-Forwarding in the Madeleine Communication Library.

Olivier Aumage