Thomas Ropars

According to our database1, Thomas Ropars authored at least 33 papers between 2007 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
ResPCT: fast checkpointing in non-volatile memory for multi-threaded applications.
Proceedings of the EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5, 2022

2021
CPU overheating prediction in HPC systems.
Concurr. Comput. Pract. Exp., 2021

LogFlow: Simplified Log Analysis for Large Scale Systems.
Proceedings of the ICDCN '21: International Conference on Distributed Computing and Networking, 2021

2020
KDetect: Unsupervised Anomaly Detection for Cloud Systems Based on Time Series Clustering.
Proceedings of the 3rd International Workshop on Systems and Network Telemetry and Analytics, 2020

On the Detection of Silent Data Corruptions in HPC Applications Using Redundant Multi-threading.
Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020

2019
Data and Thread Placement in NUMA Architectures: A Statistical Learning Approach.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
An Efficient Wait-free Resizable Hash Table.
Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures, 2018

CPU Overheating Characterization in HPC Systems: A Case Study.
Proceedings of the IEEE/ACM 8th Workshop on Fault Tolerance for HPC at eXtreme Scale, 2018

2016
Leveraging Hardware Message Passing for Efficient Thread Synchronization.
ACM Trans. Parallel Comput., 2016

Panels: Panel session I: Resiliency in extreme scale high performance computing systems and applications.
Proceedings of the International Conference on High Performance Computing & Simulation, 2016

2015
Efficient Process Replication for MPI Applications: Sharing Work between Replicas.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

On the Performance of Delegation over Cache-Coherent Shared Memory.
Proceedings of the 2015 International Conference on Distributed Computing and Networking, 2015

Addressing the Last Roadblock for Message Logging in HPC: Alleviating the Memory Requirement Using Dedicated Resources.
Proceedings of the Euro-Par 2015: Parallel Processing Workshops, 2015

2014
High-Throughput Maps on Message-Passing Manycore Architectures: Partitioning versus Replication.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
SPBC: leveraging the characteristics of MPI HPC applications for scalable checkpointing.
Proceedings of the International Conference for High Performance Computing, 2013

Replication for send-deterministic MPI HPC applications.
Proceedings of the 3rd Workshop on Fault-tolerance for HPC at extreme scale, 2013

2012
HydEE, vers un protocole de recouvrement arrière hiérarchique pour les machines exascales. De l'exploitation du déterminisme des émissions dans les protocoles de recouvrement arrière.
Tech. Sci. Informatiques, 2012

High-performance RMA-based broadcast on the intel SCC.
Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, 2012

Asynchronous Broadcast on the Intel SCC using Interrupts.
Proceedings of the 6th Many-core Applications Research Community (MARC) Symposium. Proceedings of the 6th MARC Symposium, 2012

HydEE: Failure Containment without Event Logging for Large Scale Send-Deterministic MPI Applications.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Hierarchical Clustering Strategies for Fault Tolerance in Large Scale HPC Systems.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

2011
Active optimistic and distributed message logging for message-passing applications.
Concurr. Comput. Pract. Exp., 2011

Uncoordinated Checkpointing Without Domino Effect for Send-Deterministic MPI Applications.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

On the Use of Cluster-Based Partial Message Logging to Improve Fault Tolerance for MPI HPC Applications.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010
Semias: Self-Healing Active Replication on Top of a Structured Peer-to-Peer Overlay.
Proceedings of the 29th IEEE Symposium on Reliable Distributed Systems (SRDS 2010), New Delhi, Punjab, India, October 31, 2010

Improving Message Logging Protocols Scalability through Distributed Event Logging.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Using a Failure History Service for Reliable Grid Node Information.
Proceedings of the 3PGCIC 2010, 2010

2009
Services et protocoles pour l'exécution fiable d'applications distribuées dans les grilles de calcul. (Services and protocols for reliable execution of distributed applications in computational grids).
PhD thesis, 2009

Active Optimistic Message Logging for Reliable Execution of MPI Applications.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

The Architecture of the XtreemOS Grid Checkpointing Service.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

Reasons for a pessimistic or optimistic message logging protocol in MPI uncoordinated failure, recovery.
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2008
Fault Tolerance in Cluster Federations with O2P-CF.
Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

2007
GAMoSe: An Accurate Monitoring Service For Grid Applications.
Proceedings of the 6th International Symposium on Parallel and Distributed Computing (ISPDC 2007), 2007


  Loading...