Extending the scope of the Checkpoint-on-Failure protocol for forward recovery in standard MPI.

Concurr. Comput. Pract. Exp., 25 (17): 2381-2393 (2013)


Other publications of authors with the same name

Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems. IPDPS, pages 607-618. IEEE Computer Society, (2012)

Probabilistic verification of sensor networks. RIVF, pages 45-54. IEEE, (2006)

When to checkpoint at the end of a fixed-length reservation? SC Workshops, pages 466-476. ACM, (2023)

Software-Defined Events through PAPI. IPDPS Workshops, pages 363-372. IEEE, (2019)

DAGuE: A Generic Distributed DAG Engine for High Performance Computing. IPDPS Workshops, pages 1151-1158. IEEE, (2011)

A Distributed and Replicated Service for Checkpoint Storage. CoreGRID Workshop - Making Grids Work, pages 295-306. Springer, (2007)

Hybrid Preemptive Scheduling of Message Passing Interface Applications on Grids. Int. J. High Perform. Comput. Appl., 20 (1): 77-90 (2006)

Overhead of using spare nodes. Int. J. High Perform. Comput. Appl., (2020)

From MPI to OpenSHMEM: Porting LAMMPS. OpenSHMEM, volume 9397 of Lecture Notes in Computer Science, pages 121-137. Springer, (2015)

Process Distance-Aware Adaptive MPI Collective Communications. CLUSTER, pages 196-204. IEEE Computer Society, (2011)