Author of the publication

Exploring Partial Replication to Improve Lightweight Silent Data Corruption Detection for HPC Applications.

, , , , and . Euro-Par, volume 9833 of Lecture Notes in Computer Science, page 419-430. Springer, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Interpretable Modeling of Deep Reinforcement Learning Driven Scheduling., , and . MASCOTS, page 1-8. IEEE, (2023)Adaptive Metric-Aware Job Scheduling for Production Supercomputers., , , and . ICPP Workshops, page 107-115. IEEE Computer Society, (2012)MRSch: Multi-Resource Scheduling for HPC., , , , , , and . CLUSTER, page 47-57. IEEE, (2022)A Dynamic Power Capping Library for HPC Applications., , , and . CLUSTER, page 797-798. IEEE, (2021)Automatic and coordinated job recovery for high performance computing., , , and . MTAGS@SC, page 1-9. IEEE Computer Society, (2010)Prophesy: An Infrastructure for Analyzing and Modeling the Performance of Parallel and Distributed Applications., , , , , , , and . HPDC, page 302-303. IEEE Computer Society, (2000)System log pre-processing to improve failure prediction., , , and . DSN, page 572-577. IEEE Computer Society, (2009)Dynamic Load Balancing for Structured Adaptive Mesh Refinement Applications., , and . ICPP, page 571-579. IEEE Computer Society, (2001)Improving Job Scheduling on Production Supercomputers., , and . IPDPS Workshops, page 2073-2076. IEEE, (2011)Interpretable Modeling of Deep Reinforcement Learning Driven Scheduling., , and . CoRR, (2024)