    • Unified fault-tolerance framework for hybrid task-parallel message-passing applications 

      Subasi, Omer; Martsinkevich, Tatiana; Zyulkyarov, Ferad; Unsal, Osman Sabri; Labarta Mancho, Jesús José; Cappello, Franck (SAGE Publications, 2016-09-26)
      Article
      Open Access
      We present a unified fault-tolerance framework for task-parallel message-passing applications to mitigate transient errors. First, we propose a fault-tolerant message-logging protocol that only requires the restart of the ...