Show simple item record

dc.contributor.authorMeyer, Hugo
dc.contributor.authorMuresano, Ronal
dc.contributor.authorCastro-León, Marcela
dc.contributor.authorRexachs, Dolores
dc.contributor.authorLuque, Emilio
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2017-03-06T14:38:37Z
dc.date.available2019-06-01T02:31:40Z
dc.date.issued2017-06-01
dc.identifier.citationMeyer, H. [et al.]. Hybrid Message Pessimistic Logging. Improving current pessimistic message logging protocols. "Journal of Parallel and Distributed Computing", 1 Juny 2017, vol. 104, p. 206-222.
dc.identifier.issn0743-7315
dc.identifier.urihttp://hdl.handle.net/2117/101973
dc.description.abstractWith the growing scale of HPC applications, there has been an increase in the number of interruptions as a consequence of hardware failures. The remarkable decrease of Mean Time Between Failures (MTBF) in current systems encourages the research of suitable fault tolerance solutions. Message logging combined with uncoordinated checkpoint compose a scalable rollback-recovery solution. However, message logging techniques are usually responsible for most of the overhead during failure-free executions. Taking this into consideration, this paper proposes the Hybrid Message Pessimistic Logging (HMPLHMPL) which focuses on combining the fast recovery feature of pessimistic receiver-based message logging with the low failure-free overhead introduced by pessimistic sender-based message logging. The HMPLHMPL manages messages using a distributed controller and storage to avoid harming system’s scalability. Experiments show that the HMPLHMPL is able to reduce overhead by 34% during failure-free executions and 20% in faulty executions when compared with a pessimistic receiver-based message logging.
dc.description.sponsorshipThis research has been supported by the MINECO (MICINN) Spain under contracts TIN2011-24384 and TIN2014-53172-P.
dc.format.extent17 p.
dc.language.isoeng
dc.publisherElsevier
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Enginyeria electrònica
dc.subject.lcshScalability of computer networks
dc.subject.lcshFault-tolerant computing
dc.subject.lcshAvailability, Systems
dc.subject.otherFault tolerance
dc.subject.otherAvailability
dc.subject.otherScalability
dc.subject.otherPerformance
dc.subject.otherMPI
dc.subject.otherMessage logging
dc.titleHybrid Message Pessimistic Logging. Improving current pessimistic message logging protocols
dc.typeArticle
dc.subject.lemacSupercomputadors
dc.identifier.doi10.1016/j.jpdc.2017.02.003
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://www.sciencedirect.com/science/article/pii/S0743731517300515
dc.rights.accessOpen Access
dc.description.versionPostprint (author's final draft)
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO/1PE/TIN2014-53172-P
local.citation.publicationNameJournal of Parallel and Distributed Computing
local.citation.volume104
local.citation.startingPage206
local.citation.endingPage222


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivs 3.0 Spain
Except where otherwise noted, content on this work is licensed under a Creative Commons license : Attribution-NonCommercial-NoDerivs 3.0 Spain