Mostra el registre d'ítem simple

dc.contributor.authorSirvent Pardell, Raül
dc.contributor.authorBadia Sala, Rosa Maria
dc.contributor.authorLabarta Mancho, Jesús José
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2014-11-06T18:42:33Z
dc.date.created2009
dc.date.issued2009
dc.identifier.citationSirvent, R.; Badia, R.M.; Labarta, J. Graph-based task replication for workflow applications. A: IEEE International Conference on High Performance Computing and Communications. "2009 11th IEEE international conference on high performance computing and communications: 25-27 June, Seoul, Korea: proceedings". Seül: Institute of Electrical and Electronics Engineers (IEEE), 2009, p. 20-28.
dc.identifier.isbn978-0-7695-3738-2
dc.identifier.urihttp://hdl.handle.net/2117/24586
dc.description.abstractThe Grid is an heterogeneous and dynamic environment which enables distributed computation. This makes it a technology prone to failures. Some related work uses replication to overcome failures in a set of independent tasks, and in workflow applications, but they do not consider possible resource limitations when scheduling the replicas. In this paper, we focus on the use of task replication techniques for workflow applications, trying to achieve not only tolerance to the possible failures in an execution, but also to speed up the computation without demanding the user to implement an application-level checkpoint, which may be a difficult task depending on the application. Moreover, we also study what to do when there are not enough resources for replicating all running tasks. We establish different priorities of replication depending on the graph of the workflow application, giving more priority to tasks with a higher output degree. We have implemented our proposed policy in the GRID superscalar system, and we have run the fastDNAml as an experiment to prove our objectives are reached. Finally, we have identified and studied a problem which may arise due to the use of replication in workflow applications: the replication wait time.
dc.format.extent9 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures distribuïdes
dc.subject.lcshFault-tolerant computing
dc.subject.lcshElectronic data processing--Distributed processing
dc.subject.otherCheckpointing
dc.subject.otherGraph theory
dc.subject.otherGrid computing
dc.subject.otherSoftware fault tolerance
dc.subject.otherWorkflow management software
dc.subject.otherApplication-level checkpoint
dc.subject.otherDistributed computation
dc.subject.otherFailure tolerance
dc.subject.otherFfastDNAml
dc.subject.otherGraph-based task replication
dc.subject.otherGrid superscalar system
dc.subject.otherReplication wait time
dc.subject.otherWorkflow applications AUTHOR KEYWORDS Grid computing fault tolerance task replication workflow scheduling IEEE TERMS Computer architecture Distributed computing Fault tolerance Fault tolerant systems Grid computing High performance computing Processor scheduling Proposals
dc.titleGraph-based task replication for workflow applications
dc.typeConference report
dc.subject.lemacTolerància als errors (Informàtica)
dc.subject.lemacProcessament distribuït de dades
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1109/HPCC.2009.29
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/xpl/abstractKeywords.jsp?tp=&arnumber=5166972&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F5166953%2F5166954%2F05166972.pdf%3Farnumber%3D5166972
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac15117137
dc.description.versionPostprint (published version)
dc.date.lift10000-01-01
local.citation.authorSirvent, R.; Badia, R.M.; Labarta, J.
local.citation.contributorIEEE International Conference on High Performance Computing and Communications
local.citation.pubplaceSeül
local.citation.publicationName2009 11th IEEE international conference on high performance computing and communications: 25-27 June, Seoul, Korea: proceedings
local.citation.startingPage20
local.citation.endingPage28


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple