Mostra el registre d'ítem simple

dc.contributor.authorConejero, Javier
dc.contributor.authorCorella, Sandra
dc.contributor.authorBadia Sala, Rosa Maria
dc.contributor.authorLabarta Mancho, Jesús José
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2017-05-29T07:33:22Z
dc.date.available2017-05-29T07:33:22Z
dc.date.issued2017-04-06
dc.identifier.citationConejero, J., Corella, S., Badia, R. M., Labarta, J. Task-based programming in COMPSs to converge from HPC to big data. "International journal of high performance computing applications", 1 Gener 2018, vol. 32, núm. 1, p. 45-60.
dc.identifier.issn1094-3420
dc.identifier.urihttp://hdl.handle.net/2117/104954
dc.description.abstractTask-based programming has proven to be a suitable model for high-performance computing (HPC) applications. Different implementations have been good demonstrators of this fact and have promoted the acceptance of task-based programming in the OpenMP standard. Furthermore, in recent years, Apache Spark has gained wide popularity in business and research environments as a programming model for addressing emerging big data problems. COMP Superscalar (COMPSs) is a task-based environment that tackles distributed computing (including Clouds) and is a good alternative for a task-based programming model for big data applications. This article describes why we consider that task-based programming models are a good approach for big data applications. The article includes a comparison of Spark and COMPSs in terms of architecture, programming model, and performance. It focuses on the differences that both frameworks have in structural terms, on their programmability interface, and in terms of their efficiency by means of three widely known benchmarking kernels: Wordcount, Kmeans, and Terasort. These kernels enable the evaluation of the more important functionalities of both programming models and analyze different work flows and conditions. The main results achieved from this comparison are (1) COMPSs is able to extract the inherent parallelism from the user code with minimal coding effort as opposed to Spark, which requires the existing algorithms to be adapted and rewritten by explicitly using their predefined functions, (2) it is an improvement in terms of performance when compared with Spark, and (3) COMPSs has shown to scale better than Spark in most cases. Finally, we discuss the advantages and disadvantages of both frameworks, highlighting the differences that make them unique, thereby helping to choose the right framework for each particular objective.
dc.description.sponsorshipThis work is supported by the Spanish Government (SEV2015-0493), by the Spanish Ministry of Science and Innovation (contract TIN2015-65316-P), by Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272). Javier Conejero’s postdoctoral contract is cofinanced by the Ministry of Economy and Competitiveness under the Juan de la Cierva Formación postdoctoral fellowship number FJCI-2015-24651. This work is also supported by the Intel-BSC Exascale Lab. The Human Brain Project receives funding from the EU’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no 604102.
dc.format.extent16 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Informàtica::Programació
dc.subject.lcshHigh performance computing
dc.subject.lcshBig data
dc.subject.otherProgramming models
dc.subject.otherDistributed computing
dc.subject.otherFramework comparison
dc.subject.otherBig data programming
dc.titleTask-based programming in COMPSs to converge from HPC to big data
dc.typeArticle
dc.subject.lemacCàlcul intensiu (Informàtica)
dc.subject.lemacMacrodades
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1177/1094342017701278
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://journals.sagepub.com/doi/pdf/10.1177/1094342017701278
dc.rights.accessOpen Access
local.identifier.drac20802419
dc.description.versionPostprint (author's final draft)
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO//FJCI-2015-24651/ES/FJCI-2015-24651/
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO//TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/
dc.relation.projectidinfo:eu-repo/grantAgreement/AGAUR/V PRI/2014 SGR 1051
dc.relation.projectidinfo:eu-repo/grantAgreement/AGAUR/V PRI/2014 SGR 1272
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/FP7/604102/EU/The Human Brain Project/HBP
local.citation.authorConejero, J., Corella, S., Badia, R. M., Labarta, J.
local.citation.publicationNameInternational journal of high performance computing applications
local.citation.startingPage45
local.citation.endingPage60


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple