Mostra el registre d'ítem simple

dc.contributor.authorTous Liesa, Rubén
dc.contributor.authorGounaris, Anastasios
dc.contributor.authorTripiana, Carlos
dc.contributor.authorTorres Viñals, Jordi
dc.contributor.authorGirona Turell, Sergi
dc.contributor.authorAyguadé Parra, Eduard
dc.contributor.authorLabarta Mancho, Jesús José
dc.contributor.authorBecerra Fontal, Yolanda
dc.contributor.authorCarrera Pérez, David
dc.contributor.authorValero Cortés, Mateo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2017-01-27T09:14:37Z
dc.date.available2017-01-27T09:14:37Z
dc.date.issued2015
dc.identifier.citationTous, R., Gounaris, A., Tripiana, C., Torres, J., Girona, S., Ayguadé, E., Labarta, J., Becerra, Y., Carrera, D., Valero, M. Spark deployment and performance evaluation on the MareNostrum supercomputer. A: IEEE International Conference on Big Data. "2015 IEEE International Conference on Big Data: Oct 29-Nov 01, 2015, Santa Clara, CA, USA: proceedings". Santa Clara, CA: Institute of Electrical and Electronics Engineers (IEEE), 2015, p. 299-306.
dc.identifier.isbn978-1-4799-9925-5
dc.identifier.urihttp://hdl.handle.net/2117/100165
dc.description.abstractIn this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a petascale supercomputer designed mainly for compute-intensive applications. As far as we know, this is the first attempt to investigate optimized deployment configurations of Spark on a petascale HPC setup. We detail the design of the framework and present some benchmark data to provide insights into the scalability of the system. We examine the impact of different configurations including parallelism, storage and networking alternatives, and we discuss several aspects in executing Big Data workloads on a computing system that is based on the compute-centric paradigm. Further, we derive conclusions aiming to pave the way towards systematic and optimized methodologies for fine-tuning data-intensive application on large clusters emphasizing on parallelism configurations.
dc.format.extent8 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcshBig data
dc.subject.lcshParallel processing (Electronic computers)
dc.subject.otherSparks
dc.subject.otherBenchmark testing
dc.subject.otherSupercomputers
dc.subject.otherScalability
dc.subject.otherHeart beat
dc.titleSpark deployment and performance evaluation on the MareNostrum supercomputer
dc.typeConference report
dc.subject.lemacMacrodades
dc.subject.lemacProcessament en paral·lel (Ordinadors)
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1109/BigData.2015.7363768
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/abstract/document/7363768/
dc.rights.accessOpen Access
local.identifier.drac19377684
dc.description.versionPostprint (author's final draft)
local.citation.authorTous, R.; Gounaris, A.; Tripiana, C.; Torres, J.; Girona, S.; Ayguadé, E.; Labarta, J.; Becerra, Y.; Carrera, D.; Valero, M.
local.citation.contributorIEEE International Conference on Big Data
local.citation.pubplaceSanta Clara, CA
local.citation.publicationName2015 IEEE International Conference on Big Data: Oct 29-Nov 01, 2015, Santa Clara, CA, USA: proceedings
local.citation.startingPage299
local.citation.endingPage306


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple