dc.contributor.author: Ciesko, Jan
dc.contributor.author: Bueno Hedo, Javier
dc.contributor.author: Puzovic, Nikola
dc.contributor.author: Ramírez Bellido, Alejandro
dc.contributor.author: Badia Sala, Rosa Maria
dc.contributor.author: Labarta Mancho, Jesús José
dc.contributor.other: Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.other: Barcelona Supercomputing Center
dc.date.accessioned: 2014-06-17T09:13:00Z
dc.date.created: 2013
dc.date.issued: 2013
dc.identifier.citation: Ciesko, J. [et al.]. Programmable and scalable reductions on clusters. In: IEEE International Parallel and Distributed Processing Symposium. "IEEE 27th International Parallel and Distributed Processing Symposium: 20–24 May 2013, Boston, Massachusetts: proceedings". Boston: Institute of Electrical and Electronics Engineers (IEEE), 2013, p. 560-568.
dc.identifier.isbn: 978-0-7685-4971-2
dc.identifier.uri: http://hdl.handle.net/2117/23241
dc.description.abstract: Reductions matter and they are here to stay. Wide adoption of parallel processing hardware in a broad range of computer applications has encouraged recent research efforts on their efficient parallelization. Furthermore, the trend towards high-productivity languages in mainstream computing increases the demand for efficient programming support. In this paper we present a new approach to parallel reductions for distributed memory systems that provides both scalability and programmability. Using OmpSs, a task-based parallel programming model, the developer can express scalable reductions through a single pragma annotation. This pragma annotation is applicable to tasks as well as to work-sharing constructs (with implicit tasking) and instructs the compiler to generate the required runtime calls. The supporting runtime handles data and task distribution, parallel execution and data reduction. Scalability is achieved through a software cache that maximizes local and temporal data reuse and allows overlapped computation and communication. Results confirm scalability for up to 32 12-core cluster nodes.
dc.description.sponsorship: We gratefully acknowledge the support of the European Commission through the ENCORE project (FP7-248647), the TERAFLUX project (FP7-249013), the TEXT project (FP7-261580), and the HiPEAC-3 Network of Excellence (FP7/ICT217068), as well as the support of the Intel-BSC Exascale Center, the Spanish Ministry of Education (TIN2007-60625, CSD2007-00050 and FPU program) and the Generalitat de Catalunya (2009-SGR-980).
dc.format.extent: 9 p.
dc.language.iso: eng
dc.publisher: Institute of Electrical and Electronics Engineers (IEEE)
dc.subject: UPC subject areas::Computer science::Computer architecture::Parallel architectures
dc.subject.lcsh: Parallel programming (Computer science)
dc.subject.other: Distributed systems
dc.subject.other: Parallel programming
dc.subject.other: Reductions
dc.subject.other: Runtime systems
dc.subject.other: Software cache
dc.title: Programmable and scalable reductions on clusters
dc.type: Conference report
dc.subject.lemac: Parallel programming (Computer science)
dc.contributor.group: Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi: 10.1109/IPDPS.2013.63
dc.rights.access: Restricted access - publisher's policy
local.identifier.drac: 12857456
dc.description.version: Postprint (published version)
dc.relation.projectid: info:eu-repo/grantAgreement/MEC//TIN2007-60625/ES/COMPUTACION DE ALTAS PRESTACIONES V/
dc.relation.projectid: info:eu-repo/grantAgreement/EC/FP7/249013/EU/Exploiting dataflow parallelism in Teradevice Computing/TERAFLUX
dc.date.lift: 10000-01-01
local.citation.author: Ciesko, J.; Bueno, J.; Puzovic, N.; Alex Ramirez; Badia, R.M.; Labarta, J.
local.citation.contributor: IEEE International Parallel and Distributed Processing Symposium
local.citation.pubplace: Boston
local.citation.publicationName: IEEE 27th International Parallel and Distributed Processing Symposium: 20–24 May 2013, Boston, Massachusetts: proceedings
local.citation.startingPage: 560
local.citation.endingPage: 568

