dc.contributor.author: Alvanos, Michail
dc.contributor.author: Farreras Esclusa, Montserrat
dc.contributor.author: Tiotto, Ettore
dc.contributor.author: Amaral, José Nelson
dc.contributor.author: Martorell Bofill, Xavier
dc.contributor.other: Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned: 2013-09-19T17:10:58Z
dc.date.created: 2013
dc.date.issued: 2013
dc.identifier.citation: Alvanos, M. [et al.]. Improving communication in PGAS environments: Static and dynamic coalescing in UPC. A: International Conference on Supercomputing. "ICS'13: proceedings of the 2013 ACM International Conference on Supercomputing: June 10-14, 2013: Eugene, Oregon, USA". Eugene, Oregon: ACM, 2013, p. 129-138.
dc.identifier.isbn: 978-1-4503-2130-3
dc.identifier.uri: http://hdl.handle.net/2117/20168
dc.description.abstract: The goal of Partitioned Global Address Space (PGAS) languages is to improve programmer productivity in large scale parallel machines. However, PGAS programs may have many fine-grained shared accesses that lead to performance degradation. Manual code transformations or compiler optimizations are required to improve the performance of programs with fine-grained accesses. The downside of manual code transformations is the increased program complexity that hinders programmer productivity. On the other hand, most compiler optimizations of fine-grain accesses require knowledge of physical data mapping and the use of parallel loop constructs. This paper presents an optimization for the Unified Parallel C language that combines compile time (static) and runtime (dynamic) coalescing of shared data, without the knowledge of physical data mapping. Larger messages increase the network efficiency and static coalescing decreases the overhead of library calls. The performance evaluation uses two microbenchmarks and three benchmarks to obtain scaling and absolute performance numbers on up to 32768 cores of a Power 775 machine. Our results show that the compiler transformation results in speedups from 1.15X up to 21X compared with the baseline versions and that they achieve up to 63% the performance of the MPI versions.
dc.format.extent: 10 p.
dc.language.iso: eng
dc.publisher: ACM
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject: Àrees temàtiques de la UPC::Informàtica::Enginyeria del software
dc.subject.lcsh: Software engineering
dc.subject.other: One-sided communication
dc.subject.other: Partitioned global address space
dc.subject.other: Performance evaluation
dc.subject.other: Unified parallel c
dc.title: Improving communication in PGAS environments: Static and dynamic coalescing in UPC
dc.type: Conference report
dc.subject.lemac: Enginyeria de programari
dc.contributor.group: Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi: 10.1145/2464996.2465006
dc.description.peerreviewed: Peer Reviewed
dc.relation.publisherversion: http://dl.acm.org/citation.cfm?id=2465006
dc.rights.access: Restricted access - publisher's policy
local.identifier.drac: 12674192
dc.description.version: Postprint (published version)
dc.relation.projectid: info:eu-repo/grantAgreement/EC/FP7/287759/EU/High Performance and Embedded Architecture and Compilation/HIPEAC
dc.date.lift: 10000-01-01
local.citation.author: Alvanos, M.; Farreras, M.; Tiotto, E.; Amaral, J.N.; Martorell, X.
local.citation.contributor: International Conference on Supercomputing
local.citation.pubplace: Eugene, Oregon
local.citation.publicationName: ICS'13: proceedings of the 2013 ACM International Conference on Supercomputing: June 10-14, 2013: Eugene, Oregon, USA
local.citation.startingPage: 129
local.citation.endingPage: 138