Mostra el registre d'ítem simple

dc.contributor.authorCaminal Pallarés, Helena
dc.contributor.authorCaballero de Gea, Diego
dc.contributor.authorCebrián González, Juan Manuel
dc.contributor.authorFerrer, Roger
dc.contributor.authorCasas, Marc
dc.contributor.authorMoretó Planas, Miquel
dc.contributor.authorMartorell Bofill, Xavier
dc.contributor.authorValero Cortés, Mateo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2018-09-27T09:56:34Z
dc.date.available2019-03-13T01:30:44Z
dc.date.issued2018-06
dc.identifier.citationCaminal, H., Caballero, D., Cebrián, J.M., Ferrer, R., Casas, M., Moreto, M., Martorell, X., Valero, M. Performance and energy effects on task-based parallelized applications: User-directed versus manual vectorization. "Journal of supercomputing", Juny 2018, vol. 74, núm. 6, p. 2627-2637.
dc.identifier.issn0920-8542
dc.identifier.urihttp://hdl.handle.net/2117/121542
dc.description.abstractHeterogeneity, parallelization and vectorization are key techniques to improve the performance and energy efficiency of modern computing systems. However, programming and maintaining code for these architectures poses a huge challenge due to the ever-increasing architecture complexity. Task-based environments hide most of this complexity, improving scalability and usage of the available resources. In these environments, while there has been a lot of effort to ease parallelization and improve the usage of heterogeneous resources, vectorization has been considered a secondary objective. Furthermore, there has been a swift and unstoppable burst of vector architectures at all market segments, from embedded to HPC. Vectorization can no longer be ignored, but manual vectorization is tedious, error-prone and not practical for the average programmer. This work evaluates the feasibility of user-directed vectorization in task-based applications. Our evaluation is based on the OmpSs programming model, extended to support user-directed vectorization for different SIMD architectures (i.e., SSE, AVX2, AVX512). Results show that user-directed codes achieve manually optimized code performance and energy efficiency with minimal code modifications, favoring portability across different SIMD architectures.
dc.format.extent11 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles
dc.subject.lcshParallel processing (Electronic computers)
dc.subject.lcshMicroprocessors -- Energy consumption
dc.subject.lcshVector processing (Computer science)
dc.subject.otherData-level parallelism
dc.subject.otherTask-level parallelism
dc.subject.otherVectorization
dc.subject.otherEnergy efficiency
dc.titlePerformance and energy effects on task-based parallelized applications: User-directed versus manual vectorization
dc.typeArticle
dc.subject.lemacProcessament en paral·lel (Ordinadors)
dc.subject.lemacMicroprocessadors -- Consum d'energia
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1007/s11227-018-2294-9
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttps://link.springer.com/article/10.1007%2Fs11227-018-2294-9
dc.rights.accessOpen Access
local.identifier.drac22036349
dc.description.versionPostprint (author's final draft)
local.citation.authorCaminal, H.; Caballero, D.; Cebrián, J.M.; Ferrer, R.; Casas, M.; Moreto, M.; Martorell, X.; Valero, M.
local.citation.publicationNameJournal of supercomputing
local.citation.volume74
local.citation.number6
local.citation.startingPage2627
local.citation.endingPage2637


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple