Mostra el registre d'ítem simple

dc.contributor.authorOzen, Guray
dc.contributor.authorAyguadé Parra, Eduard
dc.contributor.authorLabarta Mancho, Jesús José
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2015-05-11T18:16:46Z
dc.date.available2016-05-11T00:30:46Z
dc.date.created2014
dc.date.issued2014
dc.identifier.citationOzen, G.; Ayguade, E.; Labarta, J. On the roles of the programmer, the compiler and the runtime system when programming accelerators in OpenMP. A: International Workshop on OpenMP. "Using and Improving OpenMP for Devices, Tasks, and More 10th International Workshop on OpenMP, IWOMP 2014 Salvador, Brazil, September 28-30, 2014: proceedings". Salvador: Springer, 2014, p. 215-229.
dc.identifier.isbn978-3-319-11453-8
dc.identifier.urihttp://hdl.handle.net/2117/27886
dc.description.abstractOpenMP includes in its latest 4.0 specification the accelerator model. In this paper we present a partial implementation of this specification in the OmpSs programming model developed at the Barcelona Supercomputing Center with the aim of identifying which should be the roles of the programmer, the compiler and the runtime system in order to facilitate the asynchronous execution of tasks in architectures with multiple accelerator devices and processors. The design of OmpSs is highly biassed to delegate most of the decisions to the runtime system, which based on the task graph built at runtime (depend clauses) is able to schedule tasks in a data flow way to the available processors and accelerator devices and orchestrate data transfers and reuse among multiple address spaces. For this reason our implementation is partial, just considering from 4.0 those directives that enable the compiler the generation of the so called “kernels” to be executed on the target device. Several extensions to the current specification are also presented, such as the specification of tasks in “native” CUDA and OpenCL or how to specify the device and data privatization in the target construct. Finally, the paper also discusses some challenges found in code generation and a preliminary performance evaluation with some kernel applications.
dc.format.extent15 p.
dc.language.isoeng
dc.publisherSpringer
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles
dc.subjectÀrees temàtiques de la UPC::Informàtica::Programació
dc.subject.lcshOpenMP
dc.subject.lcshParallel programming (Computer science)
dc.subject.otherOpenMP accelerator model
dc.subject.otherOmpSs
dc.subject.otherOpenCL
dc.subject.otherCUDA
dc.titleOn the roles of the programmer, the compiler and the runtime system when programming accelerators in OpenMP
dc.typeConference report
dc.subject.lemacOpenMP
dc.subject.lemacProgramació en paral·lel (Informàtica)
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1007/978-3-319-11454-5_16
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://link.springer.com/chapter/10.1007%2F978-3-319-11454-5_16
dc.rights.accessOpen Access
local.identifier.drac15626183
dc.description.versionPostprint (author’s final draft)
local.citation.authorOzen, G.; Ayguade, E.; Labarta, J.
local.citation.contributorInternational Workshop on OpenMP
local.citation.pubplaceSalvador
local.citation.publicationNameUsing and Improving OpenMP for Devices, Tasks, and More 10th International Workshop on OpenMP, IWOMP 2014 Salvador, Brazil, September 28-30, 2014: proceedings
local.citation.startingPage215
local.citation.endingPage229


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple