dc.contributor.author | Yu, Chenle |
dc.contributor.author | Royuela Alcázar, Sara |
dc.contributor.author | Quiñones Moreno, Eduardo |
dc.contributor.other | Universitat Politècnica de Catalunya. Doctorat en Arquitectura de Computadors |
dc.contributor.other | Barcelona Supercomputing Center |
dc.date.accessioned | 2021-09-16T08:30:12Z |
dc.date.available | 2021-09-16T08:30:12Z |
dc.date.issued | 2021 |
dc.identifier.citation | Yu, C.; Royuela, S.; Quiñones, E. Enhancing OpenMP tasking model: performance and portability. A: International Workshop on OpenMP. "OpenMP: Enabling Massive Node-Level Parallelism: 17th International Workshop on OpenMP, IWOMP 2021: Bristol, UK, September 14–16, 2021: proceedings". Berlín: Springer, 2021, p. 35-49. ISBN 978-3-030-85262-7. DOI 10.1007/978-3-030-85262-7_3. |
dc.identifier.isbn | 978-3-030-85262-7 |
dc.identifier.uri | http://hdl.handle.net/2117/351422 |
dc.description.abstract | OpenMP, as the de-facto standard programming model in symmetric multiprocessing for HPC, has seen its performance boosted continuously by the community, either through implementation enhancements or specification augmentations. Furthermore, the language has evolved from a prescriptive nature, as defined by the thread-centric model, to a descriptive behavior, as defined by the task-centric model. However, the overhead related to the orchestration of tasks is still relatively high. Applications exploiting very fine-grained parallelism and systems with a large number of cores available might fail on scaling. In this work, we propose to include the concept of Task Dependency Graph (TDG) in the specification by introducing a new clause, named taskgraph, attached to task or target directives. By design, the TDG allows alleviating the overhead associated with the OpenMP tasking model, and it also facilitates linking OpenMP with other programming models that support task parallelism. According to our experiments, a GCC implementation of the taskgraph is able to significantly reduce the execution time of fine-grained task applications and increase their scalability with regard to the number of threads. |
dc.description.sponsorship | This work has been supported by the EU H2020 project AMPERE under the grant agreement no. 871669. |
dc.format.extent | 15 p. |
dc.language.iso | eng |
dc.publisher | Springer |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles |
dc.subject.lcsh | Supercomputers |
dc.subject.lcsh | Parallel programming (Computer science) |
dc.subject.lcsh | Multiprocessors |
dc.subject.other | OpenMP specification |
dc.subject.other | Tasking model |
dc.subject.other | Runtime overhead |
dc.title | Enhancing OpenMP tasking model: performance and portability |
dc.type | Conference report |
dc.subject.lemac | Supercomputadors |
dc.subject.lemac | Programació en paral·lel (Informàtica) |
dc.subject.lemac | Multiprocessadors |
dc.identifier.doi | 10.1007/978-3-030-85262-7_3 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | https://link.springer.com/chapter/10.1007/978-3-030-85262-7_3 |
dc.rights.access | Open Access |
local.identifier.drac | 32029531 |
dc.description.version | Postprint (author's final draft) |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/H2020/871669/EU/A Model-driven development framework for highly Parallel and EneRgy-Efficient computation supporting multi-criteria optimisation/AMPERE |
local.citation.author | Yu, C.; Royuela, S.; Quiñones, E. |
local.citation.contributor | International Workshop on OpenMP |
local.citation.pubplace | Berlín |
local.citation.publicationName | OpenMP: Enabling Massive Node-Level Parallelism: 17th International Workshop on OpenMP, IWOMP 2021: Bristol, UK, September 14–16, 2021: proceedings |
local.citation.startingPage | 35 |
local.citation.endingPage | 49 |