OmpSs-2@Cluster: Distributed memory execution of nested OpenMP-style tasks
Visualitza/Obre
10.1007/978-3-031-12597-3_20
Inclou dades d'ús des de 2022
Cita com:
hdl:2117/377512
Tipus de documentText en actes de congrés
Data publicació2022
EditorSpringer Nature
Condicions d'accésAccés obert
Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i
industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva
reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets
ProjecteEuroEXA - Co-designed Innovation and System for Resilient Exascale Computing in Europe: From Applications to Silicon (EC-H2020-754337)
UPC-COMPUTACION DE ALTAS PRESTACIONES VIII (AEI-PID2019-107255GB-C22)
UPC-COMPUTACION DE ALTAS PRESTACIONES VIII (AEI-PID2019-107255GB-C22)
Abstract
State-of-the-art programming approaches generally have a strict division between intra-node shared memory parallelism and inter-node MPI communication. Tasking with dependencies offers a clean, dependable abstraction for a wide range of hardware and situations within a node, but research on task offloading between nodes is still relatively immature.
This paper presents a flexible task offloading extension of the OmpSs-2 programming model, which inherits task ordering from a sequential version of the code and uses a common address space to avoid address translation and simplify the use of data structures with pointers. It uses weak dependencies to enable work to be created concurrently. The program is executed in distributed dataflow fashion, and the runtime system overlaps the construction of the distributed dependency graph, enforces dependencies, transfers data, and schedules tasks for execution. Asynchronous task parallelism avoids synchronization that is often required in MPI+OpenMP tasks. Task scheduling is flexible, and data location is tracked through the dependencies. We wish to enable future work in resiliency, scalability, load balancing and malleability, and therefore release all source code and examples open source.
CitacióAguilar, J. [et al.]. OmpSs-2@Cluster: Distributed memory execution of nested OpenMP-style tasks. A: International European Conference on Parallel and Distributed Computing. "Euro-Par 2022: Parallel Processing: 28th International Conference on Parallel and Distributed Computing: Glasgow, UK, August 22-26, 2022: proceedings". Springer Nature, 2022, p. 319-334. ISBN 978-3-031-12597-3. DOI 10.1007/978-3-031-12597-3_20.
ISBN978-3-031-12597-3
Versió de l'editorhttps://link.springer.com/chapter/10.1007/978-3-031-12597-3_20
Col·leccions
- Doctorat en Arquitectura de Computadors - Ponències/Comunicacions de congressos [285]
- Computer Sciences - Ponències/Comunicacions de congressos [565]
- CAP - Grup de Computació d'Altes Prestacions - Ponències/Comunicacions de congressos [784]
- Departament d'Arquitectura de Computadors - Ponències/Comunicacions de congressos [1.948]
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
aguilar2022europar.pdf | 627,8Kb | Visualitza/Obre |