A hierarchical parallel implementation for heterogeneous computing. Application to algebra-based CFD simulations on hybrid supercomputers

Álvarez Farré, Xavier; Gorobets, Andrei; Trias Miquel, Francesc Xavier

doi:10.1016/j.compfluid.2020.104768

Visualitza/Obre

caf_paper.pdf (1,251Mb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Álvarez Farré, Xavier

Gorobets, Andrei

Trias Miquel, Francesc Xavier

Tipus de documentArticle

Data publicació2021-01

EditorElsevier

Condicions d'accésAccés obert

Attribution-NonCommercial-NoDerivs 4.0 International

Llevat que s'hi indiqui el contrari, els continguts d'aquesta obra estan subjectes a la llicència de Creative Commons : Reconeixement-NoComercial-SenseObraDerivada 4.0 Internacional

ProjecteALGORITMOS NUMERICOS AVANZADOS PARA LA MEJORA DE LA EFICIENCIA ENERGETICA EN LOS SECTORES EOLICO Y SOLAR-TERMICO: DESARROLLO%2FADAPTACION A NUEVAS ARQUITECTURAS COMPUTACIONALES (AEI-ENE2017-88697-R)

Abstract

The quest for new portable implementations of simulation algorithms is motivated by the increasing variety of computing architectures. Moreover, the hybridization of high-performance computing systems imposes additional constraints, since heterogeneous computations are needed to efficiently engage processors and massively-parallel accelerators. This, in turn, involves different parallel paradigms and computing frameworks and requires complex data exchanges between computing units. Typically, simulation codes rely on sophisticated data structures and computing subroutines, so-called kernels, which makes portability terribly cumbersome. Thus, a natural way to achieve portability is to dramatically reduce the complexity of both data structures and computing kernels. In our algebra-based approach, the scale-resolving simulation of incompressible turbulent flows on unstructured meshes relies on three fundamental kernels: the sparse matrix-vector product, the linear combination of vectors and the dot product. It is noteworthy that this approach is not limited to a particular kind of numerical method or a set of governing equations. In our code, an auto-balanced multilevel partitioning distributes workload among computing devices of various architectures. The overlap of computations and multistage communications efficiently hides the data exchanges overhead in large-scale supercomputer simulations. In addition to computing on accelerators, special attention is paid at efficiency on manycore processors in multiprocessor nodes with significant non-uniform memory access factor. Parallel efficiency and performance are studied in detail for different execution modes on various supercomputers using up to 9,600 processor cores and up to 256 graphics processor units. The heterogeneous implementation model described in this work is a general-purpose approach that is well suited for various subroutines in numerical simulation codes.

CitacióAlvarez, X.; Gorobets, A.; Trias, F.X. A hierarchical parallel implementation for heterogeneous computing. Application to algebra-based CFD simulations on hybrid supercomputers. "Computers and fluids", 2021, vol. 214, p. 104768/1-104768/13.

URIhttp://hdl.handle.net/2117/335542

DOI10.1016/j.compfluid.2020.104768

ISSN0045-7930

Versió de l'editorhttps://www.sciencedirect.com/science/article/pii/S0045793020303388

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
caf_paper.pdf		1,251Mb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

A hierarchical parallel implementation for heterogeneous computing. Application to algebra-based CFD simulations on hybrid supercomputers

Visualitza/Obre

Explora