Efficient CFD code implementation for the ARM-based Mont-Blanc architecture
10.1016/j.future.2017.09.029
Cite as:
hdl:2117/111369
Document type: Article
Publication date: 2018-02
Publisher: Elsevier
Access conditions: Open access
Unless otherwise indicated, the contents of this work are subject to the Creative Commons license: Attribution-NonCommercial-NoDerivs 3.0 Spain
Project: MONT-BLANC - Mont-Blanc, European scalable and power efficient HPC platform based on low-power embedded technology (EC-FP7-288777)
MONT-BLANC 2 - Mont-Blanc 2, European scalable and power efficient HPC platform based on low-power embedded technology (EC-FP7-610402)
Mont-Blanc 3 - Mont-Blanc 3, European scalable and power efficient HPC platform based on low-power embedded technology (EC-H2020-671697)
DESARROLLO DE CODIGOS Y ALGORITMOS PARALELOS DE ALTAS PRESTACIONES PARA LA MEJORA DE LA EFICIENCIA EN LOS SECTORES EOLICO, SOLARTERMICO Y EDIFICACION (MINECO-ENE2014-60577-R)
IJCI-2014-21034 (MINECO-IJCI-2014-21034)
Abstract
Since 2011, the European project Mont-Blanc has been focused on enabling ARM-based technology for HPC, developing both hardware platforms and system software. The latest Mont-Blanc prototypes use system-on-chip (SoC) devices that combine a CPU and a GPU sharing a common main memory. Specific developments of parallel computing software and well-suited implementation approaches are of crucial importance for such a heterogeneous architecture in order to efficiently exploit its potential.
This paper is devoted to the optimizations carried out in the TermoFluids CFD code to efficiently run it on the Mont-Blanc system. The underlying numerical method is based on an unstructured finite-volume discretization of the Navier–Stokes equations for the numerical simulation of incompressible turbulent flows. It is implemented using a portable and modular operational approach based on a minimal set of linear algebra operations. An architecture-specific heterogeneous multilevel MPI+OpenMP+OpenCL implementation of such kernels is proposed. It includes optimizations of the storage formats, dynamic load balancing between the CPU and GPU devices and hiding of communication overheads by overlapping computations and data transfers. A detailed performance study shows time reductions of up to on the kernels’ execution with the new heterogeneous implementation, its scalability on up to 128 Mont-Blanc nodes and the energy savings (around ) achieved with the Mont-Blanc system versus the high-end hybrid supercomputer MinoTauro.
Citation: Oyarzun, G. [et al.]. Efficient CFD code implementation for the ARM-based Mont-Blanc architecture. "Future Generation Computer Systems", February 2018, vol. 79, no. 3, p. 786-796.
ISSN: 0167-739X
Publisher's version: http://www.sciencedirect.com/science/article/pii/S0167739X17305733