DSpace UPC
Use this identifier to cite or link to this item: http://hdl.handle.net/2117/13326

Item not available in open access due to publisher policy

File: Hybrid MPI+Open MP ....pdf
Size: 581.26 kB
Format: Adobe PDF (restricted access)

Citation: Gorobets, A. [et al.]. Hybrid MPI+OpenMP parallelization of an FFT-based 3D Poisson solver with one periodic direction. "Computers and fluids", October 2011, vol. 49, no. 1, p. 101-109.
Title: Hybrid MPI+OpenMP parallelization of an FFT-based 3D Poisson solver with one periodic direction
Authors: Gorobets, Andrei; Trias Miquel, Francesc Xavier; Borrell Pol, Ricard; Lehmkuhl Barba, Oriol; Oliva Llena, Asensio
Date: Oct 2011
Document type: Article
Abstract: This work is devoted to the development of efficient parallel algorithms for the direct numerical simulation (DNS) of incompressible flows on modern supercomputers. In such simulations, a Poisson equation must be solved at each time-step to project the velocity field onto a divergence-free space. Due to the non-local nature of its solution, this elliptic system is the part of the algorithm that is most difficult to parallelize. The Poisson solver presented here is restricted to problems with one uniform periodic direction. It combines a block preconditioned Conjugate Gradient (PCG) method with an FFT diagonalization. The latter decomposes the original system into a set of mutually independent 2D systems that are solved by means of the PCG algorithm. For the most ill-conditioned systems, which correspond to the lowest Fourier frequencies, the PCG is replaced by a direct Schur-complement based solver. The previous version of the Poisson solver was conceived for single-core (and dual-core) processors, and it therefore used the distributed memory model with the message-passing interface (MPI). The advent of multi-core architectures motivated the use of a two-level hybrid MPI + OpenMP parallelization, with the shared memory model on the second level. Advantages and implementation details of the additional OpenMP parallelization are presented and discussed in this paper. Numerical experiments show that, within its range of efficient scalability, the previous MPI-only parallelization is slightly outperformed by the MPI + OpenMP approach. More importantly, the hybrid parallelization has significantly extended the range of efficient scalability. Here, the solver has been successfully tested on up to 12800 CPU cores for meshes with up to 10^9 grid points. However, estimates based on the presented results show that this range can potentially be stretched to approximately 200,000 cores. Finally, several examples of DNS simulations are briefly presented to illustrate some potential applications of the solver.
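The FFT diagonalization described in the abstract can be illustrated with a minimal serial sketch. It is not the authors' implementation: it assumes a uniform grid, a 5-point stencil with zero Dirichlet boundaries in the two non-periodic directions, and plain (unpreconditioned) CG for every Fourier mode, whereas the paper uses a block PCG and a direct Schur-complement solver for the lowest frequencies. The names `apply_2d`, `cg`, and `solve_poisson` are hypothetical.

```python
import numpy as np

def apply_2d(u, h):
    """Apply the 5-point negative Laplacian, assuming zero Dirichlet boundaries."""
    p = np.pad(u, 1)  # one-cell halo of zeros around the 2D plane
    return (4.0 * u - p[:-2, 1:-1] - p[2:, 1:-1]
            - p[1:-1, :-2] - p[1:-1, 2:]) / h**2

def cg(apply_a, b, tol=1e-10, max_iter=1000):
    """Plain conjugate gradient for a Hermitian positive definite operator."""
    x = np.zeros_like(b)
    r = b.copy()
    p = r.copy()
    rs = np.vdot(r, r).real
    b_norm = np.sqrt(rs)
    if b_norm == 0.0:
        return x
    for _ in range(max_iter):
        ap = apply_a(p)
        alpha = rs / np.vdot(p, ap).real
        x += alpha * p
        r -= alpha * ap
        rs_new = np.vdot(r, r).real
        if np.sqrt(rs_new) <= tol * b_norm:
            break
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x

def solve_poisson(f, h, hz):
    """Solve -laplace(u) = f on a grid whose last axis is uniform and periodic."""
    nz = f.shape[2]
    f_hat = np.fft.fft(f, axis=2)  # diagonalize the periodic direction
    k = np.arange(nz)
    # Eigenvalues of the periodic second-difference operator -d2/dz2.
    shift = (2.0 - 2.0 * np.cos(2.0 * np.pi * k / nz)) / hz**2
    u_hat = np.empty_like(f_hat)
    for m in range(nz):
        # Each Fourier mode is an independent 2D system; small shifts
        # (low frequencies) give the worst-conditioned systems.
        u_hat[:, :, m] = cg(lambda v, s=shift[m]: apply_2d(v, h) + s * v,
                            f_hat[:, :, m])
    return np.fft.ifft(u_hat, axis=2).real
```

Because no iteration of the loop over `m` depends on another, the 2D systems can in principle be solved concurrently, which is the property the abstract refers to as mutual independence.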
ISSN: 0045-7930
URI: http://hdl.handle.net/2117/13326
DOI: 10.1016/j.compfluid.2011.05.003
Appears in collections: Altres. Enviament des de DRAC (Others. Submissions from DRAC)
CTTC - Centre Tecnològic de la Transferència de Calor. Journal articles
Departament de Màquines i Motors Tèrmics. Journal articles
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication or transformation without the authorization of the rights holder is prohibited.

For any use not provided for by law, contact: sepi.bupc@upc.edu

 

Universitat Politècnica de Catalunya. Servei de Biblioteques, Publicacions i Arxius