|
E-prints UPC >
Altres >
Enviament des de DRAC >
Empreu aquest identificador per citar o enllaçar aquest ítem:
http://hdl.handle.net/2117/13326
|
Ítem no disponible en accés obert per política de l'editorial
| Arxiu |
Descripció |
Mida | Format |
| Hybrid MPI+Open MP ....pdf | | 581.26 kB | Adobe PDF |  |
|
| Citació: | Gorobets, A. [et al.]. Hybrid MPI+OpenMP parallelization of an FFT-based 3D Poisson solver with one periodic direction. "Computers and fluids", Octubre 2011, vol. 49, núm. 1, p. 101-109. |
| Títol: | Hybrid MPI+OpenMP parallelization of an FFT-based 3D Poisson solver with one periodic direction |
| Autor: | Gorobets, Andrei ; Trias Miquel, Francesc Xavier ; Borrell Pol, Ricard ; Lehmkuhl Barba, Oriol ; Oliva Llena, Asensio  |
| Data: | oct-2011 |
| Tipus de document: | Article |
| Resum: | This work is devoted to the development of efficient parallel algorithms for the direct numerical simulation (DNS) of incompressible flows on modern supercomputers. In doing so, a Poisson equation needs to be solved at each time-step to project the velocity field onto a divergence-free space. Due to the non-local nature of its solution, this elliptic system is the part of the algorithm that is most difficult to parallelize.
The Poisson solver presented here is restricted to problems with one uniform periodic direction. It is a combination of a block preconditioned Conjugate Gradient (PCG) and an FFT diagonalization. The latter
decomposes the original system into a set of mutually independent 2D systems that are solved by means of the PCG algorithm. For the most ill-conditioned systems, that correspond to the lowest Fourier frequencies,
the PCG is replaced by a direct Schur-complement based solver.
The previous version of the Poisson solver was conceived for single-core (also dual-core) processors and therefore, the distributed memory model with message-passing interface (MPI) was used. The irruption of multi-core architectures motivated the use of a two-level hybrid MPI + OpenMP parallelization with the shared memory model on the second level. Advantages and implementation details for the additional
OpenMP parallelization are presented and discussed in this paper. Numerical experiments show that, within its range of efficient scalability, the previous MPI-only parallelization is slightly outperformed
by the MPI + OpenMP approach. But more importantly, the hybrid parallelization has allowed to significantly extend the range of efficient scalability. Here, the solver has been successfully tested up to 12800 CPU cores for meshes with up to 109 grid points. However, estimations based on the presented
results show that this range can be potentially stretched up until 200,000 cores approximately.
Finally, several examples of DNS simulations are briefly presented to illustrate some potential applications of the solver. |
| ISSN: | 0045-7930 |
| URI: | http://hdl.handle.net/2117/13326 |
| Versió de l'editor: | 10.1016/j.compfluid.2011.05.003 |
| Apareix a les col·leccions: | Altres. Enviament des de DRAC Departament de Màquines i Motors Tèrmics. Articles de revista CTTC - Centre Tecnològic de la Transferència de Calor. Articles de revista
|
| Comparteix: |
|
Queda prohibida la reproducció, transformació, distribució i comunicació pública d'aquesta obra. Es permet, en tot cas, la reproducció per a ús privat sempre i quan la còpia que se'n faci no sigui objecte d'utilització col·lectiva ni lucrativa (art. 31.2 del Reial Decret Legislatiu 1/1996, de 12 d'abril, pel qual s'aprova el Text Refós de la Llei de Propietat Intel·lectual, http://bibliotecnica.upc.es/sepi/legislacio.asp).
Per a qualsevol ús que es vulgui fer diferent al permès, dirigiu-vos a: sepi@upc.edu
|