Optimization of atmospheric transport models on HPC platforms
10.1016/j.cageo.2016.08.019
Inclou dades d'ús des de 2022
Cita com:
hdl:2117/90707
Tipus de documentArticle
Data publicació2016-12
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
The performance and scalability of atmospheric transport models on high performance computing environments is often far from optimal for multiple reasons including, for example, sequential input and output, synchronous communications, work unbalance, memory access latency or lack of task overlapping. We investigate how different software optimizations and porting to non general-purpose hardware architectures improve code scalability and execution times considering, as an example, the FALL3D volcanic ash transport model. To this purpose, we implement the FALL3D model equations in the WARIS framework, a software designed from scratch to solve in a parallel and efficient way different geoscience problems on a wide variety of architectures. In addition, we consider further improvements in WARIS such as hybrid MPI-OMP parallelization, spatial blocking, auto-tuning and thread affinity. Considering all these aspects together, the FALL3D execution times for a realistic test case running on general-purpose cluster architectures (Intel Sandy Bridge) decrease by a factor between 7 and 40 depending on the grid resolution. Finally, we port the application to Intel Xeon Phi (MIC) and NVIDIA GPUs (CUDA) accelerator-based architectures and compare performance, cost and power consumption on all the architectures. Implications on time-constrained operational model configurations are discussed.
CitacióDe la Cruz, R., Folch, A., Farré, P., Cabezas, J., Navarro, N., Cela, J. Optimization of atmospheric transport models on HPC platforms. "Computers and geosciences", Desembre 2016, vol. 97, p. 30-39.
ISSN0098-3004
Versió de l'editorhttp://www.sciencedirect.com/science/article/pii/S0098300416303077
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
Optimization of ... odels on HPC platforms.pdf | 1,649Mb | Visualitza/Obre |