CUDAlign 4.0: incremental speculative traceback for exact chromosome-wide alignment in GPU clusters
Visualitza/Obre
Cita com:
hdl:2117/99741
Tipus de documentArticle
Data publicació2016-10-01
Condicions d'accésAccés obert
Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i
industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva
reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets
ProjecteCOMPUTACION DE ALTAS PRESTACIONES VII (MINECO-TIN2015-65316-P)
BARCELONA SUPERCOMPUTING CENTER - CENTRO. NACIONAL DE SUPERCOMPUTACION (MINECO-SEV-2015-0493)
BARCELONA SUPERCOMPUTING CENTER - CENTRO. NACIONAL DE SUPERCOMPUTACION (MINECO-SEV-2015-0493)
Abstract
This paper proposes and evaluates CUDAlign 4.0, a parallel strategy to obtain the optimal alignment of huge DNA sequences in multi-GPU platforms, using the exact Smith-Waterman (SW) algorithm. In the first phase of CUDAlign 4.0, a huge Dynamic Programming (DP) matrix is computed by multiple GPUs, which asynchronously communicate border elements to the right
neighbor in order to find the optimal score. After that, the traceback phase of SW is executed. The efficient parallelization of the
traceback phase is very challenging because of the high amount of data dependency, which particularly impacts the performance and limits the application scalability. In order to obtain a multi-GPU highly parallel traceback phase, we propose and evaluate a new parallel traceback algorithm called Incremental Speculative Traceback (IST), which pipelines the traceback phase, speculating incrementally over the values calculated so far, producing results in advance. With CUDAlign 4.0, we were able to calculate SW matrices with up to 60 Peta cells, obtaining the optimal local alignments of all Human and Chimpanzee homologous chromosomes, whose sizes range from 26 Millions of Base Pairs (MBP) up to 249 MBP. As far as we know, this is the first time such comparison was made with the SW exact method. We also show that the IST algorithm is able to reduce the traceback time from 2.15¿ up to 21.03¿, when compared with the baseline traceback algorithm. The human¿chimpanzee chromosome 5 comparison (180 MBP¿183 MBP) attained 10,370.00 GCUPS (Billions of Cells Updated per Second) using 384 GPUs, with a speculation hit ratio of 98.2%.
CitacióDe Sandes, E., Miranda, G., Martorell, X., Ayguade, E., Teodoro, G., de Melo, A. CUDAlign 4.0: incremental speculative traceback for exact chromosome-wide alignment in GPU clusters. "IEEE transactions on parallel and distributed systems", 1 Octubre 2016, vol. 27, núm. 10, p. 2838-2850.
ISSN1045-9219
Versió de l'editorhttp://ieeexplore.ieee.org/abstract/document/7374729/
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
CUDAlign+4.0+Incremental+Speculative.pdf | 3,864Mb | Visualitza/Obre |