Mostra el registre d'ítem simple
Efficient parallel construction of suffix trees for genomes larger than main memory
dc.contributor.author | Comin, Matteo |
dc.contributor.author | Farreras Esclusa, Montserrat |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2014-04-01T14:47:31Z |
dc.date.created | 2013 |
dc.date.issued | 2013 |
dc.identifier.citation | Comin, M.; Farreras, M. Efficient parallel construction of suffix trees for genomes larger than main memory. A: European MPI Users' Group Meeting. "Proceedings of the 20th European MPI Users' Group Meeting (EuroMPI 2013): Madrid, Spain: September 15-18, 2013". Madrid: ACM, 2013, p. 211-216. |
dc.identifier.isbn | 978-846165133-7 |
dc.identifier.uri | http://hdl.handle.net/2117/22471 |
dc.description.abstract | The construction of suffix tree for very long sequences is essential for many applications, and it plays a central role in the bioinformatic domain. With the advent of modern sequencing technologies, biological sequence databases have grown dramatically. Also the methodologies required to analyze these data have become everyday more complex, requiring fast queries to multiple genomes. In this paper we presented Parallel Continuous Flow PCF, a parallel suffix tree construction method that is suitable for very long strings. We tested our method on the construction of suffix tree of the entire human genome, about 3GB. We showed that PCF can scale gracefully as the size of the input string grows. Our method can work with an efficiency of 90% with 36 processors and 55% with 172 processors. We can index the Human genome in 7 minutes using 172 nodes. |
dc.format.extent | 6 p. |
dc.language.iso | eng |
dc.publisher | ACM |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Informàtica teòrica::Algorísmica i teoria de la complexitat |
dc.subject | Àrees temàtiques de la UPC::Informàtica |
dc.subject.lcsh | Parallel algorithms |
dc.subject.lcsh | Bioinformatics |
dc.subject.other | Parallel algorithms |
dc.subject.other | Suffix tree |
dc.subject.other | Whole genome indexing |
dc.title | Efficient parallel construction of suffix trees for genomes larger than main memory |
dc.type | Conference report |
dc.subject.lemac | Algorismes paral·lels |
dc.subject.lemac | Bioinformàtica |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.identifier.doi | 10.1145/2488551.2488579 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://dl.acm.org/citation.cfm?id=2488579 |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 12893326 |
dc.description.version | Postprint (published version) |
dc.date.lift | 10000-01-01 |
local.citation.author | Comin, M.; Farreras, M. |
local.citation.contributor | European MPI Users' Group Meeting |
local.citation.pubplace | Madrid |
local.citation.publicationName | Proceedings of the 20th European MPI Users' Group Meeting (EuroMPI 2013): Madrid, Spain: September 15-18, 2013 |
local.citation.startingPage | 211 |
local.citation.endingPage | 216 |