Mostra el registre d'ítem simple

dc.contributor.authorMaynou Fernández, Joan
dc.contributor.authorPairó, Erola
dc.contributor.authorMarco, Santiago
dc.contributor.authorPerera Lluna, Alexandre
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial
dc.date.accessioned2017-01-19T13:09:40Z
dc.date.available2017-01-19T13:09:40Z
dc.date.issued2015-11-09
dc.identifier.citationMaynou, J., Pairó, E., Marco, S., Perera, A. Sequence information gain based motif analysis. "BMC bioinformatics", 9 Novembre 2015, vol. 16, núm. 377, p. 1-13.
dc.identifier.issn1471-2105
dc.identifier.urihttp://hdl.handle.net/2117/99688
dc.description.abstractBackground: The detection of regulatory regions in candidate sequences is essential for the understanding of the regulation of a particular gene and the mechanisms involved. This paper proposes a novel methodology based on information theoretic metrics for finding regulatory sequences in promoter regions. Results: This methodology (SIGMA) has been tested on genomic sequence data for Homo sapiens and Mus musculus. SIGMA has been compared with different publicly available alternatives for motif detection, such as MEME/MAST, Biostrings (Bioconductor package), MotifRegressor, and previous work such Qresiduals projections or information theoretic based detectors. Comparative results, in the form of Receiver Operating Characteristic curves, show how, in 70 % of the studied Transcription Factor Binding Sites, the SIGMA detector has a better performance and behaves more robustly than the methods compared, while having a similar computational time. The performance of SIGMA can be explained by its parametric simplicity in the modelling of the non-linear co-variability in the binding motif positions. Conclusions: Sequence Information Gain based Motif Analysis is a generalisation of a non-linear model of the cis-regulatory sequences detection based on Information Theory. This generalisation allows us to detect transcription factor binding sites with maximum performance disregarding the covariability observed in the positions of the training set of sequences. SIGMA is freely available to the public at http://b2slab.upc.edu.
dc.format.extent13 p.
dc.language.isoeng
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Enginyeria biomèdica
dc.subject.lcshGenetics -- Computer science
dc.titleSequence information gain based motif analysis
dc.typeArticle
dc.subject.lemacGenètica -- Informàtica
dc.contributor.groupUniversitat Politècnica de Catalunya. SISBIO - Senyals i Sistemes Biomèdics
dc.identifier.doi10.1186/s12859-015-0811-x
dc.relation.publisherversionhttp://www.biomedcentral.com
dc.rights.accessOpen Access
local.identifier.drac17240012
dc.description.versionPostprint (published version)
local.citation.authorMaynou, J.; Pairó, E.; Marco, S.; Perera, A.
local.citation.publicationNameBMC bioinformatics
local.citation.volume16
local.citation.number377
local.citation.startingPage1
local.citation.endingPage13
dc.identifier.pmid26553056


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple