Biological structure discovery using Bayesian Networks
dc.contributor | González, Juan Ramón |
dc.contributor | Reverter, Ferràn |
dc.contributor | Vegas, Esteban |
dc.contributor.author | Esnaola, Mikel |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa |
dc.date.accessioned | 2013-05-28T12:26:30Z |
dc.date.issued | 2011-07 |
dc.identifier.uri | http://hdl.handle.net/2099.1/18304 |
dc.description.abstract | This work describes, implements and applies to real data some of the existing structure search methods for Bayesian Networks. Because the graph space is vast, complex search methods based on Markov Chain Monte Carlo (MCMC) are often required. To extract as much information as possible from the posterior distributions obtained with the MCMC methods, Bayesian model averaging is introduced and adapted to the particular case of Bayesian Networks. We apply the structure search methods to two datasets. First, we use a synthetic dataset whose graph is known a priori, which allows us to compare the search methods and to check the convergence of the MCMC methods. Then we use a real dataset containing information about childhood neurodevelopment from a cohort of the INMA project. The three methods presented in this work have been implemented in R and C. The code is available as an R package on a public server at http://www.creal.cat/jrgonzalez/software.htm. Bayesian Networks are graphical models that describe the probabilistic relationships among a set of variables. They have been applied to a wide range of statistical problems, including the discovery of biological structures such as disease-phenotype networks or gene-protein pathways. This is not an easy task because the number of possible graphs grows super-exponentially with the number of variables. As a result, complex search methods based on Markov Chain Monte Carlo (MCMC) are often required. The aims of this project are to analyse the statistical basis of these structure search methods, implement them as efficiently as possible, compare them using synthetic data and apply them to real biological data. For the last objective, childhood neurodevelopment data will be used to find the interdependencies between socioeconomic and cognitive-attention variables. |
dc.language.iso | eng |
dc.publisher | Universitat Politècnica de Catalunya |
dc.subject | UPC subject areas::Mathematics and statistics::Mathematical statistics |
dc.subject.lcsh | Mathematical statistics |
dc.subject.other | Bayesian networks |
dc.subject.other | MCMC |
dc.subject.other | Bayesian model averaging |
dc.subject.other | childhood neurodevelopment |
dc.title | Biological structure discovery using Bayesian Networks |
dc.type | Master thesis |
dc.subject.lemac | Mathematical statistics |
dc.subject.ams | AMS Classification::62 Statistics |
dc.rights.access | Restricted access - author's decision |
dc.date.lift | 10000-01-01 |
dc.audience.educationlevel | Master's |
dc.audience.mediator | Universitat Politècnica de Catalunya. Facultat de Matemàtiques i Estadística |
This item appears in the following collections
- Màster universitari en Estadística i Investigació Operativa (UPC-UB)
Inter-university degree UPC-UB