Show simple item record

dc.contributorGonzález, Juan Ramón
dc.contributorReverter, Ferràn
dc.contributorVegas, Esteban
dc.contributor.authorEsnaola, Mikel
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa
dc.date.accessioned2013-05-28T12:26:30Z
dc.date.issued2011-07
dc.identifier.urihttp://hdl.handle.net/2099.1/18304
dc.description.abstractThis work aims to describe, implement and apply to real data some of the existing structure search methods with Bayesian Networks. Due to the vast dimensions of the graph space, complex search methods based on Markov Chain Monte Carlo (MCMC) are often required. In order to extract as much information as possible from the posterior distributions obtained with the MCMC methods, Bayesian model averaging is introduced and adapted to the particular case of Bayesian Networks. We apply the structure search methods to two different datasets. Firstly, we use a synthetic dataset whose graph is known a priori. This allows us to compare each of the search methods, as well as to check the convergence of the MCMC methods. Afterwards, we use a real dataset containing information about childhood neurodevelopment from a cohort of the INMA project. The three methods presented in this work have been implemented in R and C. The code has been made available as an R package on a public server at http://www.creal.cat/jrgonzalez/software.htm.. Bayesian Networks are graphical models that describe the probabilistic relationships between certain variables. They have been applied to a wide range of statistical issues. Among them is the discovery of biological structures such as disease-phenotype networks or gene-protein pathways. This is not an easy task because the number of possible graphs grows super-exponentially with the number of variables. As a result, complex search methods based on Markov Chain Monte Carlo (MCMC) are often required. The aims of this project are: analysing the statistical basis of these structure search methods, implementing them as efficiently as possible, comparing them using synthetic data and applying them to real biological data. For this last objective data of neurodevelopment during childhood will be used, in order to find the interdependencies between socioeconomical and cognitive-attention variables
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.subjectÀrees temàtiques de la UPC::Matemàtiques i estadística::Estadística matemàtica
dc.subject.lcshMathematical statistics
dc.subject.otherBayesian networks
dc.subject.otherMCMC
dc.subject.otherBayesian model averaging
dc.subject.otherchildhood neurodevelopment
dc.titleBiological structure discovery using Bayesian Networks
dc.typeMaster thesis
dc.subject.lemacEstadística matemàtica
dc.subject.amsClassificació AMS::62 Statistics
dc.rights.accessRestricted access - author's decision
dc.date.lift10000-01-01
dc.audience.educationlevelMàster
dc.audience.mediatorUniversitat Politècnica de Catalunya. Facultat de Matemàtiques i Estadística


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder