Mostra el registre d'ítem simple

dc.contributor.authorGraffelman, Jan
dc.contributor.authorGalván Femenía, Iván
dc.contributor.authorde Cid, Rafael
dc.contributor.authorBarceló Vidal, Carles
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Estadística i Investigació Operativa
dc.date.accessioned2019-06-12T11:55:49Z
dc.date.available2019-06-12T11:55:49Z
dc.date.issued2019-04-24
dc.identifier.citationGraffelman, J. [et al.]. A log-ratio biplot approach for exploring genetic relatedness based on identity by state. "Frontiers in Genetics", 24 Abril 2019, vol. 10, p.341-1/341-16
dc.identifier.issn1664-8021
dc.identifier.urihttp://hdl.handle.net/2117/134340
dc.description.abstractThe detection of cryptic relatedness in large population-based cohorts is of great importance in genome research. The usual approach for detecting closely related individuals is to plot allele sharing statistics, based on identity-by-state or identity-by-descent, in a two-dimensional scatterplot. This approach ignores that allele sharing data across individuals has in reality a higher dimensionality, and neither regards the compositional nature of the underlying counts of shared genotypes. In this paper we develop biplot methodology based on log-ratio principal component analysis that overcomes these restrictions. This leads to entirely new graphics that are essentially useful for exploring relatedness in genetic databases from homogeneous populations. The proposed method can be applied in an iterative manner, acting as a looking glass for more remote relationships that are harder to classify. Datasets from the 1,000 Genomes Project and the Genomes For Life-GCAT Project are used to illustrate the proposed method. The discriminatory power of the log-ratio biplot approach is compared with the classical plots in a simulation study. In a non-inbred homogeneous population the classification rate of the log-ratio principal component approach outperforms the classical graphics across the whole allele frequency spectrum, using only identity by state. In these circumstances, simulations show that with 35,000 independent bi-allelic variants, log-ratio principal component analysis, combined with discriminant analysis, can correctly classify relationships up to and including the fourth degree
dc.format.extent16 p.
dc.language.isoeng
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Matemàtiques i estadística::Estadística aplicada
dc.subject.lcshMathematical statistics
dc.subject.lcshGenetics
dc.subject.otherAllele sharing
dc.subject.otherComposition
dc.subject.otherIdentity by state
dc.subject.otherIidentity by descent
dc.subject.otherLog-ratio transformation
dc.titleA log-ratio biplot approach for exploring genetic relatedness based on identity by state
dc.typeArticle
dc.subject.lemacEstadística matemàtica
dc.subject.lemacGenètica
dc.contributor.groupUniversitat Politècnica de Catalunya. COSDA-UPC - COmpositional and Spatial Data Analysis
dc.identifier.doi10.3389/fgene.2019.00341
dc.relation.publisherversionhttps://www.frontiersin.org/articles/10.3389/fgene.2019.00341/full
dc.rights.accessOpen Access
local.identifier.drac24903595
dc.description.versionPostprint (published version)
local.citation.authorGraffelman, J.; Galván, I.; de Cid, R.; Barceló, C.
local.citation.publicationNameFrontiers in Genetics
local.citation.volume341-16
local.citation.startingPage341-1
local.citation.endingPage16


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple