A genome-wide study of Hardy–Weinberg equilibrium with next generation sequence data

Graffelman, Jan; Jain, Deepti; Weir, B.S.

doi:10.1007/s00439-017-1786-7

Visualitza/Obre

Final paper (is open access) (1,958Mb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Graffelman, Jan

Jain, Deepti

Weir, B.S.

Tipus de documentArticle

Data publicació2017-04-03

Condicions d'accésAccés obert

Attribution-NonCommercial-NoDerivs 3.0 Spain

Llevat que s'hi indiqui el contrari, els continguts d'aquesta obra estan subjectes a la llicència de Creative Commons : Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya

Abstract

Statistical tests for Hardy–Weinberg equilibrium have been an important tool for detecting genotyping errors in the past, and remain important in the quality control of next generation sequence data. In this paper, we analyze complete chromosomes of the 1000 genomes project by using exact test procedures for autosomal and X-chromosomal variants. We find that the rate of disequilibrium largely exceeds what might be expected by chance alone for all chromosomes. Observed disequilibrium is, in about 60% of the cases, due to heterozygote excess. We suggest that most excess disequilibrium can be explained by sequencing problems, and hypothesize mechanisms that can explain exceptional heterozygosities. We report higher rates of disequilibrium for the MHC region on chromosome 6, regions flanking centromeres and p-arms of acrocentric chromosomes. We also detected long-range haplotypes and areas with incidental high disequilibrium. We report disequilibrium to be related to read depth, with variants having extreme read depths being more likely to be out of equilibrium. Disequilibrium rates were found to be 11 times higher in segmental duplications and simple tandem repeat regions. The variants with significant disequilibrium are seen to be concentrated in these areas. For next generation sequence data, Hardy–Weinberg disequilibrium seems to be a major indicator for copy number variation.

CitacióGraffelman, J., Jain, D., Weir, B. A genome-wide study of Hardy–Weinberg equilibrium with next generation sequence data. "Human genetics", 3 Abril 2017, vol. 136, núm. 6, p. 727-741.

URIhttp://hdl.handle.net/2117/104543

DOI10.1007/s00439-017-1786-7

ISSN0340-6717

Versió de l'editorhttps://link.springer.com/article/10.1007%2Fs00439-017-1786-7

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
GraffelmanJainWeirHumanGenetics2017.pdf	Final paper (is open access)	1,958Mb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

A genome-wide study of Hardy–Weinberg equilibrium with next generation sequence data

Visualitza/Obre

Explora