Re-evaluating data quality of dog mitochondrial, Y chromosomal, and autosomal SNPs genotyped by SNP array
Abstract
Quality deficiencies in single nucleotide polymorphism (SNP) analyses have important implications. We used missingness rates to investigate the quality of a recently published dataset containing 424 mitochondrial, 211 Y chromosomal, and 160 432 autosomal SNPs generated by a semicustom Illumina SNP array from 5 392 dogs and 14 grey wolves. Overall, the individual missingness rate for mitochondrial SNPs was ~43.8%, with 980 (18.1%) individuals completely lacking mitochondrial SNP genotypes (missingness rate=1). In males, the genotype missingness rate for Y chromosomal SNPs was ~28.8%, with 374 males recording rates above 0.96. Mitochondrial SNP genotyping also failed completely in these 374 males, which is indicative of a batch effect. Individual missingness rates for autosomal markers were greater than zero but less than 0.5. No mitochondrial or Y chromosomal SNP was genotyped in every individual (locus missingness rate=0), whereas 5.9% of autosomal SNPs had a locus missingness rate=1. The high missingness rates and possible batch effect show that caution and rigorous measures are vital when genotyping and analyzing SNP array data for domestic animals. Further improvements of these arrays will be helpful to future studies.
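The two quantities used throughout the abstract can be illustrated with a minimal sketch. It assumes genotypes are held in a small in-memory table with one row per individual and one column per SNP, and that a failed call is stored as `None`. The function names and the toy data are hypothetical and are not taken from the study. The individual missingness rate is the fraction of SNPs with no call in one individual, and the locus missingness rate is the fraction of individuals with no call at one SNP.

```python
from typing import List, Optional

# One genotype call; None marks a missing call (an assumption of this sketch).
Genotype = Optional[str]


def individual_missingness(calls: List[List[Genotype]]) -> List[float]:
    """Fraction of missing calls per individual (one value per row)."""
    return [sum(g is None for g in row) / len(row) for row in calls]


def locus_missingness(calls: List[List[Genotype]]) -> List[float]:
    """Fraction of missing calls per SNP (one value per column)."""
    n_individuals = len(calls)
    n_loci = len(calls[0])
    return [
        sum(row[j] is None for row in calls) / n_individuals
        for j in range(n_loci)
    ]


# Toy data, invented for illustration: 3 individuals x 4 SNPs.
toy = [
    ["AA", "AG", None, "GG"],
    [None, None, None, None],   # individual with completely failed genotyping
    ["AA", None, None, "AG"],
]

ind_rates = individual_missingness(toy)
locus_rates = locus_missingness(toy)

# Individuals with a missingness rate of 1 have no usable calls at all.
fully_missing = [i for i, rate in enumerate(ind_rates) if rate == 1.0]
```

In this toy table the second individual has a missingness rate of 1 and the third SNP has a locus missingness rate of 1. These are the same two kinds of complete failure that the abstract reports for 980 individuals and for 5.9% of autosomal SNPs.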
Related articles
The association of genetic polymorphisms of bone formation genes with canine hip dysplasia
Background: Canine hip dysplasia (CHD) is an orthopedic disorder characterized by abnormal laxity of the hip joint. It is considered multifactorial and polygenic and affects predominantly medium and large sized dog breeds. Aims: The aim of this study was to identify CHD associated polymorphisms in chromosomal regions on CFA19, CFA24, CFA26, and CFA34. M...
I-44: Mutagenesis during Embryogenesis
We developed several novel tools to genome wide screen for CNVs and SNPs in single cells. When applied to cleavage stage embryos from young fertile couples we discovered, unexpectedly, an extremely high incidence of chromosomal instability, a hallmark of tumorigenesis (Vanneste et al., Nature Medicine, 2009; Vanneste et al., Hum.Reprod., 2011). Not only mosaicisms for whole chromosome aneuploid...
Estimation of effective population size using single-nucleotide polymorphism (SNP) data in Jeju horse
This study was conducted to estimate the effective population size using SNP data from 240 Jeju horses that had raced at the Jeju racing park. Of the total 61,746 genotyped autosomal SNPs, 17,320 (28.1%) SNPs (missing genotype rate of >10%, minor allele frequency of <0.05 and Hardy-Weinberg equilibrium test P-value of <10^-6) were excluded after quality control processes. SNPs on the X and Y ch...
Extensive population structure in San, Khoe, and mixed ancestry populations from southern Africa revealed by 44 short 5-SNP haplotypes.
The San and Khoe people currently represent remnant groups of a much larger and widely distributed population of hunter-gatherers and pastoralists who had exclusive occupation of southern Africa before the arrival of Bantu-speaking groups in the past 1,200 years and sea-borne immigrants within the last 350 years. Genetic studies [mitochondrial deoxyribonucleic acid (DNA) and Y-chromosome] condu...
Imputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method
The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance of Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...