Increasing the Power of Association Studies by Imputation-based Sparse Tag Snp Selection
نویسنده
چکیده
In lowand medium-budget association studies, a limited number of tag SNPs are selected out of a large set of available SNPs previously typed in an initial cohort. These tag SNPs are then typed in a larger set of control and affected individuals. Current association studies pick the set of tag SNPs based on the correlation criterion. Here we show that association studies that use tag SNPs selected according to their imputation accuracy are more powerful than those relying on tag SNPs selected by the correlation criterion. The advantage is particularly striking when the set of tag SNPs is sparse; thus, picking tag SNPs to maximize the imputation accuracy will increase the effectiveness of future association studies without additional cost.
منابع مشابه
The Pattern of Linkage Disequilibrium in Livestock Genome
Linkage disequilibrium (LD) is bases of genomic selection, genomic marker imputation, marker assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole genome association studies. The Particular alleles at closed loci have a tendency to be co-inherited. In linked loci this pattern leads to association between alleles in population which is known as LD. Two metr...
متن کاملEfficient association study design via power-optimized tag SNP selection.
Discovering statistical correlation between causal genetic variation and clinical traits through association studies is an important method for identifying the genetic basis of human diseases. Since fully resequencing a cohort is prohibitively costly, genetic association studies take advantage of local correlation structure (or linkage disequilibrium) between single nucleotide polymorphisms (SN...
متن کاملHaplotype block partitioning and tag SNP selection using genotype data and their applications to association studies.
Recent studies have revealed that linkage disequilibrium (LD) patterns vary across the human genome with some regions of high LD interspersed by regions of low LD. A small fraction of SNPs (tag SNPs) is sufficient to capture most of the haplotype structure of the human genome. In this paper, we develop a method to partition haplotypes into blocks and to identify tag SNPs based on genotype data ...
متن کاملGenome-wide selection of tag SNPs using multiple-marker correlation
MOTIVATIONS The tag SNP approach is a valuable tool in whole genome association studies, and a variety of algorithms have been proposed to identify the optimal tag SNP set. Currently, most tag SNP selection is based on two-marker (pairwise) linkage disequilibrium (LD). Recent literature has shown that multiple-marker LD also contains useful information that can further increase the genetic cove...
متن کاملImputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method
The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...
متن کامل