Interchromosomal LD in HapMap Genotypes
نویسنده
چکیده
Then, using all of these tags were used to calculate intrachromosomal LD within each panel. Analysis was done using an EM algorithm based on code from Goncalo Abecasis. Marker pairs with r2 0.6 were stored for further analysis. As shown in Table 2, the number of distinct pairs of SNPs with r2 0.6 and 0.8 are shown. The numbers are shown for the original tag set. Additionally, to facilitate cross panel comparisons, all the SNPs which were captured by the original tag sets, were also analyzed for high interchromosomal LD. This extension was done because we wanted to investigate if any SNP pairs displayed high interchromosomal LD in multiple analysis panels. With initial analysis limited to tag SNPs, while specific regions might overlap, the exact SNPs used as tags might not be the same. The higher number of SNP pairs seen for the YRI panel, particularly at the lower threshold, presumably derives from the larger number of polymorphic SNPs in this panel, and is artifactual. Since it was presumed that a large source of observed interchromosomal LD might come from SNPs which were mismapped, the maximum local r2 value was determined. It was presumed that if (at least) one of a SNP pair was mismapped, then they would be much less likely to display high local LD for both SNPs.
منابع مشابه
A second generation human haplotype map of over 3.1 million SNPs (Supplementary Information)
S1 The density of common SNPs in the Phase II HapMap and the assembled human genome S2 Analysis of data quality S2.1 Analysis of amplicon structure to genotyping error S2.2 Analysis of genotype discordance from overlap with Seattle SNPs S2.3 Analysis of genotype discordance from fosmid end sequences S2.4 Analysis of monomorphism/polymorphism discrepancies S2.5 Interchromosomal LD S3. Analysis o...
متن کاملAssessing Accuracy of Genotype Imputation in American Indians
BACKGROUND Genotype imputation is commonly used in genetic association studies to test untyped variants using information on linkage disequilibrium (LD) with typed markers. Imputing genotypes requires a suitable reference population in which the LD pattern is known, most often one selected from HapMap. However, some populations, such as American Indians, are not represented in HapMap. In the pr...
متن کاملInvestigation into the Ability of SNP Chipsets and Microsatellites to Detect Association with a Disease Locus
We wished to investigate the ability of different SNP chipsets to detect association with a disease and to investigate the linkage disequilibrium (LD) relationships between microsatellites and nearby SNPs in order to assess their potential usefulness to detect association. SNP genotypes were obtained from HapMap and microsatellite genotypes from CEPH. 5000 SNPs were simulated as disease genes w...
متن کاملNavigating the HapMap
With the availability of the HapMap--a resource which describes common patterns of linkage disequilibrium (LD) in four different human population samples, we now have a powerful tool to help dissect the role of genetic variation in the biology of the genome. HapMap is entirely complimentary to the human genome map and so it is particularly fitting that it should be viewed in a full genomic cont...
متن کاملGenome-wide selection of tag SNPs using multiple-marker correlation
MOTIVATIONS The tag SNP approach is a valuable tool in whole genome association studies, and a variety of algorithms have been proposed to identify the optimal tag SNP set. Currently, most tag SNP selection is based on two-marker (pairwise) linkage disequilibrium (LD). Recent literature has shown that multiple-marker LD also contains useful information that can further increase the genetic cove...
متن کامل