The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research.
نویسندگان
چکیده
Technological and scientific advances, stemming in large part from the Human Genome and HapMap projects, have made large-scale, genome-wide investigations feasible and cost effective. These advances have the potential to dramatically impact drug discovery and development by identifying genetic factors that contribute to variation in disease risk as well as drug pharmacokinetics, treatment efficacy, and adverse drug reactions. In spite of the technological advancements, successful application in biomedical research would be limited without access to suitable sample collections. To facilitate exploratory genetics research, we have assembled a DNA resource from a large number of subjects participating in multiple studies throughout the world. This growing resource was initially genotyped with a commercially available genome-wide 500,000 single-nucleotide polymorphism panel. This project includes nearly 6,000 subjects of African-American, East Asian, South Asian, Mexican, and European origin. Seven informative axes of variation identified via principal-component analysis (PCA) of these data confirm the overall integrity of the data and highlight important features of the genetic structure of diverse populations. The potential value of such extensively genotyped collections is illustrated by selection of genetically matched population controls in a genome-wide analysis of abacavir-associated hypersensitivity reaction. We find that matching based on country of origin, identity-by-state distance, and multidimensional PCA do similarly well to control the type I error rate. The genotype and demographic data from this reference sample are freely available through the NCBI database of Genotypes and Phenotypes (dbGaP).
منابع مشابه
Reference Values for Serum Total Cholesterol Concentrations Using Percentile Regression Model: A Population Study in Mashhad
Background and Purpose: Serum total cholesterol (TC) concentrations are affected by several factors including ethnicity, diet, geographic, and environmental determinants, and are related to another disease, including hypothyroidism, and renal and liver disease. It is associated with an increased risk of cardiovascular disease, particularly if associated with high levels of serum low-density lip...
متن کاملPosterior predictive checks to quantify lack-of-fit in admixture models of latent population structure.
Admixture models are a ubiquitous approach to capture latent population structure in genetic samples. Despite the widespread application of admixture models, little thought has been devoted to the quality of the model fit or the accuracy of the estimates of parameters of interest for a particular study. Here we develop methods for validating admixture models based on posterior predictive checks...
متن کاملAn Investigation on Population Structure and Inbreeding of Sangsari Sheep
The aim of this study was to describe inbreeding and population structure in Sangsari sheep breeding station. For this reason, data from 7028 Sangsari sheep which were collected during 1987-2014 in Sangsari sheep breeding station located near to Damghan city, Semnan province were used. Lambs born during 2010-2014 were considered as reference population. The genetic structure analysis of the pop...
متن کاملThe Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population
Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an asse...
متن کاملAnalysis of rs6725887 in the WD Repeat Protein 12 in Association With Coronary Artery Disease in Iranian Patients
Although genetic variants that affect susceptibility to coronary artery disease (CAD) have been greatly known, a number of these single nucleotide polymorphisms (SNPs) remain to be analyzed in populations with different ethnicities. CAD is influenced by numerous genetic, environmental, and lifestyle factors, and is an important reason for mortality around the globe. In this study, a novel SNP (...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- American journal of human genetics
دوره 83 3 شماره
صفحات -
تاریخ انتشار 2008