GENETICS | INVESTIGATION Genotyping Informatics and Quality Control for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort

نویسندگان

  • Mark N. Kvale
  • Stephanie Hesselson
  • Thomas J. Hoffmann
  • Yang Cao
  • David Chan
  • Sheryl Connell
  • Lisa A. Croen
  • Brad P. Dispensa
  • Jasmin Eshragh
  • Andrea Finn
  • Jeremy Gollub
  • Carlos Iribarren
  • Eric Jorgenson
  • Lawrence H. Kushi
  • Richard Lao
  • Yontao Lu
  • Dana Ludwig
  • Gurpreet K. Mathauda
  • William B. McGuire
  • Gangwu Mei
  • Sunita Miles
  • Michael Mittman
  • Mohini Patil
  • Charles P. Quesenberry
  • Dilrini Ranatunga
  • Sarah Rowell
  • Marianne Sadler
  • Lori C. Sakoda
  • Michael Shapero
  • Ling Shen
  • Tanu Shenoy
  • David Smethurst
  • Carol P. Somkin
  • Stephen K. Van Den Eeden
  • Lawrence Walter
  • Eunice Wan
  • Teresa Webster
  • Rachel A. Whitmer
  • Simon Wong
  • Chia Zau
  • Yiping Zhan
  • Catherine Schaefer
  • Pui-Yan Kwok
  • Neil Risch
چکیده

The Kaiser Permanente (KP) Research Program on Genes, Environment and Health (RPGEH), in collaboration with the University of California—San Francisco, undertook genome-wide genotyping of .100,000 subjects that constitute the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. The project, which generated .70 billion genotypes, represents the first large-scale use of the Affymetrix Axiom Genotyping Solution. Because genotyping took place over a short 14-month period, creating a near-real-time analysis pipeline for experimental assay quality control and final optimized analyses was critical. Because of the multi-ethnic nature of the cohort, four different ethnic-specific arrays were employed to enhance genome-wide coverage. All assays were performed on DNA extracted from saliva samples. To improve sample call rates and significantly increase genotype concordance, we partitioned the cohort into disjoint packages of plates with similar assay contexts. Using strict QC criteria, the overall genotyping success rate was 103,067 of 109,837 samples assayed (93.8%), with a range of 92.1–95.4% for the four different arrays. Similarly, the SNP genotyping success rate ranged from 98.1 to 99.4% across the four arrays, the variation depending mostly on how many SNPs were included as single copy vs. double copy on a particular array. The high quality and large scale of genotype data created on this cohort, in conjunction with comprehensive longitudinal data from the KP electronic health records of participants, will enable a broad range of highly powered genome-wide association studies on a diversity of traits and conditions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Genotyping Informatics and Quality Control for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort.

The Kaiser Permanente (KP) Research Program on Genes, Environment and Health (RPGEH), in collaboration with the University of California-San Francisco, undertook genome-wide genotyping of >100,000 subjects that constitute the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. The project, which generated >70 billion genotypes, represents the first large-scale use of the Affy...

متن کامل

Automated Assay of Telomere Length Measurement and Informatics for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort.

The Kaiser Permanente Research Program on Genes, Environment, and Health (RPGEH) Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort includes DNA specimens extracted from saliva samples of 110,266 individuals. Because of its relationship to aging, telomere length measurement was considered an important biomarker to develop on these subjects. To assay relative telomere length (...

متن کامل

Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort.

Using genome-wide genotypes, we characterized the genetic structure of 103,006 participants in the Kaiser Permanente Northern California multi-ethnic Genetic Epidemiology Research on Adult Health and Aging Cohort and analyzed the relationship to self-reported race/ethnicity. Participants endorsed any of 23 race/ethnicity/nationality categories, which were collapsed into seven major race/ethnici...

متن کامل

A Large Genome-Wide Association Study of Age-Related Hearing Impairment Using Electronic Health Records

Age-related hearing impairment (ARHI), one of the most common sensory disorders, can be mitigated, but not cured or eliminated. To identify genetic influences underlying ARHI, we conducted a genome-wide association study of ARHI in 6,527 cases and 45,882 controls among the non-Hispanic whites from the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. We identified two novel...

متن کامل

Investigating the Predictive Factors of the Quality of Life in the Staff of Shahid Beheshti University of Medical Sciences

Background and Objectives: Quality of life is a valuable indicator for measuring people's health. The purpose of this study was to determine the predictors of quality of life in the staff of Shahid Beheshti University of Medical Sciences, Tehran, Iran using the path analysis model.   Methods: This cross-sectional study was performed on subjects participating in the Health Cohort Study of Shah...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015