Title: Powerful SNP Set Analysis for Case-Control Genome Wide Association Studies
Running Title: Powerful SNP Set Analysis

Authors

  • Michael C. Wu
  • Peter Kraft
  • Michael P. Epstein
  • Deanne M. Taylor
  • Stephen J. Chanock
  • David J. Hunter
  • Xihong Lin
Abstract

Genome wide association studies (GWAS) have emerged as popular tools for identifying genetic variants that are associated with disease risk. Standard analysis of a case-control GWAS involves assessing the association between each individual genotyped SNP and disease risk. However, this approach suffers from limited reproducibility and difficulties in detecting multi-SNP and epistatic effects. As an alternative analytical strategy, we propose grouping SNPs together into SNP sets based on proximity to genomic features such as genes or haplotype blocks, and then testing the joint effect of each SNP set. Each SNP set is tested with the logistic kernel machine based test, which rests on a statistical framework that allows for flexible modeling of epistatic and nonlinear SNP effects. This flexibility, together with the ability to adjust naturally for covariate effects, makes our test appealing compared with individual SNP tests and existing multi-marker tests. Using simulated data based on the International HapMap Project, we show that SNP set testing can have improved power over standard individual SNP analysis under a wide range of settings. In particular, we find that our approach has higher power than individual SNP analysis when the median correlation between the disease susceptibility variant and the genotyped SNPs is moderate to high. When the correlation is low, both individual SNP analysis and SNP set analysis tend to have low power. We apply SNP set analysis to the CGEMS breast cancer GWAS discovery phase data.
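As a rough illustration of what testing the joint effect of a SNP set can look like, the sketch below computes a kernel-based score-type statistic Q = (y - mu0)^T K (y - mu0) / 2 for one SNP set. It rests on several assumptions that the abstract does not state: a linear kernel K = G G^T, an intercept-only null model (no covariates), and a permutation p-value swapped in for whatever null distribution the authors use. The function names are hypothetical and this is not the authors' implementation.

```python
import numpy as np


def linear_kernel(G):
    """Linear kernel K = G G^T for an n x p genotype matrix.

    Assumption: genotypes are coded as minor-allele counts (0, 1, 2).
    """
    G = np.asarray(G, dtype=float)
    return G @ G.T


def snp_set_score_statistic(y, K):
    """Score-type statistic Q = (y - mu0)^T K (y - mu0) / 2.

    Assumption: the null model has an intercept only, so the fitted
    null probability mu0 is simply the case fraction mean(y).
    """
    y = np.asarray(y, dtype=float)
    resid = y - y.mean()
    return float(resid @ K @ resid) / 2.0


def permutation_pvalue(y, G, n_perm=999, seed=0):
    """Permutation p-value for Q (a stand-in for an analytic null distribution).

    Shuffling case-control labels is only appropriate here because the
    sketch has no covariates to adjust for.
    """
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    K = linear_kernel(G)
    q_obs = snp_set_score_statistic(y, K)
    exceed = 0
    for _ in range(n_perm):
        if snp_set_score_statistic(rng.permutation(y), K) >= q_obs:
            exceed += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return q_obs, (exceed + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Toy data: 4 subjects, one SNP set containing a single SNP.
    y_toy = [1, 0, 1, 0]
    G_toy = [[2], [0], [2], [0]]
    q, p = permutation_pvalue(y_toy, G_toy, n_perm=199)
    print(q, p)
```

In a genome-wide analysis, one such test would be run per SNP set (for example, per gene), so the multiple-testing burden scales with the number of sets rather than the number of SNPs. A non-linear kernel could be substituted for `linear_kernel` to capture the epistatic effects the abstract mentions.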

Similar resources

Weighted SNP Set Analysis in Genome-Wide Association Study

Genome-wide association studies (GWAS) are popular for identifying genetic variants that are associated with disease risk. To address the disadvantages of single-locus association analysis, many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously. Kernel machine based SNP set analysis is more powerful than single locus a...

Genome-wide Association Study to Identify Genes and Biological Pathways Associated with Type Traits in Cattle using Pathway Analysis

Extended Abstract. Introduction and Objective: Type traits describing the skeletal characteristics of an animal are moderately to strongly genetically correlated with other economically important traits in cattle, including fertility, longevity and carcass traits. The present study aimed to conduct a genome-wide association study (GWAS) based on gene-set enrichment analysis for identifying the ...

SNP Set Association Analysis for Genome-Wide Association Studies

The genome-wide association study (GWAS) is a promising approach for identifying common genetic variants associated with diseases on the basis of millions of single nucleotide polymorphisms (SNPs). To avoid the low power caused by excessive correction for multiple comparisons in single-locus association studies, some methods have been proposed that group SNPs together into a SNP set based on genomic feat...

Identifying and exploiting trait-relevant tissues with multiple functional annotations in genome-wide association studies

Genome-wide association studies (GWASs) have identified many disease associated loci, the majority of which have unknown biological functions. Understanding the mechanism underlying trait associations requires identifying trait-relevant tissues and investigating associations in a trait-specific fashion. Here, we extend the widely used linear mixed model to incorporate multiple SNP functional an...

Publication date: 2010