Association mapping via regularized regression analysis of single-nucleotide-polymorphism haplotypes in variable-sized sliding windows.

نویسندگان

  • Yi Li
  • Wing-Kin Sung
  • Jian Jun Liu
چکیده

Large-scale haplotype association analysis, especially at the whole-genome level, is still a very challenging task without an optimal solution. In this study, we propose a new approach for haplotype association analysis that is based on a variable-sized sliding-window framework and employs regularized regression analysis to tackle the problem of multiple degrees of freedom in the haplotype test. Our method can handle a large number of haplotypes in association analyses more efficiently and effectively than do currently available approaches. We implement a procedure in which the maximum size of a sliding window is determined by local haplotype diversity and sample size, an attractive feature for large-scale haplotype analyses, such as a whole-genome scan, in which linkage disequilibrium patterns are expected to vary widely. We compare the performance of our method with that of three other methods--a test based on a single-nucleotide polymorphism, a cladistic analysis of haplotypes, and variable-length Markov chains--with use of both simulated and experimental data. By analyzing data sets simulated under different disease models, we demonstrate that our method consistently outperforms the other three methods, especially when the region under study has high haplotype diversity. Built on the regression analysis framework, our method can incorporate other risk-factor information into haplotype-based association analysis, which is becoming an increasingly necessary step for studying common disorders to which both genetic and environmental risk factors contribute.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Association of MMP-2 gene haplotypes with thoracic aortic dissection in chinese han population.

BACKGROUND Thoracic aortic dissection (TAD) is the most common life-threatening disorder, and MMP-2 is involved in TAD pathogenesis. Our purpose is to systematically evaluate the association of the MMP-2 gene with TAD risk in Chinese Han population. METHODS In our case-control study, we recruited 755 unrelated participants: 315 case participants with TAD and 440 controls. Twenty-two tag SNPs ...

متن کامل

Association of IGF1 gene haplotypes with high myopia in Chinese adults.

OBJECTIVE To investigate the association of high myopia with common single-nucleotide polymorphisms (SNPs) in the IGF1, IGFBP3, and IGFBP4 genes in a Chinese population. METHODS For our case-control study, we recruited 600 unrelated participants: 300 case participants with high myopia (-8.00 diopters or less) and 300 emmetropic controls (within ±1.00 diopter). Twenty-one tag SNPs were selecte...

متن کامل

Linkage disequilibrium mapping via cladistic analysis of single-nucleotide polymorphism haplotypes.

We present a novel approach to disease-gene mapping via cladistic analysis of single-nucleotide polymorphism (SNP) haplotypes obtained from large-scale, population-based association studies, applicable to whole-genome screens, candidate-gene studies, or fine-scale mapping. Clades of haplotypes are tested for association with disease, exploiting the expected similarity of chromosomes with recent...

متن کامل

DNA Polymorphisms at Candidate Gene Loci and Their Relation with Milk Production Traits in Murrah Buffalo (Bubalus bubalis)

DNA polymorphism within diacylglycerol transferase 2 (DGAT2) / monoacyl glycerol transferases 2 (MOGAT2), leptin and butyrophilin genes were analysed using PCR-SSCP in Murrah buffalo. The single strand conformation polymorphism (SSCP) analysis of amplified gene fragment in exon 5 of MOGAT2, exon 3 of leptin and intron 1 of butyrophilin gene revealed different patterns. A, B and C showed the fol...

متن کامل

Detecting haplotype effects in genomewide association studies.

The analysis of genomewide association studies requires methods that are both computationally feasible and statistically powerful. Given the large-scale collection of single nucleotide polymorphisms (SNPs), it is desirable to explore the information contained in their interrelationships. In particular, utilizing haplotypes rather than individual SNPs and accounting for correlations of polymorph...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • American journal of human genetics

دوره 80 4  شماره 

صفحات  -

تاریخ انتشار 2007