Cladistic analysis of genotype data-application to GAW15 Problem 3

نویسندگان

  • Hsuan Jung
  • Keyan Zhao
  • Paul Marjoram
چکیده

Given the increasing size of modern genetic data sets and, in particular, the move towards genome-wide studies, there is merit in considering analyses that gain computational efficiency by being more heuristic in nature. With this in mind, we present results of cladistic analyses methods on the Genetic Analysis Workshop 15 Problem 3 simulated data (answers known). Our analysis attempts to capture similarities between individuals using a series of trees, and then looks for regions in which mutations on those trees can successfully explain a phenotype of interest. Existing varieties of such algorithms assume haplotypes are known, or have been inferred, an assumption that is often unrealistic for genome-wide data. We therefore present an extension of these methods that can successfully analyze genotype, rather than haplotype, data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining gene networks with application to GAW15 Problem 1

The Genetic Analysis Workshop 15 (GAW15) Problem 1 contained baseline expression levels of 8793 genes in immortalized B cells from 194 individuals in 14 Centre d'Etude du Polymorphisme Humain (CEPH) Utah pedigrees. Previous analysis of the data showed linkage and association and evidence of substantial individual variations. In particular, correlation was examined on expression levels of 31 gen...

متن کامل

Genome-wide association tests by two-stage approaches with unified analysis of families and unrelated individuals

Multiple testing is a problem in genome-wide or region-wide association studies. In this report, we consider a study design given by the Genetic Analysis Workshop 15 (GAW15) Problem 3 - nuclear families (parents with their affected children) and unrelated controls. Based on this design, we propose three two-stage approaches to deal with the problem of multiple testing. The tests in the first st...

متن کامل

Joint linkage and imprinting analyses of GAW15 rheumatoid arthritis and gene expression data

BACKGROUND Genomic imprinting is a mechanism in which the expression of a gene copy depends upon the sex of the parent from which it was inherited. This mechanism is now well recognized in humans, and the deregulation of imprinted genes has been implicated in a number of diseases. In this study, we performed a genome-wide joint linkage and imprinting scan using two data sets provided by Genetic...

متن کامل

Cladistic analysis: its applications in association studies of complex diseases.

INTRODUCTION With the increase in genotype data generated by high throughput typing technologies, there is currently a lack of complexity-oriented analytical methods that can maximise the information obtained from these raw data for the study of complex diseases. We introduce the cladistic analysis that is traditionally applied in evolution studies and taxonomy, to specify relevant comparisons ...

متن کامل

Genome-wide sparse canonical correlation of gene expression with genotypes

There is a growing interest in studying natural variation in human gene expression. Studies mapping genetic determinants of expression profiles are often carried out considering the expression of one gene at a time, an approach that is computationally intensive and may be prone to high false-discovery rate because the number of genes under consideration often exceeds tens of thousands. We prese...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • BMC Proceedings

دوره 1  شماره 

صفحات  -

تاریخ انتشار 2007