Novel tree-based method to generate markers from rare variant data
Authors
Abstract
Existing methods for analyzing rare variant data focus on collapsing a group of rare variants into a single common variant. The collapsing is based on an intuitive function of the rare variant genotype information, such as an indicator function or a weighted sum. It is more natural, however, to take into account the single-nucleotide polymorphism (SNP) interactions informed directly by the data. We propose a novel tree-based method that automatically detects SNP interactions and generates candidate markers from the original pool of rare variants. In addition, we take advantage of the 200 phenotype replications in the Genetic Analysis Workshop 17 data to assess the candidate markers by means of repeated logistic regressions. This new approach shows potential for rare variant analysis. We correctly identify the association between gene FLT1 and phenotype Affect, although our results also contain false positives. Our analyses are performed without knowledge of the underlying simulating model.
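For readers unfamiliar with the collapsing baselines mentioned in the abstract, the following is a minimal sketch of the two functions it names: an indicator function and a weighted sum. It does not implement the proposed tree-based method. The function names, the toy genotype matrix, and the weights are all hypothetical and chosen only for illustration; genotypes are assumed to be coded as minor-allele counts (0, 1, or 2).

```python
from typing import List, Sequence


def collapse_indicator(genotypes: Sequence[Sequence[int]]) -> List[int]:
    """Collapse each individual's rare variants into one binary marker.

    The marker is 1 if the individual carries at least one minor allele
    at any variant in the group, and 0 otherwise.
    """
    return [1 if any(g > 0 for g in row) else 0 for row in genotypes]


def collapse_weighted_sum(
    genotypes: Sequence[Sequence[int]], weights: Sequence[float]
) -> List[float]:
    """Collapse each individual's rare variants into a weighted allele count."""
    return [sum(w * g for w, g in zip(weights, row)) for row in genotypes]


# Toy data (hypothetical): 4 individuals (rows) x 3 rare variants (columns).
genotypes = [
    [0, 0, 0],
    [1, 0, 0],
    [0, 2, 1],
    [0, 0, 1],
]
# Hypothetical per-variant weights; real analyses often derive these
# from allele frequencies.
weights = [1.0, 0.5, 2.0]

indicator = collapse_indicator(genotypes)
weighted = collapse_weighted_sum(genotypes, weights)
print(indicator)
print(weighted)
```

Both functions treat the variants as exchangeable or additively weighted, which is the limitation the abstract points to: neither can represent an interaction between specific SNPs.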
Similar resources
Whole Exome Sequencing Revealed a Novel GJB1 Pathogenic Variant and a Rare BSCL2 Mutation in Two Iranian Large Pedigrees with Multiple Affected Cases of Charcot-Marie-Tooth
Charcot-Marie-Tooth disease (CMT) is the most common hereditary neuropathy of the peripheral nervous system with a wide range of severity and age of onset. CMT patients share similar phenotypes which make it often impossible to identify the disease types based on clinical presentation and electrophysiological studies alone. In recent years, novel genetic diagnostic approaches such as whole exom...
A LASSO-based approach to analyzing rare variants in genetic association studies
Genetic markers with rare variants are spread out in the genome, making it necessary and difficult to consider them in genetic association studies. Consequently, wisely combining rare variants into "composite" markers may facilitate meaningful analyses. In this paper, we propose a novel approach of analyzing rare variant data by incorporating the least absolute shrinkage and selection operator ...
Discrimination of ADHD Subtypes Using Decision Tree on Behavioral, Neuropsychological, and Neural Markers
Introduction: Attention-Deficit/Hyperactivity Disorder (ADHD) is a well-known neurodevelopmental disorder. Diagnosis and treatment of ADHD can often lead to a developmental trajectory toward positive results. The present study aimed at implementing the decision tree method to recognize children with and without ADHD, as well as ADHD subtypes. Methods: In the present study, the subjects includ...
Identification of a Novel Splice Site Mutation in RUNX2 Gene in a Family with Rare Autosomal Dominant Cleidocranial Dysplasia
Introduction: Pathogenic variants of RUNX2, a gene that encodes an osteoblast-specific transcription factor, have been shown as the cause of CCD, which is a rare hereditary skeletal and dental disorder with dominant mode of inheritance and a broad range of clinical variability. Due to the relative lack of clinical complications resulting in CCD, the medical diagnosis of this disorder is challen...
P-215: Discovery of A Novel APA Variant of A Human Potential Gene Based on Expressed Sequenced Tags Analysis
Background: Expressed sequence tags (ESTs) are sequences of cDNA fragments prepared from different tissue sources. There are over one million of these sequences in the publicly available database, and these sequences are believed to represent more than half of all human genes. The ESTs belong to different cDNA libraries, each prepared from one particular cell type, organ, or tumor. Therefore, th...