Prior Knowledge based mutation prioritization towards causal variant finding in rare disease

Authors

  • Vasundhara Dehiya
  • Jaya Thomas
  • Lee Sael
Abstract

How do we determine mutational effects in exome sequencing data with little or no statistical evidence? Can protein structural information fill the gap left by insufficient statistical evidence? In this work, we answer these two questions with the goal of determining the pathogenic effects of rare variants in rare disease. We take the approach of determining the importance of point mutation loci by focusing on protein structure features. The proposed structure-based features contain geometric, physicochemical, and functional information about the mutation loci and about the structural neighbors of those loci. Structure-based features trained on 80% of HumDiv and tested on 20% of HumDiv and on ClinVar datasets showed high levels of discernibility between pathogenic and benign mutation effects: F scores of 0.71 and 0.68, respectively, using a multi-layer perceptron. Combining structure- and sequence-based features further improves the accuracy: F scores of 0.86 (HumDiv) and 0.75 (ClinVar). Also, careful examination of the rare variants in rare disease cases showed that structure-based features are important in discerning the importance of variant loci.
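The evaluation protocol mentioned in the abstract (an 80%/20% split and an F score over pathogenic versus benign labels) can be illustrated with a minimal sketch. It rests on assumptions not stated in the abstract: the F score is taken to be the balanced F1 (beta = 1), the split is a simple seeded shuffle, and the helper names `f_score` and `split_80_20` as well as the toy labels are hypothetical. None of this is the authors' code or data.

```python
import random


def f_score(y_true, y_pred, positive=1, beta=1.0):
    """F-beta score for one positive class; beta=1 gives the usual F1.

    Assumption: the abstract's "F score" is F1; the source does not say.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def split_80_20(items, seed=0):
    """Shuffle a copy of items and split it into 80% train / 20% test."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = int(0.8 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


# Toy labels only (1 = pathogenic, 0 = benign); not data from the paper.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]
score = f_score(y_true, y_pred)

# Ten hypothetical variant indices split into train and test portions.
train, test = split_80_20(range(10))
print(score, len(train), len(test))
```

With these toy labels there are three true positives, one false positive, and one false negative, so precision and recall are both 0.75 and the F1 value equals 0.75 as well.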


Similar resources

Integrated rare variant-based risk gene prioritization in disease case-control sequencing studies

Rare variants of major effect play an important role in human complex diseases and can be discovered by sequencing-based genome-wide association studies. Here, we introduce an integrated approach that combines the rare variant association test with gene network and phenotype information to identify risk genes implicated by rare variants for human complex diseases. Our data integration method fo...


Reducing the search space for causal genetic variants with VASP

MOTIVATION Increasingly, cost-effective high-throughput DNA sequencing technologies are being utilized to sequence human pedigrees to elucidate the genetic cause of a wide variety of human diseases. While numerous tools exist for variant prioritization within a single genome, the ability to concurrently analyze variants within pedigrees remains a challenge, especially should there be no prior i...


Genome analysis Reducing the search space for causal genetic variants with VASP

Motivation: Increasingly, cost-effective high-throughput DNA sequencing technologies are being utilized to sequence human pedigrees to elucidate the genetic cause of a wide variety of human diseases. While numerous tools exist for variant prioritization within a single genome, the ability to concurrently analyze variants within pedigrees remains a challenge, especially should there be no prior ...


Whole exome sequencing identifies a causal RBM20 mutation in a large pedigree with familial dilated cardiomyopathy.

BACKGROUND Whole exome sequencing is a powerful technique for Mendelian disease gene discovery. However, variant prioritization remains a challenge. We applied whole exome sequencing to identify the causal variant in a large family with familial dilated cardiomyopathy of unknown pathogenesis. METHODS AND RESULTS A large family with autosomal dominant, familial dilated cardiomyopathy was ident...


Nonsense Mutation in Coiled-Coil Domain Containing 151 Gene (CCDC151) Causes Primary Ciliary Dyskinesia

Primary ciliary dyskinesia (PCD) is an autosomal-recessive disorder characterized by impaired ciliary function that leads to subsequent clinical phenotypes such as chronic sinopulmonary disease. PCD is also a genetically heterogeneous disorder with many single gene mutations leading to similar clinical phenotypes. Here, we present a novel PCD causal gene, coiled-coil domain containing 151 (CCDC...



Journal title:
  • CoRR

Volume abs/1710.03399

Publication date 2017