Customisation of the Exome Data Analysis Pipeline Using a Combinatorial Approach
نویسندگان
چکیده
The advent of next generation sequencing (NGS) technologies have revolutionised the way biologists produce, analyse and interpret data. Although NGS platforms provide a cost-effective way to discover genome-wide variants from a single experiment, variants discovered by NGS need follow up validation due to the high error rates associated with various sequencing chemistries. Recently, whole exome sequencing has been proposed as an affordable option compared to whole genome runs but it still requires follow up validation of all the novel exomic variants. Customarily, a consensus approach is used to overcome the systematic errors inherent to the sequencing technology, alignment and post alignment variant detection algorithms. However, the aforementioned approach warrants the use of multiple sequencing chemistry, multiple alignment tools, multiple variant callers which may not be viable in terms of time and money for individual investigators with limited informatics know-how. Biologists often lack the requisite training to deal with the huge amount of data produced by NGS runs and face difficulty in choosing from the list of freely available analytical tools for NGS data analysis. Hence, there is a need to customise the NGS data analysis pipeline to preferentially retain true variants by minimising the incidence of false positives and make the choice of right analytical tools easier. To this end, we have sampled different freely available tools used at the alignment and post alignment stage suggesting the use of the most suitable combination determined by a simple framework of pre-existing metrics to create significant datasets.
منابع مشابه
Whole Exome Sequencing Revealed a Novel GJB1 Pathogenic Variant and a Rare BSCL2 Mutation in Two Iranian Large Pedigrees with Multiple Affected Cases of Charcot-Marie-Tooth
Charcot-Marie-Tooth disease (CMT) is the most common hereditary neuropathy of the peripheral nervous system with a wide range of severity and age of onset. CMT patients share similar phenotypes which make it often impossible to identify the disease types based on clinical presentation and electrophysiological studies alone. In recent years, novel genetic diagnostic approaches such as whole exom...
متن کاملA three-caller pipeline for variant analysis of cancer whole-exome sequencing data
Rapid advancements in next generation sequencing (NGS) technologies, coupled with the dramatic decrease in cost, have made NGS one of the leading approaches applied in cancer research. In addition, it is increasingly used in clinical practice for cancer diagnosis and treatment. Somatic (cancer‑only) single nucleotide variants and small insertions and deletions (indels) are the simplest classes ...
متن کاملA software pipeline for the discovery of variations in exome sequencing projects
Motivations The recent advances in the technologies and strategies for DNA sequencing have dramatically facilitated the identification of novel human genes associated with rare and common diseases [1]. However novel methods are needed to identify high-quality variations among all the ones identified in a single experiment. The most successful approach to identify disease-causing mutations consi...
متن کاملWhole Exome Sequencing for Mutation Screening in Hemophagocytic Lymphohistiocytosis
Background: Hemophagocytic lymphohistiocytosis (HLH) is an immune system disorder characterized by uncontrolled hyper-inflammation owing to hypercytokinemia from the activated but ineffective cytotoxic cells. Establishing a correct diagnosis for HLH patients due to the similarity of this disease with other conditions like malignant lymphoma and leukemia and similarity among its two forms is dif...
متن کاملOperation Analysis of Rotary Tools of Compressor Station Using Exergy Approach
In this study, operation of compressor station has been investigated by exergy approach. Exergy analysis is a thermodynamic method which shows the irreversibility of a system quantitatively. Gas compressors are used to compensate the pressure drop along the gas pipeline significantly. The compression process causes temperature rise of gas; in this regard gas cooler is applied to reduce the temp...
متن کامل