In-depth comparison of somatic point mutation callers based on different tumor next-generation sequencing depth data
نویسندگان
چکیده
Four popular somatic single nucleotide variant (SNV) calling methods (Varscan, SomaticSniper, Strelka and MuTect2) were carefully evaluated on the real whole exome sequencing (WES, depth of ~50X) and ultra-deep targeted sequencing (UDT-Seq, depth of ~370X) data. The four tools returned poor consensus on candidates (only 20% of calls were with multiple hits by the callers). For both WES and UDT-Seq, MuTect2 and Strelka obtained the largest proportion of COSMIC entries as well as the lowest rate of dbSNP presence and high-alternative-alleles-in-control calls, demonstrating their superior sensitivity and accuracy. Combining different callers does increase reliability of candidates, but narrows the list down to very limited range of tumor read depth and variant allele frequency. Calling SNV on UDT-Seq data, which were of much higher read-depth, discovered additional true-positive variations, despite an even more tremendous growth in false positive predictions. Our findings not only provide valuable benchmark for state-of-the-art SNV calling methods, but also shed light on the access to more accurate SNV identification in the future.
منابع مشابه
Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data
Next generation sequencing is extensively applied to catalogue somatic mutations in cancer, in research settings and increasingly in clinical settings for molecular diagnostics, guiding therapy decisions. Somatic variant callers perform paired comparisons of sequencing data from cancer tissue and matched normal tissue in order to detect somatic mutations. The advent of many new somatic variant ...
متن کاملA review of somatic single nucleotide variant calling algorithms for next-generation sequencing data
Detection of somatic mutations holds great potential in cancer treatment and has been a very active research field in the past few years, especially since the breakthrough of the next-generation sequencing technology. A collection of variant calling pipelines have been developed with different underlying models, filters, input data requirements, and targeted applications. This review aims to en...
متن کامل1169ppanel-based Next-generation Sequencing of Cancer-relevant Genes as Treatment Decision Support.
Aim: Knowledge about somatic driver mutations in cancer is crucial for decision-making in personalized therapy. Samples with low tumor content cannot be sequenced reliably with traditional methods. We have developed a complete pipeline to sequence more than 550 cancer-relevant genes in parallel using next-generation sequencing of FFPE or frozen tumor tissue. Interpretation of somatic mutations ...
متن کاملComparing somatic mutation-callers
Background: Somatic mutation-calling based on DNA from matched tumor-normal patient samples is one of the key tasks carried by many cancer genome projects. One such large-scale project is The Cancer Genome Atlas (TCGA), which is now routinely compiling catalogs of somatic mutations from hundreds of paired tumor-normal DNA exome-sequence data. Nonetheless, mutation calling is still very challeng...
متن کاملTargeted deep sequencing reveals no definitive evidence for somatic mosaicism in atrial fibrillation.
BACKGROUND Studies of ≤15 atrial fibrillation (AF) patients have identified atrial-specific mutations within connexin genes, suggesting that somatic mutations may account for sporadic cases of the arrhythmia. We sought to identify atrial somatic mutations among patients with and without AF using targeted deep next-generation sequencing of 560 genes, including genetic culprits implicated in AF, ...
متن کامل