Improving Indel Detection Specificity of the Ion Torrent PGM Benchtop Sequencer
نویسندگان
چکیده
The emergence of benchtop sequencers has made clinical genetic testing using next-generation sequencing more feasible. Ion Torrent's PGM™ is one such benchtop sequencer that shows clinical promise in detecting single nucleotide variations (SNVs) and microindel variations (indels). However, the large number of false positive indels caused by the high frequency of homopolymer sequencing errors has impeded PGM™'s usage for clinical genetic testing. An extensive analysis of PGM™ data from the sequencing reads of the well-characterized genome of the Escherichia coli DH10B strain and sequences of the BRCA1 and BRCA2 genes from six germline samples was done. Three commonly used variant detection tools, SAMtools, Dindel, and GATK's Unified Genotyper, all had substantial false positive rates for indels. By incorporating filters on two major measures we could dramatically improve false positive rates without sacrificing sensitivity. The two measures were: B-Allele Frequency (BAF) and VARiation of the Width of gaps and inserts (VARW) per indel position. A BAF threshold applied to indels detected by UnifiedGenotyper removed ~99% of the indel errors detected in both the DH10B and BRCA sequences. The optimum BAF threshold for BRCA sequences was determined by requiring 100% detection sensitivity and minimum false discovery rate, using variants detected from Sanger sequencing as reference. This resulted in 15 indel errors remaining, of which 7 indel errors were removed by selecting a VARW threshold of zero. VARW specific errors increased in frequency with higher read depth in the BRCA datasets, suggesting that homopolymer-associated indel errors cannot be reduced by increasing the depth of coverage. Thus, using a VARW threshold is likely to be important in reducing indel errors from data with higher coverage. In conclusion, BAF and VARW thresholds provide simple and effective filtering criteria that can improve the specificity of indel detection in PGM™ data without compromising sensitivity.
منابع مشابه
Utilization of Benchtop Next Generation Sequencing Platforms Ion Torrent PGM and MiSeq in Noninvasive Prenatal Testing for Chromosome 21 Trisomy and Testing of Impact of In Silico and Physical Size Selection on Its Analytical Performance.
OBJECTIVES The aims of this study were to test the utility of benchtop NGS platforms for NIPT for trisomy 21 using previously published z score calculation methods and to optimize the sample preparation and data analysis with use of in silico and physical size selection methods. METHODS Samples from 130 pregnant women were analyzed by whole genome sequencing on benchtop NGS systems Ion Torren...
متن کامل16S rRNA gene sequencing on a benchtop sequencer: accuracy for identification of clinically important bacteria
AIMS Test the choice of 16S rRNA gene amplicon and data analysis method on the accuracy of identification of clinically important bacteria utilizing a benchtop sequencer. METHODS AND RESULTS Nine 16S rRNA amplicons were tested on an Ion Torrent PGM to identify 41 strains of clinical importance. The V1-V2 region identified 40 of 41 isolates to the species level. Three data analysis methods wer...
متن کاملComparison of Ion Personal Genome Machine Platforms for the Detection of Variants in BRCA1 and BRCA2
PURPOSE Transition to next generation sequencing (NGS) for BRCA1/BRCA2 analysis in clinical laboratories is ongoing but different platforms and/or data analysis pipelines give different results resulting in difficulties in implementation. We have evaluated the Ion Personal Genome Machine (PGM) Platforms (Ion PGM, Ion PGM Dx, Thermo Fisher Scientific) for the analysis of BRCA1/2. MATERIALS AND...
متن کاملChoosing a Benchtop Sequencing Machine to Characterise Helicobacter pylori Genomes
The fully annotated genome sequence of the European strain, 26695 was first published in 1997 and, in 1999, it was directly compared to the USA isolate J99, promoting two standard laboratory isolates for Helicobacter pylori (H. pylori) research. With the genomic scaffolds available from these important genomes and the advent of benchtop high-throughput sequencing technology, a bacterial genome ...
متن کاملVersatile ion S5XL sequencer for targeted next generation sequencing of solid tumors in a clinical laboratory
BACKGROUND Next generation sequencing based tumor tissue genotyping involves complex workflow and a relatively longer turnaround time. Semiconductor based next generation platforms varied from low throughput Ion PGM to high throughput Ion Proton and Ion S5XL sequencer. In this study, we compared Ion PGM and Ion Proton, with a new Ion S5XL NGS system for workflow scalability, analytical sensitiv...
متن کامل