Challenging a bioinformatic tool’s ability to detect microbial contaminants using in silico whole genome sequencing data
نویسندگان
چکیده
High sensitivity methods such as next generation sequencing and polymerase chain reaction (PCR) are adversely impacted by organismal and DNA contaminants. Current methods for detecting contaminants in microbial materials (genomic DNA and cultures) are not sensitive enough and require either a known or culturable contaminant. Whole genome sequencing (WGS) is a promising approach for detecting contaminants due to its sensitivity and lack of need for a priori assumptions about the contaminant. Prior to applying WGS, we must first understand its limitations for detecting contaminants and potential for false positives. Herein we demonstrate and characterize a WGS-based approach to detect organismal contaminants using an existing metagenomic taxonomic classification algorithm. Simulated WGS datasets from ten genera as individuals and binary mixtures of eight organisms at varying ratios were analyzed to evaluate the role of contaminant concentration and taxonomy on detection. For the individual genomes the false positive contaminants reported depended on the genus, with Staphylococcus, Escherichia, and Shigella having the highest proportion of false positives. For nearly all binary mixtures the contaminant was detected in the in-silico datasets at the equivalent of 1 in 1,000 cells, though F. tularensis was not detected in any of the simulated contaminant mixtures and Y. pestis was only detected at the equivalent of one in 10 cells. Once a WGS method for detecting contaminants is characterized, it can be applied to evaluate microbial material purity, in efforts to ensure that contaminants are characterized in microbial materials used to validate pathogen detection assays, generate genome assemblies for database submission, and benchmark sequencing methods.
منابع مشابه
Using metagenomic methods to detect organismal contaminants in microbial materials
6 High sensitivity methods such as next generation sequencing and polymerase chain reaction (PCR) are adversely impacted by organismal and DNA contaminants. Current methods for detecting contaminants in microbial materials (genomic DNA and cultures) are not sensitive enough and require either a known or culturable contaminant. Therefore, high sensitivity methods not requiring a priori assumptio...
متن کاملBioinformatic Analyses of Whole-Genome Sequence Data in a Public Health Laboratory
The ability to generate high-quality sequence data in a public health laboratory enables the identification of pathogenic strains, the determination of relatedness among outbreak strains, and the analysis of genetic information regarding virulence and antimicrobial-resistance genes. However, the analysis of whole-genome sequence data depends on bioinformatic analysis tools and processes. Many p...
متن کاملIn Silico Genome-Genome Hybridization Values Accurately and Precisely Predict Empirical DNA-DNA Hybridization Values for Classifying Prokaryotes
For nearly 50 years microbiologists have been determining prokaryotic genome relatedness by means of nucleic acid reassociation kinetics. These methods, however, are technically challenging, difficult to reproduce, and-given the time and resources it takes to generate a single data-point-not cost effective. In the post genomic era, with the cost of sequencing whole prokaryotic genomes no longer...
متن کاملWhole-Genome Sequencing for Routine Pathogen Surveillance in Public Health: a Population Snapshot of Invasive Staphylococcus aureus in Europe
UNLABELLED The implementation of routine whole-genome sequencing (WGS) promises to transform our ability to monitor the emergence and spread of bacterial pathogens. Here we combined WGS data from 308 invasive Staphylococcus aureus isolates corresponding to a pan-European population snapshot, with epidemiological and resistance data. Geospatial visualization of the data is made possible by a gen...
متن کاملPEPR: pipelines for evaluating prokaryotic references
The rapid adoption of microbial whole genome sequencing in public health, clinical testing, and forensic laboratories requires the use of validated measurement processes. Well-characterized, homogeneous, and stable microbial genomic reference materials can be used to evaluate measurement processes, improving confidence in microbial whole genome sequencing results. We have developed a reproducib...
متن کامل