Ironing out the wrinkles in the rare biosphere through improved OTU clustering
Abstract
Deep sequencing of PCR amplicon libraries facilitates the detection of low-abundance populations in environmental DNA surveys of complex microbial communities. At the same time, deep sequencing can lead to overestimates of microbial diversity through the generation of low-frequency, error-prone reads. Even with sequencing error rates below 0.005 per nucleotide position, the common method of generating operational taxonomic units (OTUs) by multiple sequence alignment and complete-linkage clustering significantly increases the number of predicted OTUs and inflates richness estimates. We show that a 2% single-linkage preclustering methodology followed by an average-linkage clustering based on pairwise alignments more accurately predicts expected OTUs in both single and pooled template preparations of known taxonomic composition. This new clustering method can reduce the OTU richness in environmental samples by as much as 30-60% but does not reduce the fraction of OTUs in long-tailed rank abundance curves that defines the rare biosphere.
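The two-stage procedure described in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation. It assumes a plain edit distance as a stand-in for a true pairwise alignment, a simplified preclustering step that ignores read abundance, and a hypothetical 3% cutoff for the average-linkage stage (the abstract states only the 2% preclustering width). All function names are illustrative.

```python
from itertools import combinations


def edit_distance(a, b):
    """Levenshtein distance; an assumed stand-in for a pairwise alignment."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def distance(a, b):
    """Edit distance divided by the longer length (assumed definition)."""
    return edit_distance(a, b) / max(len(a), len(b))


def single_linkage_precluster(seqs, threshold=0.02):
    """Group reads linked by chains of pairs no farther apart than `threshold`."""
    parent = list(range(len(seqs)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i, j in combinations(range(len(seqs)), 2):
        if distance(seqs[i], seqs[j]) <= threshold:
            parent[find(i)] = find(j)
    groups = {}
    for i in range(len(seqs)):
        groups.setdefault(find(i), []).append(i)
    return list(groups.values())


def average_linkage(seqs, clusters, threshold=0.03):
    """Merge the closest clusters until the mean distance exceeds `threshold`.

    The 0.03 default is a hypothetical cutoff, not a value from the abstract.
    """
    clusters = [list(c) for c in clusters]

    def mean_dist(c1, c2):
        total = sum(distance(seqs[i], seqs[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        d, x, y = min(
            (mean_dist(clusters[x], clusters[y]), x, y)
            for x, y in combinations(range(len(clusters)), 2)
        )
        if d > threshold:
            break
        merged = clusters.pop(y)  # y > x, so index x stays valid
        clusters[x].extend(merged)
    return clusters


# Toy data: one template, two error-bearing variants, one unrelated read.
template = "ACGT" * 25
variant1 = template[:10] + "T" + template[11:]   # one mismatch vs. template
variant2 = variant1[:50] + "A" + variant1[51:]   # one more mismatch
unrelated = "G" * 100
reads = [template, variant1, variant2, unrelated]

preclusters = single_linkage_precluster(reads)
otus = average_linkage(reads, preclusters)
```

In this toy input the two variants chain to the template during preclustering, so the error-bearing reads are absorbed before the average-linkage stage rather than being counted as separate OTUs.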
Related articles
Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates.
Massively parallel pyrosequencing of the small subunit (16S) ribosomal RNA gene has revealed that the extent of rare microbial populations in several environments, the 'rare biosphere', is orders of magnitude higher than previously thought. One important caveat with this method is that sequencing error could artificially inflate diversity estimates. Although the per-base error of 16S rDNA ampli...
A Projector-Camera System for Ironing Support with Wrinkle Enhancement
Ironing is one of troublesome houseworks, in which the goal of the task is to remove wrinkles caused during washing. A projector has advantages in physical world instruction over an instruction sheet, a Head Mounted Display, or a smartphone/tablet PC because of direct mapping of instructive information on the target object. In this article, we propose a method to detect wrinkles using machine-l...
Improved OTU-picking using long-read 16S rRNA gene amplicon sequencing and generic hierarchical clustering
BACKGROUND High-throughput bacterial 16S rRNA gene sequencing followed by clustering of short sequences into operational taxonomic units (OTUs) is widely used for microbiome profiling. However, clustering of short 16S rRNA gene reads into biologically meaningful OTUs is challenging, in part because nucleotide variation along the 16S rRNA gene is only partially captured by short reads. The recen...
Ironing out the wrinkles in host defense: interactions between iron homeostasis and innate immunity.
Iron is an essential micronutrient for both microbial pathogens and their mammalian hosts. Changes in iron availability and distribution have significant effects on pathogen virulence and on the immune response to infection. Recent advances in our understanding of the molecular regulation of iron metabolism have shed new light on how alterations in iron homeostasis both contribute to and influe...
Subsampled open-reference clustering creates consistent, comprehensive OTU definitions and scales to billions of sequences
We present a performance-optimized algorithm, subsampled open-reference OTU picking, for assigning marker gene (e.g., 16S rRNA) sequences generated on next-generation sequencing platforms to operational taxonomic units (OTUs) for microbial community analysis. This algorithm provides benefits over de novo OTU picking (clustering can be performed largely in parallel, reducing runtime) and closed-...