Ecological Consistency of SSU rRNA-Based Operational Taxonomic Units at a Global Scale
Abstract
Operational Taxonomic Units (OTUs), usually defined as clusters of similar 16S/18S rRNA sequences, are the most widely used basic diversity units in large-scale characterizations of microbial communities. However, it remains unclear how well the various proposed OTU clustering algorithms approximate 'true' microbial taxa. Here, we explore the ecological consistency of OTUs, based on the assumption that, like true microbial taxa, they should show measurable habitat preferences (niche conservatism). In a global and comprehensive survey of available microbial sequence data, we systematically parse sequence annotations to obtain broad ecological descriptions of sampling sites. Based on these, we observe that sequence-based microbial OTUs generally show high levels of ecological consistency. However, different OTU clustering methods result in marked differences in the strength of this signal. Assuming that ecological consistency can serve as an objective external benchmark for cluster quality, we conclude that hierarchical complete linkage clustering, which provided the most ecologically consistent partitions, should be the default choice for OTU clustering. To our knowledge, this is the first approach to assess cluster quality on a global scale using an external, biologically meaningful parameter as a benchmark.
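For readers unfamiliar with the clustering method the abstract recommends, the following is a minimal sketch of hierarchical complete linkage clustering on pre-aligned sequences. It is not the authors' pipeline: the function names, the simple mismatch-fraction distance, the toy sequences, and the 0.15 cutoff are all assumptions chosen for illustration. The point it shows is that, under complete linkage, two clusters merge only if every pair of their members lies within the cutoff.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Fraction of mismatched positions between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def complete_linkage_otus(seqs: list[str], threshold: float) -> list[list[int]]:
    """Naive agglomerative clustering; returns OTUs as sorted lists of sequence indices."""
    clusters = [[i] for i in range(len(seqs))]
    # Precompute all pairwise distances, keyed by (smaller index, larger index).
    dist = {
        (i, j): p_distance(seqs[i], seqs[j])
        for i, j in combinations(range(len(seqs)), 2)
    }

    def linkage(c1: list[int], c2: list[int]) -> float:
        # Complete linkage: cluster distance is the LARGEST member-to-member distance.
        return max(dist[(min(i, j), max(i, j))] for i in c1 for j in c2)

    while len(clusters) > 1:
        # Find the closest pair of clusters under complete linkage.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        if linkage(clusters[a], clusters[b]) > threshold:
            break  # No remaining pair can be merged without exceeding the cutoff.
        merged = sorted(clusters[a] + clusters[b])
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)] + [merged]
    return sorted(clusters)


# Hypothetical toy data: sequences 0-2 form a "chain" of one-step differences.
toy = ["AAAAAAAAAA", "AAAAAAAAAT", "AAAAAAAATT", "GGGGGGGGGG"]
otus = complete_linkage_otus(toy, threshold=0.15)
```

In this toy input, sequences 0 and 1 differ at one position and sequences 1 and 2 differ at one position, but sequences 0 and 2 differ at two. Complete linkage therefore keeps sequence 2 in its own OTU, whereas a single linkage rule would chain all three together.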
Similar articles
Delimiting operational taxonomic units for assessing ciliate environmental diversity using small-subunit rRNA gene sequences.
Delineating operational taxonomic units (OTUs) is a central element in any culture-independent analysis of environmental microbial eukaryotic diversity. Previous studies either have not justified their choice in sequence distance used to bin small-subunit ribosomal RNA (SSU rRNA) gene sequences amplified from environmental samples into OTUs, or have used a value based on the average across a br...
Data processing can mask biology: towards better reporting of fungal barcoding data?
Fungal barcoding, that is the use of genetic markers to identify fungal species, has contributed enormously to the rise of mycorrhizal research in the last decade (van der Heijden et al., 2015) because it allows quick and easy en masse identification of species or higher taxonomic ranks and grouping of sequences into entities; this speeds up ecological analyses and the discovery of new species ...
SSUnique: Detecting Sequence Novelty in Microbiome Surveys
High-throughput sequencing of small-subunit (SSU) rRNA genes has revolutionized understanding of microbial communities and facilitated investigations into ecological dynamics at unprecedented scales. Such extensive SSU rRNA gene sequence libraries, constructed from DNA extracts of environmental or host-associated samples, often contain a substantial proportion of unclassified sequences, many re...
PhylOTU: A High-Throughput Procedure Quantifies Microbial Community Diversity and Resolves Novel Taxa from Metagenomic Data
Microbial diversity is typically characterized by clustering ribosomal RNA (SSU-rRNA) sequences into operational taxonomic units (OTUs). Targeted sequencing of environmental SSU-rRNA markers via PCR may fail to detect OTUs due to biases in priming and amplification. Analysis of shotgun sequenced environmental DNA, known as metagenomics, avoids amplification bias but generates fragmentary, non-o...
Defining DNA-based operational taxonomic units for microbial-eukaryote ecology.
DNA sequence information has increasingly been used in ecological research on microbial eukaryotes. Sequence-based approaches have included studies of the total diversity of selected ecosystems, studies of the autecology of ecologically relevant species, and identification and enumeration of species of interest for human health. It is still uncommon, however, to delineate protistan species base...