New substitution models for rooting phylogenetic trees
نویسندگان
چکیده
The root of a phylogenetic tree is fundamental to its biological interpretation, but standard substitution models do not provide any information on its position. Here, we describe two recently developed models that relax the usual assumptions of stationarity and reversibility, thereby facilitating root inference without the need for an outgroup. We compare the performance of these models on a classic test case for phylogenetic methods, before considering two highly topical questions in evolutionary biology: the deep structure of the tree of life and the root of the archaeal radiation. We show that all three alignments contain meaningful rooting information that can be harnessed by these new models, thus complementing and extending previous work based on outgroup rooting. In particular, our analyses exclude the root of the tree of life from the eukaryotes or Archaea, placing it on the bacterial stem or within the Bacteria. They also exclude the root of the archaeal radiation from several major clades, consistent with analyses using other rooting methods. Overall, our results demonstrate the utility of non-reversible and non-stationary models for rooting phylogenetic trees, and identify areas where further progress can be made.
منابع مشابه
Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction
Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we ...
متن کاملInferring phylogeny from whole genomes
MOTIVATION Inferring species phylogenies with a history of gene losses and duplications is a challenging and an important task in computational biology. This problem can be solved by duplication-loss models in which the primary step is to reconcile a rooted gene tree with a rooted species tree. Most modern methods of phylogenetic reconstruction (from sequences) produce unrooted gene trees. This...
متن کاملSpecies boundaries and phylogenetic relationships within the green algal genus Codium (Bryopsidales) based on plastid DNA sequences.
Despite the potential model role of the green algal genus Codium for studies of marine speciation and evolution, there have been difficulties with species delimitation and a molecular phylogenetic framework was lacking. In the present study, 74 evolutionarily significant units (ESUs) are delimited using 227 rbcL exon 1 sequences obtained from specimens collected throughout the genus' range. Sev...
متن کاملAccounting for solvent accessibility and secondary structure in protein phylogenetics is clearly beneficial.
Amino acid substitution models are essential to most methods to infer phylogenies from protein data. These models represent the ways in which proteins evolve and substitutions accumulate along the course of time. It is widely accepted that the substitution processes vary depending on the structural configuration of the protein residues. However, this information is very rarely used in phylogene...
متن کاملRooting Gene Trees without Outgroups: EP Rooting
Gene sequences are routinely used to determine the topologies of unrooted phylogenetic trees, but many of the most important questions in evolution require knowing both the topologies and the roots of trees. However, general algorithms for calculating rooted trees from gene and genomic sequences in the absence of gene paralogs are few. Using the principles of evolutionary parsimony (EP) (Lake J...
متن کامل