Dimensions of Group-based Phylogenetic Mixtures
Abstract
In this paper we study group-based Markov models of evolution and their mixtures. In the algebro-geometric setting, group-based phylogenetic tree models correspond to toric varieties, while their mixtures correspond to secant and join varieties. Determining properties of these secant and join varieties can aid both in model selection and in establishing parameter identifiability. Here we explore the first natural geometric property of these varieties: their dimension. The expected projective dimension of the join variety of a set of varieties is one more than the sum of their dimensions. A join variety that realizes the expected dimension is nondefective. Nondefectiveness is not only interesting from a geometric point of view, but has also been used to establish combinatorial identifiability for several classes of phylogenetic mixture models. In this paper, we focus on group-based models where the equivalence classes of identified parameters are orbits of a subgroup of the automorphism group of the group defining the model. In particular, we show that, for these group-based models, the variety corresponding to the mixture of $r$ trees with $n$ leaves is nondefective when $n \geq 2r+5$. We also give improved bounds for claw trees and give computational evidence that 2-tree and 3-tree mixtures are nondefective for small $n$.
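As a worked form of the dimension count in the abstract, the following sketch writes out the expected dimension of a join. The symbols $X_1,\dots,X_r$, the ambient projective space $\mathbb{P}^N$, and the notation $J(\cdot)$ and $\operatorname{expdim}$ are assumptions introduced here for illustration and are not taken from the paper. For two varieties it recovers the "one more than the sum" count; the general form with $r-1$ and the cap at $N$ follows the usual convention for joins.

```latex
% Assumed notation: X_1, ..., X_r are projective varieties in P^N,
% and J(X_1, ..., X_r) denotes their join.
\[
  \operatorname{expdim} J(X_1,\dots,X_r)
    = \min\Bigl\{\, \sum_{i=1}^{r} \dim X_i + (r-1),\; N \Bigr\}
\]
% Two varieties (r = 2): one more than the sum of the dimensions, capped at N.
\[
  \operatorname{expdim} J(X_1,X_2) = \min\{\, \dim X_1 + \dim X_2 + 1,\; N \,\}
\]
% Nondefective: the actual dimension equals the expected one.
\[
  \dim J(X_1,\dots,X_r) = \operatorname{expdim} J(X_1,\dots,X_r)
\]
```

When all the $X_i$ are the same variety $X$, the join is the $r$-th secant variety of $X$, which is the case of a mixture in which every class evolves on the same tree.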
Similar resources
Identifiability of 3-Class Jukes-Cantor Mixtures
We prove identifiability of the tree parameters of the 3-class Jukes-Cantor mixture model. The proof uses ideas from algebraic statistics, in particular: finding phylogenetic invariants that separate the varieties associated to different triples of trees; computing dimensions of the resulting phylogenetic varieties; and using the disentangling number to reduce to trees with a small number of le...
Phylogenetic group determination of faecal Escherichia coli and comparative analysis among different hosts
Phylogenetic analysis has shown that Escherichia coli is composed of four main phylogenetic groups (A, B1, B2 and D). Characterization of phylogenetic groups is of clinical interest, as groups A and B1 are generally associated with commensals, whereas most enteropathogenic isolates are assigned to group D, and group B2 is associated with extra-intestinal pathotypes. One hundred E. coli strains recove...
Phylogenetic relationships in Ranunculus species (Ranunculaceae) based on nrDNA ITS and cpDNA trnL-F sequences
The genus Ranunculus L., with a worldwide distribution, is the largest member of the Ranunculaceae. Here, nuclear ribosomal internal transcribed spacer (ITS) sequence data and chloroplast trnL-F sequence data were used to analyze phylogenetic relationships among members of the annual and perennial (Group Praemorsa, Group Rhizomatosa, Group Grumosa and Group non-Grumosa) species of Ranunculus...
Identifiability of 2-tree mixtures for group-based models
Phylogenetic data arising on two possibly different tree topologies might be mixed through several biological mechanisms, including incomplete lineage sorting or horizontal gene transfer in the case of different topologies, or simply different substitution processes on characters in the case of the same topology. Recent work on a 2-state symmetric model of character change showed that for 4 tax...
Phylogenetic analysis of Escherichia coli strains isolated from human samples
Escherichia coli (E. coli) is a normal inhabitant of the gastrointestinal tract of vertebrates, including humans. Phylogenetic analysis has shown that E. coli is composed of four main phylogenetic groups (A, B1, B2 and D). Groups A and B1 are generally associated with commensals, whereas group B2 is associated with extra-intestinal pathotypes. Most enteropathogenic isolates, however, are assigne...