Theoretical Basis of Likelihood Methods in Molecular Phylogenetic Inference

نویسنده

  • Rhiju Das
چکیده

Phylogenetic inference for molecular data by the maximum-likelihood approach has been attacked from a theoretical point of view, because the likelihood functions take different forms for different trees, so that optimised likelihood values for different trees do not appear directly comparable (Nei, 1987). Here, a new "super-tree" perspective is introduced to refute these criticisms. A super-tree likelihood expression is constructed which is a function of all possible bipartition lengths; it reduces to the individual tree likelihood functions when bipartitions not in a given tree are set to zero. From this perspective, the problem of phylogeny inference is seen to be a classical statistical problem involving selection between composite hypotheses. In particular, the usual ML procedure is welljustified, and, moreover, the likelihood ratio between two trees does indeed indicate the posterior odds of the trees. This "literal" interpretation of the likelihood values shown by simulation to provide a more intuitive indication of tree selection accuracy than the "integrated" likelihood posterior probabilities of Rannala and Yang (1996) and bootstrap supports. Thus, the likelihood framework for phylogenetic inference for molecular phylogenetic inference has a good theoretical basis -provided that an adequately realistic model of molecular mutation is used to fit the data. To test the adequacy of such molecular mutation models, a set of straightforward "consistency checks", based on likelihood ratio statistics, are also presented. Predicted distributions of these statistics are shown to agree with simulation. These consistency checks as well as a likelihood-based tree selection procedure have been applied to several data sets: mtDNA from five primates, α and β globin genes from five mammals, mtDNA and wingless genes from sixty Heliconiiini butterflies, and mtDNA from forty mimicking races of Heliconius melpomene and Heliconius erato butteflies. These consistency checks, as well as the presence of internal contradictions, reject the commonly used HKY85+Γ model when applied to many of these data sets. It is concluded that while maximum likelihood is rigorous in principle (and clearly preferable to theoretically unjustified methods like maximum parsimony), it should be considered a heuristic procedure for phylogenetic inference until the complex biological processes influencing molecular mutation are fully understood. This dissertation does not exceed 18,000 words in length, exclusive of tables, figure captions, appendices, and references.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A taxonomic study of cyanobacteria in wheat fields adjacent to industrial areas in Yazd province (Iran)

Culturing, isolation, purification, and identification of cyanobacteria collected from wheat field soil, in five stations around the industrial areas in Yazd province (Iran) were conducted in this study. Identification of taxa was based on morphology and molecular methods. Cluster analysis and principal component analyses performed using SPSS software and rate of resemblance among the taxa were...

متن کامل

Genetic diversity of Arum L. based on plastid marker

TrnL-F region including intron trnL (UAA) and trnL (UAA) - trn (GAA) spacer in the large single-copy region of the chloroplast genome is widely used to infer phylogenetic relationships in plants. In this study, we obtained the trnL-F sequences from 8 samples of Arum L. in Iran. Phylogenetic analyses were conducted by the Bayesian inference, maximum parsimony, and maximum likelihood methods. The...

متن کامل

Philosophy and phylogenetic inference: a comparison of likelihood and parsimony methods in the context of Karl Popper's writings on corroboration.

Advocates of cladistic parsimony methods have invoked the philosophy of Karl Popper in an attempt to argue for the superiority of those methods over phylogenetic methods based on Ronald Fisher's statistical principle of likelihood. We argue that the concept of likelihood in general, and its application to problems of phylogenetic inference in particular, are highly compatible with Popper's phil...

متن کامل

Phylogenetic Analysis of Beta-Glucanase Producing Actinomycetes Strain TBG-CH22 - A Comparison of Conventional and Molecular Morphometric Approach

Actinomycetes are inexhaustible producers of commercially valuable metabolites, are continually screened for beneficial compounds. The taxonomic and phylogenetic study of novel actinomycetes strains are mostly based on conventional methods and primary DNA structure of 16s rRNA. Although 16s rRNA sequence is well accepted in phylogeny studies, its secondary structures have not been widely used. ...

متن کامل

An Optimization-Based Sampling Scheme for Phylogenetic Trees

Much modern work in phylogenetics depends on statistical sampling approaches to phylogeny construction to estimate probability distributions of possible trees for any given input data set. Our theoretical understanding of sampling approaches to phylogenetics remains far less developed than that for optimization approaches, however, particularly with regard to the number of sampling steps needed...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000