Computational Inference Beyond Kingman's Coalescent
نویسندگان
چکیده
Full likelihood inference under Kingman’s coalescent is a computationally challenging problem to which importance sampling (IS) and the product of approximate conditionals (PAC) method have been applied successfully. Both methods can be expressed in terms of families of intractable conditional sampling distributions (CSDs), and rely on principled approximations for accurate inference. Recently, more general Λ– and Ξ–coalescents have been observed to provide better modelling fits to some genetic data sets. We derive families of approximate CSDs for finite sites Λ– and Ξ–coalescents, and use them to obtain “approximately optimal” IS and PAC algorithms for Λ–coalescents, yielding substantial gains in efficiency over existing methods.
منابع مشابه
Scalable Inference on Kingman's Coalescent using Pair Similarity
We present a scalable sequential Monte Carlo algorithm and its greedy counterpart for models based on Kingman’s coalescent. We utilize fast nearest neighbor algorithms to limit expensive computations to only a subset of data point pairs. For a dataset size of n, the resulting algorithm has O(n log n) computational complexity. We empirically verify that we achieve a large speedup in computation....
متن کاملInference in Kingman's Coalescent with Particle Markov Chain Monte Carlo Method
We propose a new algorithm to do posterior sampling of Kingman’s coalescent, based upon the Particle Markov Chain Monte Carlo methodology. Specifically, the algorithm is an instantiation of the Particle Gibbs Sampling method, which alternately samples coalescent times conditioned on coalescent tree structures, and tree structures conditioned on coalescent times via the conditional Sequential Mo...
متن کاملFinding the best resolution for the Kingman-Tajima coalescent: theory and applications.
Many summary statistics currently used in population genetics and in phylogenetics depend only on a rather coarse resolution of the underlying tree (the number of extant lineages, for example). Hence, for computational purposes, working directly on these resolutions appears to be much more efficient. However, this approach seems to have been overlooked in the past. In this paper, we describe si...
متن کاملInferring Epidemiological Dynamics with Bayesian Coalescent Inference: The Merits of Deterministic and Stochastic Models
Estimation of epidemiological and population parameters from molecular sequence data has become central to the understanding of infectious disease dynamics. Various models have been proposed to infer details of the dynamics that describe epidemic progression. These include inference approaches derived from Kingman's coalescent theory. Here, we use recently described coalescent theory for epidem...
متن کاملProbabilistic Models for the (sub)Tree(s) of Life
The goal of these lectures is to review some mathematical aspects of random tree models used in evolutionary biology to model species trees. We start with stochastic models of tree shapes (finite trees without edge lengths), culminating in the β-family of Aldous’ branching models. We next introduce real trees (trees as metric spaces) and show how to study them through their contour, provided th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Applied Probability
دوره 52 شماره
صفحات -
تاریخ انتشار 2015