Statistical Tests for Discrete Cross-species Data
نویسنده
چکیده
Four methods have been proposed that can be used to test for associations between the states of discrete characters in cross-species data and that do not suffer from non-independence due to overcounting of data points. The tests are those of Ridley (1983), Burt (1989), Grafen (1989), and a new test called the ICDE test. The aim of the paper is to measure the Type I error rates for these methods with simulated null distributions of discrete characters. The null data is generated by a model of discrete character evolution, using three shapes of phylogeny: tetratomous, dichotomous, and realistic. Ridley’s and Burt’s tests are both reasonably valid with the realistic phylogeny but biased with the tetratomous and dichotomous phylogenies. Grafen’s phylogenetic regression is reasonably valid with all tree shapes. One version of the ICDE test was valid, the other less so. The invalid results are explained in terms of two kinds of statistical non-independence that arise in discrete data: non-independence due to the reconstruction of character states by parsimony, and the ‘‘family problem’’ in which similar patterns are found in null data in many separate radiations because all the radiations began from the same ancestral state. 7 1996 Academic Press Limited
منابع مشابه
Non-independence in statistical tests for discrete cross-species data.
The paper described three previously undetected effects, due to biases and non-independence, that can arise in statistical tests for associations between character states in cross-species data. One kind, which we call the family problem, is general to all known methods. In phytogenetic data, the ancestral character state from which changes occur, or below which variation is found, is likely to ...
متن کاملModeling Nonnegative Data with Clumping at Zero: A Survey
Applications in which data take nonnegative values but have a substantial proportion of values at zero occur in many disciplines. The modeling of such “clumped-at-zero” or “zero-inflated” data is challenging. We survey models that have been proposed. We consider cases in which the response for the non-zero observations is continuous and in which it is discrete. For the continuous and then the d...
متن کاملNonparametric Goodness-of-Fit Tests for Discrete, Grouped or Censored Data
The problems of application of nonparametric Kolmogorov, Cramer-von MisesSmirnov, Anderson-Darling goodness-of-fit tests for discrete, grouped and censored data have been considered in this paper. The use of these tests for grouped and censored data as well as samples of discrete random variables is based on Smirnov transformation. The convergence of statistic distributions to the corresponding...
متن کاملGreen Algae (Raphidocelis subcapitata) Growth Model
Toxicity testing in populations probe for responses in demographic variables to anthropromorphic or natural chemical changes in the environment. Importantly, these tests are performed on species in isolation of adjacent tropic levels in their ecosystem. The development and validation of coupled species models may aid in predicting adverse outcomes at the ecosystems level. Here, we aim to valida...
متن کاملA MODEL FOR MIXED CONTINUOUS AND DISCRETE RESPONSES WITH POSSIBILITY OF MISSING RESPONSES
A model for missing data in mixed binary and continuous responses, which can be used on cross-sectional data, is presented. In this model response indicator for the binary response can be dependent on the continuous response. A closed form for the likelihood is found. For data with a complicated pattern of missing responses some new residuals are also proposed. The model of multiplicative heter...
متن کامل