Variance Components Genetic Association Test for Zero-inflated Count Outcomes
Abstract
Biomedical studies commonly collect data in which an outcome measure contains informative excess zeros; for example, when observing the burden of neuritic plaques in brain pathology studies, individuals who show no plaques still contribute to our understanding of neurodegenerative disease. Such an outcome may be characterized by a mixture distribution in which one component is the ‘structural zero’ and the other is a Poisson distribution. We propose a novel variance components score test of genetic association between a set of genetic markers and a zero-inflated count outcome from a mixture distribution. This test shares advantageous properties with SNP-set tests previously devised for standard continuous or binary outcomes, such as the Sequence Kernel Association Test (SKAT). In particular, our method has superior statistical power compared to competing methods, especially when there is correlation within the group of markers and when the SNPs are associated with both the mixing proportion and the rate of the Poisson distribution. We apply the method to Alzheimer’s disease data from the Rush University Religious Orders Study and Memory and Aging Project (ROSMAP), where, as proof of principle, we find highly significant associations with the APOE gene in both the ‘structural zero’ and ‘count’ parameters for a zero-inflated neuritic plaque count outcome.
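The mixture distribution described in the abstract can be illustrated with a minimal sketch. The names `pi` (probability of a structural zero, i.e. the mixing proportion) and `lam` (the Poisson rate) are assumed here for illustration and are not taken from the paper; the code shows only the zero-inflated Poisson outcome distribution, not the proposed score test.

```python
import math


def zip_pmf(y: int, pi: float, lam: float) -> float:
    """P(Y = y) for a zero-inflated Poisson mixture.

    pi  : assumed name for the probability of a structural zero
    lam : assumed name for the rate of the Poisson component
    """
    if y < 0:
        return 0.0
    # Probability of y under the ordinary Poisson component.
    poisson = math.exp(-lam) * lam ** y / math.factorial(y)
    if y == 0:
        # A zero arises either structurally or from the Poisson component.
        return pi + (1.0 - pi) * poisson
    # Positive counts can only come from the Poisson component.
    return (1.0 - pi) * poisson


def zip_mean(pi: float, lam: float) -> float:
    """Mean of the mixture: structural zeros shrink the Poisson mean."""
    return (1.0 - pi) * lam
```

With `pi = 0.3` and `lam = 2.0`, for instance, the probability of a zero is `0.3 + 0.7 * exp(-2)`, which is larger than the plain Poisson value `exp(-2)`; this excess is what the ‘structural zero’ component captures.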
Similar resources
A comparison of nonlinear mixed models and response to selection of tick-infestation on lambs
Tick-borne fever (TBF) is stated as one of the main disease challenges in Norwegian sheep farming during the grazing season. TBF is caused by the bacterium Anaplasma phagocytophilum that is transmitted by the tick Ixodes ricinus. A sustainable strategy to control tick-infestation is to breed for genetically robust animals. In order to use selection to genetically improve traits we need reliable...
Hurdle, Inflated Poisson and Inflated Negative Binomial Regression Models for Analysis of Count Data with Extra Zeros
In this paper, we propose Hurdle regression models for analysing count responses with extra zeros. Maximum likelihood is used to estimate the model parameters. An application of the proposed model to an insurance dataset is presented. In this example, many of the claim counts are equal to zero, which clarifies the application of the model with a zero-inflat...
Assessment and Selection of Competing Models for Zero-Inflated Microbiome Data
Typical data in a microbiome study consist of the operational taxonomic unit (OTU) counts that have the characteristic of excess zeros, which are often ignored by investigators. In this paper, we compare the performance of different competing methods to model data with zero inflated features through extensive simulations and application to a microbiome study. These methods include standard para...
Multi-level zero-inflated Poisson regression modelling of correlated count data with excess zeros.
Count data with excess zeros relative to a Poisson distribution are common in many biomedical applications. A popular approach to the analysis of such data is to use a zero-inflated Poisson (ZIP) regression model. Often, because of the hierarchical study design or the data collection procedure, zero-inflation and lack of independence may occur simultaneously, which render the standard ZIP model...
Estimation of genetic parameters for average daily gain using models with competition effects.
Components of variance for ADG with models including competition effects were estimated from data provided by the Pig Improvement Company on 11,235 pigs from 4 selected lines of swine. Fifteen pigs with average age of 71 d were randomly assigned to a pen by line and sex and taken off test after approximately 89 d (off-test BW ranged from 61 to 158 kg). Models included fixed effects of line, sex...