Modeling motor vehicle crashes using Poisson-gamma models: examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter.
نویسنده
چکیده
There has been considerable research conducted on the development of statistical models for predicting crashes on highway facilities. Despite numerous advancements made for improving the estimation tools of statistical models, the most common probabilistic structure used for modeling motor vehicle crashes remains the traditional Poisson and Poisson-gamma (or Negative Binomial) distribution; when crash data exhibit over-dispersion, the Poisson-gamma model is usually the model of choice most favored by transportation safety modelers. Crash data collected for safety studies often have the unusual attributes of being characterized by low sample mean values. Studies have shown that the goodness-of-fit of statistical models produced from such datasets can be significantly affected. This issue has been defined as the "low mean problem" (LMP). Despite recent developments on methods to circumvent the LMP and test the goodness-of-fit of models developed using such datasets, no work has so far examined how the LMP affects the fixed dispersion parameter of Poisson-gamma models used for modeling motor vehicle crashes. The dispersion parameter plays an important role in many types of safety studies and should, therefore, be reliably estimated. The primary objective of this research project was to verify whether the LMP affects the estimation of the dispersion parameter and, if it is, to determine the magnitude of the problem. The secondary objective consisted of determining the effects of an unreliably estimated dispersion parameter on common analyses performed in highway safety studies. To accomplish the objectives of the study, a series of Poisson-gamma distributions were simulated using different values describing the mean, the dispersion parameter, and the sample size. Three estimators commonly used by transportation safety modelers for estimating the dispersion parameter of Poisson-gamma models were evaluated: the method of moments, the weighted regression, and the maximum likelihood method. In an attempt to complement the outcome of the simulation study, Poisson-gamma models were fitted to crash data collected in Toronto, Ont. characterized by a low sample mean and small sample size. The study shows that a low sample mean combined with a small sample size can seriously affect the estimation of the dispersion parameter, no matter which estimator is used within the estimation process. The probability the dispersion parameter becomes unreliably estimated increases significantly as the sample mean and sample size decrease. Consequently, the results show that an unreliably estimated dispersion parameter can significantly undermine empirical Bayes (EB) estimates as well as the estimation of confidence intervals for the gamma mean and predicted response. The paper ends with recommendations about minimizing the likelihood of producing Poisson-gamma models with an unreliable dispersion parameter for modeling motor vehicle crashes.
منابع مشابه
Effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter of Poisson-gamma models for modeling motor vehicle crashes: a Bayesian perspective
There has been considerable research conducted on the development of statistical models for predicting motor vehicle crashes on highway facilities. Over the last few years, there has been a significant increase in the application hierarchical Bayes methods for modeling motor vehicle crash data. Whether the inferences are estimated using classical or Bayesian methods, the most common probabilist...
متن کاملEffects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter of Poisson-gamma models: A Bayesian Perspective
There has been considerable research conducted on the development of statistical models for predicting motor vehicle crashes on highway facilities. Many of these developments were performed for the likelihood-based or frequentist modeling approach. Over the last few years, there has been a significant increase in the application hierarchical Bayes method for modeling motor vehicle crashes. Whet...
متن کاملExamining the Application of Aggregated and Disaggregated Poisson-gamma Models Subjected to Low Sample Mean Bias
The costs of collecting crash and other related data can be very prohibitive. As a result, these data can often only be collected at a limited number of sites. One way to increase the sample size for developing reliable statistical models is to collect data at the same sites for a long time period. Two general classes of models have been proposed for modeling crash data using such datasets: dis...
متن کاملEffects of the Varying Dispersion Parameter of Poisson-gamma models on the estimation of Confidence Intervals of Crash Prediction models
The most common probabilistic structure of the models used by transportation safety analysts for modeling motor vehicle crashes are the traditional Poisson and Poissongamma (or Negative Binomial) distributions. Since crash data have been shown to exhibit over-dispersion, Poisson-gamma models are usually preferred over Poisson regression models. Up until recently, the dispersion parameter of Poi...
متن کاملInvestigating the effects of the fixed and varying dispersion parameters of Poisson-gamma models on empirical Bayes estimates.
Traditionally, transportation safety analysts have used the empirical Bayes (EB) method to improve the estimate of the long-term mean of individual sites; to correct for the regression-to-the-mean (RTM) bias in before-after studies; and to identify hotspots or high risk locations. The EB method combines two different sources of information: (1) the expected number of crashes estimated via crash...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Accident; analysis and prevention
دوره 38 4 شماره
صفحات -
تاریخ انتشار 2006