Bootstrap Confidence Intervals for Regression Error Characteristic Curves Evaluating the Prediction Error of Software Cost Estimation Models
نویسندگان
چکیده
The importance of Software Cost Estimation at the early stages of the development life cycle is clearly portrayed by the utilization of several algorithmic and artificial intelligence models and methods, appeared so far in the literature. Despite the several comparison studies, there seems to be a discrepancy in choosing the best prediction technique between them. Additionally, the large variation of accuracy measures used in the comparison procedure constitutes an inhibitory factor which complicates the decision-making. In this paper, we further extend the utilization of Regression Error Characteristic analysis, a powerful visualization tool with interesting geometrical properties in order to obtain Confidence Intervals for the entire distribution of error functions. As there are certain limitations due to the small-sized and heavily skewed datasets and error functions, we utilize a simulation technique, namely the bootstrap method in order to evaluate the standard error and bias of the accuracy measures, whereas bootstrap confidence intervals are constructed for the Regression Error Characteristic curves. The tool can be applied to any cost estimation situation in order to study the behavior of comparative statistical or artificial intelligence methods and test the significance of difference between models.
منابع مشابه
Statistical Topology Using the Nonparametric Density Estimation and Bootstrap Algorithm
This paper presents approximate confidence intervals for each function of parameters in a Banach space based on a bootstrap algorithm. We apply kernel density approach to estimate the persistence landscape. In addition, we evaluate the quality distribution function estimator of random variables using integrated mean square error (IMSE). The results of simulation studies show a significant impro...
متن کاملSemiparametric Bootstrap Prediction Intervals in time Series
One of the main goals of studying the time series is estimation of prediction interval based on an observed sample path of the process. In recent years, different semiparametric bootstrap methods have been proposed to find the prediction intervals without any assumption of error distribution. In semiparametric bootstrap methods, a linear process is approximated by an autoregressive process. The...
متن کاملEstimation in Simple Step-Stress Model for the Marshall-Olkin Generalized Exponential Distribution under Type-I Censoring
This paper considers the simple step-stress model from the Marshall-Olkin generalized exponential distribution when there is time constraint on the duration of the experiment. The maximum likelihood equations for estimating the parameters assuming a cumulative exposure model with lifetimes as the distributed Marshall Olkin generalized exponential are derived. The likelihood equations do not lea...
متن کاملNon-Bayesian Estimation and Prediction under Weibull Interval Censored Data
In this paper, a one-sample point predictor of the random variable X is studied. X is the occurrence of an event in any successive visits $L_i$ and $R_i$ :i=1,2…,n (interval censoring). Our proposed method is based on finding the expected value of the conditional distribution of X given $L_i$ and $R_i$ (i=1,2…,n). To make the desired prediction, our approach is on the basis of approximating the...
متن کاملBootstrapping Principal Component Regression Models
Bootstrap methods can be used as an alternative for cross-validation in regression procedures such as principal component regression (PCR). Several bootstrap methods for the estimation of prediction errors and confidence intervals are presented. It is shown that bootstrap error estimates are consistent with cross-validation estimates but exhibit less variability. This makes it easier to select ...
متن کامل