Aggregating multiple probability intervals to improve calibration
نویسندگان
چکیده
We apply the principles of the “Wisdom of Crowds (WoC)” to improve the calibration of interval estimates. Previous research has documented the significant impact of the WoC on the accuracy of point estimates but only a few studies have examined its effectiveness in aggregating interval estimates. We demonstrate that collective probability intervals obtained by several heuristics can reduce the typical overconfidence of the individual estimates. We re-analyzed data from Glaser, Langer and Weber (2013) and from Soll and Klayman (2004) and applied four heuristics Averaging, Median, Enveloping, Probability averaging-suggested by Gaba, Tsetlin and Winkler (2014) and new heuristics, Averaging with trimming and Quartiles. We used the hit rate and the Mean Squared Error (MSE) to evaluate the quality of the methods. All methods reduced miscalibration to some degree, and Quartiles was the most beneficial securing accuracy and informativeness.
منابع مشابه
A model-based approach for the analysis of the calibration of probability judgments
The calibration of probability or confidence judgments concerns the association between the judgments and some estimate of the correct probabilities of events. Researchers rely on estimates using relative frequencies computed by aggregating data over observations. We show that this approach creates conceptual problems, and may result in the confounding of explanatory variables or unstable estim...
متن کاملChanging Statistical Significance with the Amount of Information: The Adaptive α Significance Level.
We put forward an adaptive alpha which changes with the amount of sample information. This calibration may be interpreted as a Bayes/non-Bayes compromise, and leads to statistical consistency. The calibration can also be used to produce confidence intervals whose size take in consideration the amount of observed information.
متن کاملCOMMENTS WELCOME Posterior Confidence Intervals in Linear Calibration Problems: Calibrating the Thompson Ice Core Index (Extended Working Paper Version)
In calibration problems, an exogenous state variable and an endogenous response variable or proxy are both observed in a set of calibration observations. We wish to make inferences about the unobserved state variable from an additional observation on the response variable under a diffuse prior. Hoadley (1970) had argued that an informative prior is required in order to obtain a proper posterior...
متن کاملApplication of Full Reinforcement Aggregation Operators in Speech Recognition
s: In speech recognition probably the most important factor is the recognition accuracy. This is why many attempts have been made to improve it. One such idea might be to use some kind of aggregation method for hypothesis probability calculations. The triangular norms are tools for aggregating one probability value from multiple probability values, thus they seem to be good for this task. In th...
متن کاملUsing cognitive models to combine probability estimates
We demonstrate the usefulness of cognitive models for combining human estimates of probabilities in two experiments. The first experiment involves people’s estimates of probabilities for general knowledge questions such as “What percentage of the world’s population speaks English as a first language?” The second experiment involves people’s estimates of probabilities in football (soccer) games,...
متن کامل