Modeling Data Quality with Possibility-distributions
نویسنده
چکیده
Description of data quality relies heavily on numbers. When dealing with data sets, which have been collected during longer periods, however, variation in the data quality will become evident. Older data may have different quality than newer data and other aspects including height or accessibility may influence the quality of parts of the data set. Precise numbers do not reflect this variation, thus I propose the use of fuzzy numbers to specify data quality. Fuzzy numbers are based on the specification of a distribution function. The distribution function may be a probability or a possibility function. Probability is more difficult to determine than possibility. Still, the possibility function may provide all information relevant for the user. Thus, providing the possibility function may be sufficient to improve the data quality description. The paper uses the Austrian cadastre as an example. The separation between legal and technical influences allows the specification of the possibility distributions. The example is restricted to temporal and positional accuracy and completeness. The assumption of quality requirements of two user groups finally allows assessing the fitness for use based on the possibility distributions.
منابع مشابه
Using Weighted Distributions for Modeling Skewed, Multimodal and Truncated Data
When the observations reflect a multimodal, asymmetric or truncated construction or a combination of them, using usual unimodal and symmetric distributions leads to misleading results. Therefore, distributions with ability of modeling skewness, multimodality and truncation have been in the core of interest in statistical literature, always. There are different methods to contract ...
متن کاملDetermination of Load and Strain-Stress Distributions in Hot Closed Die Forging Using the Plasticine Modeling Technique
An axisymmetric hot closed die-forging process has been studied by physical modeling technique using the plasticine. To observe the material flow pattern, layers of plasticine with different colors were used. The normal direction to the layers was considered a principal direction. The strain distribution was obtained by measuring the thickness of the plasticine layers. Based on the strain distr...
متن کاملThe effect of systems interaction possibility of electronic word of mouth advertising and E_ quality on E_ loyalty with the moderating role of decision support satisfaction
Internet revolution and ICT have changed the world and access to information and communication of the people with each other is possible more than past. In this new environment, relying on E-word of mouth communication could be a way to achieve a competitive advantage. Given the pervasive role of new technologies in Service industry as well as importance of customer loyalty in the insurance ind...
متن کاملGroundwater Flow and Transport Modeling With Correlated Possibilistic Data
Stochastic groundwater modeling involves the propagation of probabilistic uncertainty from model input parameters to model estimates, usually via a Monte Carlo method. With the increasing reliance upon expert knowledge to define model inputs, and both fuzzy set and possibility theories to characterize this expert knowledge, alternative means of executing model equations are needed. While the fu...
متن کاملModeling the effects of climate change on the distribution of Acanthalburnus urmianus (Günther, 1899) in Urmia lake basin rivers
According to the reports of the International Panel Climate Change (IPCC) there is no doubt about climate change occurring. All ecosystems on the earth have being concerned by the effects of climate change. Urmia lake basin and its rivers exposed to numerous anthropogenic stressors such as hydrological, morphological, connectivity and water quality pressures. The main objective of this study is...
متن کامل