Case Base Size and Overall Competence: Incremental Increase and Similarity Threshold Selection on a Data Set
نویسنده
چکیده
This paper builds on prior case-based reasoning (CBR) research into the effect of incremental increase of case library size on overall system competence. We explore the effect of gradually increasing the size of the case base on the total accuracy of a CBR algorithm, using a large numeric data set as an example. We use a standard mathematical definition of system competence in our study of the data set and compare these results to those obtained in prior research on several other sets. We also use a strategy for adjusting the size of the similarity threshold, or maximum distance between a test case and a training point for which the match between the two can be considered successful. This makes use of both data normalization and consideration of maximum possible error. As expected, increasing the size of the case library had the effect of increasing competence and accuracy, but this technique did not always produce the most intuitive results.
منابع مشابه
IFSB-ReliefF: A New Instance and Feature Selection Algorithm Based on ReliefF
Increasing the use of Internet and some phenomena such as sensor networks has led to an unnecessary increasing the volume of information. Though it has many benefits, it causes problems such as storage space requirements and better processors, as well as data refinement to remove unnecessary data. Data reduction methods provide ways to select useful data from a large amount of duplicate, incomp...
متن کاملEvaluation of Similarity Measures for Template Matching
Image matching is a critical process in various photogrammetry, computer vision and remote sensing applications such as image registration, 3D model reconstruction, change detection, image fusion, pattern recognition, autonomous navigation, and digital elevation model (DEM) generation and orientation. The primary goal of the image matching process is to establish the correspondence between two ...
متن کاملSeismic Design of Steel Structures Based on Ductility and Incremental Nonlinear Dynamic Analysis
In this paper a simple tool for seismic design of steel structures for a selected ductility level is presented. For this purpose, a consistent set of earthquakes is selected and sorted based on the maximum acceleration of ground surface. The selected records are applied as the base motion to a single-degree-of-freedom system with strain hardening and the maximum response acceleration is determi...
متن کاملSTEM: Stacked Threshold-based Entity Matching for Knowledge Base Generation
One of the major issues encountered in the generation of knowledge bases is the integration of data coming from a collection of heterogeneous data sources. A key essential task when integrating data instances is the entity matching. Entity matching is based on the definition of a similarity measure among entities and on the classification of the entity pair as a match if the similarity exceeds ...
متن کاملHow Many Cases Do You Need? Assessing and Predicting Case-Base Coverage
Case acquisition is the primary learning method for casebased reasoning (CBR), and the ability of a CBR system’s case-base to cover the problems it encounters is a crucial factor in its performance. Consequently, the ability to assess the current level of case-base coverage and to predict the incremental benefit of adding cases could play an important role in guiding the case acquisition proces...
متن کامل