Is Consistency Harmful?
نویسندگان
چکیده
One of the major goals of most early concept learners was to find hypotheses that were perfectly consistent with the training data. It was believed that this goal would indirectly achieve a high degree of predictive accuracy on a set of test data. Later research has partially disproved this belief. However, the issue of consistency has not yet been resolved completely. We examine the issue of consistency from a new perspective. To avoid overfitting the training data, a considerable number of current systems have sacrificed the goal of learning hypotheses that are perfectly consistent with the training instances by setting a new goal of hypothesis simplicity (Occam’s razor). Instead of using simplicity as a goal, we have developed a novel approach that addresses consistency directly. In other words, our concept learner has the explicit goal of selecting the most appropriate degree of consistency with the training data. We begin this paper by exploring concept learning with less than perfect consistency. Next, we describe a system that can adapt its degree of consistency in response to feedback about predictive accuracy on test data. Finally, we present the results of initial experiments that begin to address the question of how tightly hypotheses should fit the training data for different problems.
منابع مشابه
From impure to harmful: Asymmetric expectations about immoral agents
Article history: Received 16 March 2016 Revised 10 August 2016 Accepted 10 August 2016 Available online 15 August 2016 How does information about agents' past violations influence people's expectations about their future actions? We examined this question, with a focus on the contrast between past harmful and past impure actions. Participants' judgments reflected two independent influences: act...
متن کاملMy Weak Consistency is Strong
It is expensive to maintain strong data consistency during concurrent execution. However, weak consistency levels, which are considered harmful, have been widely applied in analytical jobs. Their success challenges our belief: data consistency, which is believed to be an essential to precise computing, does not always need to be preserved. In this paper, we tackle one of the core questions rela...
متن کاملAnnotating Legitimate Disagreement in Corpus Construction
This paper addresses the resolution of inter-annotator disagreement in corpus construction. Given the consistency requirement which is regarded as a critical criterion of annotation quality, interannotator disagreement is usually considered harmful to the accuracy and reliability of annotation, and thus has to be resolved through various means. We claim that strictly adhering to consistency wou...
متن کاملOn the consistency of information lters for
A common practice when ltering a case-base is to employ a ltering scheme that decides which cases to delete, as well as how many cases to delete, such that the storage requirements are minimized and the classiication competence is preserved or improved. We introduce an algorithm that rivals the most successful existing algorithm in the average case when ltering 30 classiication problems. Neithe...
متن کاملImproving the geospatial consistency of digital libraries metadata
Consistency is an essential aspect of the quality of metadata. Inconsistent metadata records are harmful: given a themed query, the set of retrieved metadata records would contain descriptions of unrelated or irrelevant resources, and would even do not contain some resources considered obvious. This is even worse in when the description of the location is inconsistent. Inconsistent spatial desc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1992