Boosting Descriptive ILP for Predictive Learning

نویسندگان

  • Ning Jiang
  • Simon Colton
چکیده

Inductive Logic Programming has been very successful in application to multirelational predictive tasks. Sophisticated predictive ILP systems, such as Progol and foil, can achieve high predictive accuracy, while the learning results remain understandable. Although boosting [1] is an established method to promote predictive accuracy of weak algorithms, there have been relatively few efforts to apply it to ILP systems. Some studies include [2], which applied AdaBoost to the ffoil ILP system, and [3], in which MolFea, a domain-specific ILP system, was used as the base learner for AdaBoost. While these studies showed that predictive accuracy of ILP can often be increased by boosting, there is still much room for improvement. In particular, the run-time performance of ILP systems becomes an issue because AdaBoost has to invoke them many times to produce base classifiers. This prevents boosting from running more iterations to achieve higher predictive accuracy. Also, base classifiers from these ILP systems tend to be fairly accurate, which causes boosting to converge quickly, hence it is liable to overfitting, particularly on noisy datasets. Moreover, boosting needs to apply a weighting over training examples when the base algorithm is invoked. As ILP systems are usually not able to handle weighted examples, resampling is adopted, in which low weighted examples may be lost. To attempt to overcome these weaknesses, we have investigated the use of boosting with descriptive ILP systems, which generate first-order classification rules from training data in a class-blind manner. Using two methods to convert clauses from a descriptive ILP method to classifiers, we present the results of this approach for three bioinformatics datasets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Boosting Descriptive ILP for Predictive Learning in Bioinformatics

Boosting is an established propositional learning method to promote the predictive accuracy of weak learning algorithms, and has achieved much empirical success. However, there have been relatively few efforts to apply boosting to Inductive Logic Programming (ILP) approaches. We investigate the use of boosting descriptive ILP systems, by proposing a novel algorithm for generating classification...

متن کامل

Active relational rule learning in a constrained confidence rated boosting framework

In this dissertation, I investigate the potential of boosting within the framework of relational rule learning. Boosting is a particularly robust and powerful technique to enhance the prediction accuracy of systems that learn from examples. Although boosting has been extensively studied in the last years for propositional learning systems, only little attention has been paid to boosting in rela...

متن کامل

Relational Learning Using Constrained Confidence-Rated Boosting

In propositional learning, boosting has been a very popular technique for increasing the accuracy of classification learners. In firstorder learning, on the other hand, surprisingly little attention has been paid to boosting, perhaps due to the fact that simple forms of boosting lead to loss of comprehensibility and are too slow when used with standard ILP learners. In this paper, we show how b...

متن کامل

Rule Evaluation Measures: A Unifying View

Numerous measures are used for performance evaluation in machine learning. In predictive knowledge discovery, the most frequently used measure is classification accuracy. With new tasks being addressed in knowledge discovery, new measures appear. In descriptive knowledge discovery, where induced rules are not primarily intended for classification, new measures used are novelty in clausal and su...

متن کامل

Architectural Support for Compile-time Speculation

Studies on instruction-level parallelism (ILP) have shown that there are few independent instructions within the basic blocks of non-numerical applications. To uncover more independent instructions within these applications, instruction schedulers and microarchitectures must support the speculative execution of instructions. This paper describes an architectural mechanism for speculative execut...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006