Tree-Based Algorithms for Action Rules Discovery

نویسندگان

  • Zbigniew W. Ras
  • Li-Shiang Tsay
  • Agnieszka Dardzinska
چکیده

One of the main goals in Knowledge Discovery is to find interesting associations between values of attributes, those that are meaningful in a domain of interest. The most effective way to reduce the amount of discovered patterns is to apply two interestingness measures, subjective and objective. Subjective measures are based on the subjectivity and understandability of users examining the patterns. They are divided into actionable, unexpected, and novel. Because classical knowledge discovery algorithms are unable to determine if a rule is truly actionable for a given user [1], we focus on a new class of rules [15], called E-action rules, that can be used not only for automatic analysis of discovered classification rules but also for hints of how to reclassify some objects in a data set from one state into another more desired one. Actionability is closely linked with the availability of flexible attributes [18] used to describe data and with the feasibility and cost [23] of desired re-classifications. Some of them are easy to achieve. Some, initially seen as impossible within constraints set up by a user, still can be successfully achieved if additional attributes are available. For instance, if a system is distributed and collaborating sites agree on the ontology [5], [6] of their common attributes, the availability of additional data from remote sites can help to achieve certain re-classifications of objects at a server site [23]. Action tree algorithm, presented in this paper, requires prior extraction of classification rules similarly as the algorithms proposed in [15] and [17] but it guarantees a faster and more effective process of E-action rules discovery. It was implemented as system DEAR 2.2 and tested on several public domain databases. Support and confidence of E-action rules is introduced and used to prune a large number of generated candidates which are irrelevant, spurious, and insignificant.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Knowledge discovery from patients’ behavior via clustering-classification algorithms based on weighted eRFM and CLV model: An empirical study in public health care services

The rapid growing of information technology (IT) motivates and makes competitive advantages in health care industry. Nowadays, many hospitals try to build a successful customer relationship management (CRM) to recognize target and potential patients, increase patient loyalty and satisfaction and finally maximize their profitability. Many hospitals have large data warehouses containing customer ...

متن کامل

Knowledge discovery from patients’ behavior via clustering-classification algorithms based on weighted eRFM and CLV model: An empirical study in public health care services

The rapid growing of information technology (IT) motivates and makes competitive advantages in health care industry. Nowadays, many hospitals try to build a successful customer relationship management (CRM) to recognize target and potential patients, increase patient loyalty and satisfaction and finally maximize their profitability. Many hospitals have large data warehouses containing customer ...

متن کامل

Action Rules Discovery Based on Tree Classifiers and Meta-actions

Action rules describe possible transitions of objects from one state to another with respect to a distinguished attribute. Early research on action rule discovery usually required the extraction of classification rules before constructing any action rule. Newest algorithms discover action rules directly from a decision system. To our knowledge, all these algorithms assume that all attributes ar...

متن کامل

Using Data Mining and Three Decision Tree Algorithms to Optimize the Repair and Maintenance Process

The purpose of this research is to predict the failure of devices using a data mining tool. For this purpose, at the outset, an appropriate database consists of 392 records of ongoing failures in a pharmaceutical company in 1394, in the next step, by analyzing 9 characteristics and type of failure as a database class, analyzes have been used. In this regard, three decision tree algorithms have ...

متن کامل

FP-tree and COFI Based Approach for Mining of Multiple Level Association Rules in Large Databases

In recent years, discovery of association rules among itemsets in a large database has been described as an important database-mining problem. The problem of discovering association rules has received considerable research attention and several algorithms for mining frequent itemsets have been developed. Many algorithms have been proposed to discover rules at single concept level. However, mini...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009