Evolutionary approach for discovering changing patterns in historical data

نویسندگان

  • Wai-Ho Au
  • Keith C. C. Chan
چکیده

In this paper, we propose a new data mining approach, called dAR, for discovering interesting association rules and their changes by evolutionary computation. dAR searches through huge rule spaces effectively using a genetic algorithm. It has the following characteristics: (i) it encodes a complete set of rules in one single chromosome; (ii) each allele encodes one rule and each rule is represented by some non-binary symbolic values; (iii) the evolutionary process begins with the generation of an initial set of first-order rules (i.e., rules with one condition) using a probabilistic induction technique and based on these rules, rules of higher order (two or more conditions) are obtained iteratively; (iv) it adopts a steadystate reproduction scheme in which only two chromosomes are replaced every time; (v) when identifying interesting rules, an objective interestingness measure is used; and (vi) the fitness of a chromosome is defined in terms of the probability that the attribute values of a tuple can be correctly determined using the rules it encodes. Furthermore, dAR can also be used to mine the changes in discovered rules over time. This allows the accurate prediction of the future based on the historical data in the past. The experimental results on a synthetic database have shown that dAR is very effective at mining interesting association rules and their changes over time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Frequently Changing Substructures from Historical Unordered XML Documents

Recently, there is an increasing research efforts in XML data mining. These efforts largely assumed that XML documents are static. However, in many real applications, XML data are evolutionary in nature. In this paper, we focus on mining evolution patterns from historical XML documents. Specifically, we propose a novel approach to discover frequently changing structures (FCS) from a sequence of...

متن کامل

ارزیابی رویکردهای اقتصاد تطوری و ریشه های فکری آنها

Casting a chronological glance at the trend of studies in the evolutionary economics area suffices to come to the point that the approach enjoys a long historical background. It is to say that the studies conducted in this area consist of miscellaneous thoughts ranging from insight of early Marxists to Austrian neo-liberalists. Nevertheless, a landmark study that paved the way for the developme...

متن کامل

Dating divergence of Polystigma and other Sordariomycetes

Studies on the evolutionary history of ascomycetes in terms of time scale will help to understand historical patterns that shape their biodiversity. Until now most of dating studies of ascomycetes have focused on major events in fungal evolution but not on divergence events within smaller groups of fungi e.g. within Sordariomycetes. We used molecular dating to estimate the time of separation of...

متن کامل

Discovering Evolutionary Theme Patterns from Text ∗ CS 598

Temporal Text Mining (TTM) is concerned with discovering temporal patterns in text information collected over time. Since most text information bears some time stamps, TTM has many applications in multiple domains, such as summarizing events in news articles and revealing research trends in scientific literature. In this paper, we study a particular TTM task – discovering and summarizing the ev...

متن کامل

Discovering occurrences of user-defined patterns in historical data representing collaborative activities in virtual user environment

The paper deals with analyses of performed collaborative activities in virtual user environment, focused on pattern discovering. All activities are monitored and recorded into separate database within defined log format. This log format provides sufficient historical data for various analytical purposes as visualization through timeline or extraction of different statistics based on user expect...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002