Mining Closed Gradual Patterns
نویسندگان
چکیده
With the steady development of the computing tools, we attended last three decades a considerable increase of the quantity of data stored in databases. So, extracting knowledge from this data is of paramount importance. Data mining is becoming an inescapable tool to reach this goal. Association rule extraction is one of the important tasks in data mining. This powerful technique has a wide range of applications in many areas of business practice and also research. A scrutiny of the related work shows that another type of rules called gradual rules also paid attention within the data mining community. Gradual rules of the form −“the more A, the more B”− mainly grasped the interest within recommendation and command system fields [2]. Several approaches and semantics dealing with this kind of rules have been proposed in literature. However, the relevance and the usefulness of the mined knowledge seems no to be the main concern in these approaches. In fact, it is expected that an overwhelming quantity of gradual rules will be drawn even from low sized contexts. The main thrust of this paper is to address to lossless reduction of the mined knowledge. To reach this goal, a possible solution consists in using results of Formal Concept Analysis that has been shown to provide useful seeds to tackle such knowledge extraction problem. However, no work has addressed the use of the FCA framework for gradual patterns. Hence, we introduce a novel Galois connection that is a sine qua non issue for extracting closed gradual patterns. These latter patterns will act as a lossless reduced-size nucleus of patterns. The remainder of the paper is organized as follows. Section 2 reviews the related work focused on mining gradual rules and some basic notions of the FCA framework. Section 3 introduces our novel Galois connection definition and shows its validity and soundness. Section 4 validates the importance of our approach at reducing the hudge number of the extracted gradual closed patterns through experiments carried out over synthetic datasets. Section 5 sketches our future perspectives and presents concluding remarks.
منابع مشابه
Mining Emerging Gradual Patterns
Mining emerging patterns aims at contrasting data sets and identifying itemsets that characterise a data set by contrast to a reference data set, so as to capture and highlight their differences. This paper considers the case of emerging gradual patterns, to extract discriminant attribute co-variations. It discusses the specific features of these gradual patterns and proposes to transpose an ef...
متن کاملA Framework for Mining Closed Sequential Patterns
Sequential pattern mining algorithms developed so far provide better performance for short sequences but are inefficient at mining long sequences, since long sequences generate a large number of frequent subsequences. To efficiently mine long sequences, closed sequential pattern mining algorithms have been developed. These algorithms mine closed sequential patterns which don’t have any super se...
متن کاملCCMine: Efficient Mining of Confidence-Closed Correlated Patterns
Correlated pattern mining has become increasingly important recently as an alternative or an augmentation of association rule mining. Though correlated pattern mining discloses the correlation relationships among data objects and reduces significantly the number of patterns produced by the association mining, it still generates quite a large number of patterns. In this paper, we propose closed ...
متن کاملMining and Ranking Generators of Sequential Patterns
Sequential pattern mining first proposed by Agrawal and Srikant has received intensive research due to its wide range applicability in many real-life domains. Various improvements have been proposed which include mining a closed set of sequential patterns. Sequential patterns supported by the same sequences in the database can be considered as belonging to an equivalence class. Each equivalence...
متن کاملMining Top-K Frequent Closed Patterns without Minimum Support
In this paper, we propose a new mining task: mining top-k frequent closed patterns of length no less than min `, where k is the desired number of frequent closed patterns to be mined, and min ` is the minimal length of each pattern. An efficient algorithm, called TFP, is developed for mining such patterns without minimum support. Two methods, closed node count and descendant sum are proposed to...
متن کامل