Efficiently Mining Frequent Closed Partial Orders ( Extended Abstract )
نویسندگان
چکیده
Example 1 (Motivation) Suppose MapleBank in Canada wants to investigate whether there is some orders which customers often follow to open their accounts. A database DB in Table 1 about four customers’ sequences of opening accounts in MapleBank is analyzed. Given a support threshold min sup, a sequential pattern is a sequence s which appears as subsequences of at least min sup sequences. For example, let min sup = 3. The following four sequences are sequential patterns since they are subsequences of three sequences, 1, 2 and 4, in DB. CHK → MMK → MORT → RESP; CHK → MMK → MORT → BROK; CHK → RRSP → MORT → RESP; CHK → RRSP → MORT → BROK The sequential patterns capture the frequent account opening patterns shared by customers. However, the four sequential patterns cannot completely capture the ordering shared by customers 1, 2 and 4. It is easy to see that a partial order R as shown in Figure 1 is shared by the three account opening sequences. The partial order R summarizes the four sequential patterns – the four sequential patterns are paths in partial order R. It also provides more information about the ordering than the sequential patterns.
منابع مشابه
On compressing frequent patterns q
A major challenge in frequent-pattern mining is the sheer size of its mining results. To compress the frequent patterns, we propose to cluster frequent patterns with a tightness measure d (called d-cluster), and select a representative pattern for each cluster. The problem of finding a minimum set of representative patterns is shown NP-Hard. We develop two greedy methods, RPglobal and RPlocal. ...
متن کاملMining Top-K Frequent Closed Patterns without Minimum Support
In this paper, we propose a new mining task: mining top-k frequent closed patterns of length no less than min `, where k is the desired number of frequent closed patterns to be mined, and min ` is the minimal length of each pattern. An efficient algorithm, called TFP, is developed for mining such patterns without minimum support. Two methods, closed node count and descendant sum are proposed to...
متن کاملUsing attribute value lattice to find closed frequent itemsets
Finding all closed frequent itemsets is a key step of association rule mining since the non-redundant association rule can be inferred from all the closed frequent itemsets. In this paper we present a new method for finding closed frequent itemsets based on attribute value lattice. In the new method, we argue that vertical data representation and attribute value lattice can find all closed freq...
متن کاملSimultaneous mining of frequent closed itemsets and their generators: Foundation and algorithm
Closed itemsets and their generators play an important role in frequent itemset and association rule mining. They allow a lossless representation of all frequent itemsets and association rules and facilitate mining. Some recent approaches discover frequent closed itemsets and generators separately. The Close algorithm mines them simultaneously but it needs to scan the database many times. Based...
متن کاملCLAIM: An Efficient Method for Relaxed Frequent Closed Itemsets Mining over Stream Data
Recently, frequent itemsets mining over data streams attracted much attention. However, mining closed itemsets from data stream has not been well addressed. The main difficulty lies in its high complexity of maintenance aroused by the exact model definition of closed itemsets and the dynamic changing of data streams. In data stream scenario, it is sufficient to mining only approximated frequent...
متن کامل