SETM*-MaxK: An Efficient SET-Based Approach to Find the Largest Itemset
نویسندگان
چکیده
In this paper, we propose the SETM*-MaxK algorithm to find the largest itemset based on a high-level set-based approach, where a large itemset is a set of items appearing in a sufficient number of transactions. The advantage of the set-based approach, like the SETM algorithm, is simple and stable over the range of parameter values. In the SETM*-MaxK algorithm, we efficiently find the Lk based on Lw, where Lk denotes the set of large k-itemsets with minimum support, Lk = ∅, Lk+1 = ∅ and w = 2 log2k −1, instead of step by step. From our simulation, we show that the proposed SETM*-MaxK algorithm requires shorter time to achieve its goal than the SETM algorithm.
منابع مشابه
Set-Oriented Data Mining in relational Databases
Data mining is an important real-life application for businesses. It is critical to find efficient ways of mining large data sets. In order to benefit from the experience with relational databases, a set-oriented approach to mining data is needed. In such an approach, the data mining operations are expressed in terms of relational or set-oriented operations. Query optimization technology can th...
متن کاملAn improved approach to find and rank BCC-efficient DMUs in data envelopment analysis (DEA)
Recently, a mixed integer data envelopment analysis (DEA) model has been proposed to find the most BCC-efficient (or the best) decision making unit (DMU) by Toloo (2012). This paper shows that the model may be infeasible in some cases, and when the model is feasible, it may fail to identify the most efficient DMU, correctly. We develop an improved model to find the most BCC-efficient DMU that r...
متن کاملA new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining
Data envelopment analysis (DEA) is a relatively new data oriented approach to evaluate performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relative limited period, DEA has been converted into a strong quantitative and analytical tool to measure and evaluate performance. In an article written by Toloo et al. (2009...
متن کاملAN EFFICIENT METHOD FOR OPTIMUM PERFORMANCE-BASED SEISMIC DESIGN OF FUSED BUILDING STRUCTURES
A dual structural fused system consists of replaceable ductile elements (fuses) that sustain major seismic damage and leave the primary structure (PS) virtually undamaged. The seismic performance of a fused structural system is determined by the combined behavior of the individual PS and fuse components. In order to design a feasible and economic structural fuse concept, we need a procedure to ...
متن کاملMining High Average-Utility Itemsets with an Indexed Projection Technique
An itemset in traditional utility mining only considers individual profits and quantities of items in transactions but not its itemset length. The average-utility measure, which is the total utility of an itemset divided by its number of items within it, was then proposed to reveal a better utility effect than the original utility one. However, their proposed approach was based on the principle...
متن کامل