An outlier mining algorithm based on constrained concept lattice
نویسندگان
چکیده
An Outlier Mining Algorithm Based on Constrained Concept Lattice Jifu Zhang , Sulan Zhang , Kai H. Chang b, and Xiao Qin a School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan, P. R. China 030024 b Department of Computer Science and Software Engineering, Auburn University, Auburn, AL, USA 36849-5347 [email protected] Abstract: Traditional outlier mining methods identify outliers from a global point of view. These methods are inefficient to find locally-biased data points (outliers) in low dimensional subspaces. Constrained concept lattices can be used as an effective formal tool for data analysis because constrained concept lattices have the characteristics of high constructing efficiency, practicability, and pertinency,. In this paper,we propose an outlier mining algorithm that by treats the intent of any constrained concept lattice node as a subspace. We introduce sparsity and density coefficientsto measure outliers in low dimensional subspaces. The intent of any constrained concept lattice node is regarded as a subspace,and sparsity subspaces are searched by traversing the constrained concept lattice according to a sparsity coefficient threshold. If the intent of any father node of the sparsity subspace is a density subspace according to a density coefficient threshold, then objects contained in the extent of the sparsity subspace node are considered as bias data points or outliers. Our experimental results show that the proposed algorithm performs very well for high red-shift spectral data sets.
منابع مشابه
Outlier-based Data Association: Combining OLAP and Data Mining
Both data mining and OLAP are powerful decision support tools. However, people use them separately for years: OLAP systems concentrate on the efficiency of building OLAP cubes, and no statistical / data mining algorithms have been applied; on the other hand, statistical analysis are traditionally developed for two-way relational databases, and have not been generalized to the multi-dimensional ...
متن کاملA Classification Rule Acquisition Algorithm Based on Constrained Concept Lattice
Concept lattice is an effective tool for data analysis. Constrained concept lattice, with the characteristics of higher constructing efficiency, practicability and pertinence, is a new concept lattice structure. For classification rule acquisition, a classification rule acquisition algorithm based on the constrained concept lattice is presented by using the concept of partition support accordin...
متن کاملA Frequent Pattern Mining Algorithm Based on Concept Lattice
The concept lattice is an effective tool for data analysis and rule extraction, it is often well to mine frequent patterns by making use of concept lattice. In this paper, a frequent itemset mining algorithm FPCL based on concept lattice which builds lattice in batches, the algorithm builds lattice down layer by layer through the layer concept nodes and temporary nodes based on hierarchical con...
متن کاملRODHA: Robust Outlier Detection using Hybrid Approach
The task of outlier detection is to find the small groups of data objects that are exceptional to the inherent behavior of the rest of the data. Detection of such outliers is fundamental to a variety of database and analytic tasks such as fraud detection and customer migration. There are several approaches[10] of outlier detection employed in many study areas amongst which distance based and de...
متن کاملEfficient Mining of Association Rulesusing Closed
| Discovering association rules is one of the most important task in data mining. Many eecient algorithms have been proposed in the literature. The most noticeable are Apriori, Mannila's algorithm, Partition, Sampling and DIC, that are all based on the Apriori mining method: pruning the subset lattice (itemset lattice). In this paper we propose an eecient algorithm, called Close, based on a new...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Int. J. Systems Science
دوره 45 شماره
صفحات -
تاریخ انتشار 2014