Fuzzy Logic -based Pre-processing for Fuzzy Association Rule Mining
نویسندگان
چکیده
Conventional Association Rule Mining (ARM) algorithms usually deal with datasets with categorical values and expect any numerical values to be converted to categorical ones using ranges (Age = 25 to 60). Fuzzy logic is used to convert quantitative values of attributes to categorical ones so as to eliminate any loss of information arising due to sharp partitioning (using ranges) and then generate fuzzy association rules. But before any fuzzy ARM algorithm can be applied, there is substantial amount of fuzzy-logic-based pre-processing that needs to be done on the dataset. This paper describes, in detail, a methodology to do this pre-processing which first involves using fuzzy clustering to generate fuzzy partitions and then use these partitions to get a fuzzy version (with fuzzy records) of the original dataset. Ultimately, the fuzzy data (fuzzy records) are represented in a standard manner such that they can be used as input to any kind of fuzzy ARM algorithm, irrespective of how it works and processes the fuzzy data. And finally, we also show how existing algorithms, like Apriori, ARMOR, and FPGrowth can be modified, especially in the manner in which they count itemsets, to mine data in a fuzzy environment.
منابع مشابه
Clustering Based Automatic Fuzzy Partitioning of Numerical Attributes
Partitioning of quantified attributes is essential for mining association rules from quantified data and the Fuzzy approach solves the sharp boundary problem giving Fuzzy association rules having high interpretability and rich applicability. The paper presents automated partitioning of numerical data into Fuzzy sets based on k means clustering algorithm. This can be used as a pre-processing ste...
متن کاملEfficient mining of Fuzzy Association Rules from the Pre-Processed Dataset
Association rule mining is an active data mining research area. Recent years have witnessed many efforts on discovering fuzzy associations. The key strength of fuzzy association rule mining is its completeness. This strength, however, comes with a major drawback to handle large datasets. It often produces a huge number of candidate itemsets. The huge number of candidate itemsets makes it ineffe...
متن کاملAlgorithm of the Inverse Confidence of Data Mining Based on the Techniques of Association Rules and Fuzzy Logic
This article proposes an algorithm for data mining that presents a new measure for assistance in the extraction of knowledge. The algorithm uses association rules to extract rules from the databases and fuzzy logic for the classification and comparison of the collected rules. Key-words: data mining, association rules, fuzzy logic, similarity and algorithm of the inverse confidence.
متن کاملA Survey of Fuzzy Based Association Rule Mining to Find Co- Occurrence Relationships
Data mining is the analysis step of the "Knowledge Discovery in Databases" process, or KDD. It is the process that results in the discovery of new patterns in large data sets. It utilizes methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. The overall goal of the data mining process is to extract knowledge from an existing data set and tra...
متن کاملIntrusion detection by integrating boosting genetic fuzzy classifier and data mining criteria for rule pre-screening
The purpose of the work described in this paper is to provide an intelligent intrusion detection system (IIDS) that uses two of the most popular data mining tasks, namely classification and association rules mining together for predicting different behaviors in networked computers. To achieve this, we propose a method based on iterative rule learning using a fuzzy rule-based genetic classifier....
متن کامل