Association rule mining and load balancing strategy in grid systems
نویسندگان
چکیده
The parallel and distributed systems represent one of the important solutions proposed to ameliorate the performance of the sequential association rule mining algorithms. However, parallelization and distribution process is not trivial and still facing many problems of synchronization, communication, and workload balancing. Our study is limited to the workload balancing problem. In this paper, we propose a dynamic load balancing strategy of association rule mining algorithm under a grid environment. This strategy is built upon a hierarchical grid model with three levels: Super coordinator, coordinator, and processing nodes. The main objective of our strategy is to ameliorate the performances of the distributed association rule mining algorithm “APRIORI”.
منابع مشابه
Design and Analysis of a Dynamic Load Balancing Strategy for Large-Scale Distributed Association Rule Mining
Association rule mining is one of the most important data mining techniques. Algorithms of this technique search a large space, considering numerous different alternatives and scanning the data repeatedly. Parallelism seems to be the natural solution in order to be able to work with industrial-sized databases. Large-scale computing systems, such as Grid computing environments, are recently rega...
متن کاملApplication of Parallelized Apriori in Grid Computing Environment
The goal of the strategy is to improve the performance of distributed algorithms and better their responsiveness. The association rule mining algorithms has high computational complexity due to the size of its search space and the high demands of data access. The work aims at mining the data in a grid computing environment, which computes by distributing the data to its clusters and mines it in...
متن کاملA Hierarchical Dynamic Load Balancing Strategy for Distributed Data Mining
Extracting useful knowledge from data sets measuring in gigabytes and even terabytes is a challenging research area for the data mining community. Sequential approaches suffer from a performance problem due to the fact that they have to mine voluminous databases. Parallelism is introduced as an important solution that could improve the response time and the scalability of these approaches. Howe...
متن کاملAssociation rule mining application to diagnose smart power distribution system outage root cause
Smart grid has been introduced to address power distribution system challenges. In conventional power distribution systems, when a power outage happens, the maintenance team tries to find the outage cause and mitigate it. After this, some information is documented in a dataset called the outage dataset. If the team can estimate the outage cause before searching for it, the restoration time will...
متن کاملA Novel Data Partitioning Approach for Association Rule Mining on Grids
Mining association rules refers to extracting useful knowledge from large databases. Algorithms of this technique are both data and computation-intensive, which make grid platforms very attractive for them. However, to exploit these platforms, new data partitioning features are required where the specificities of both association rule mining technique and grids must be taken into consideration....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Int. Arab J. Inf. Technol.
دوره 11 شماره
صفحات -
تاریخ انتشار 2014