A 'Best-of-Breed' approach for designing a fast algorithm for computing fixpoints of Galois Connections

نویسنده

  • Simon Andrews
چکیده

28 The fixpoints of Galois Connections form patterns in binary relational data, such as object29 attribute relations, that are important in a number of data analysis fields, including Formal 30 Concept Analysis (FCA), Boolean factor analysis and frequent itemset mining. However, the 31 large number of such fixpoints present in a typical dataset requires efficient computation 32 to make analysis tractable, particularly since any particular fixpoint may be computed 33 many times. Because they can be computed in a canonical order, testing the canonicity 34 of fixpoints to avoid duplicates has proven to be a key factor in the design of efficient algo35 rithms. The most efficient of these algorithms have been variants of the Close-By-One 36 (CbO) algorithm. In this article, the algorithms CbO, FCbO, In-Close, In-Close2 and a new 37 variant, In-Close3, are presented together for the first time, with in-Close2 and In-Close3 38 being the results of breeding In-Close with FCbO. To allow them to be easily compared, 39 the algorithms are presented in the same style and notation. The important advances in 40 CbO are described and compared graphically using a simple example. For the first time, 41 the algorithms are implemented using the same structures and techniques to provide a 42 level playing field for evaluation. Their performance is tested and compared using a range 43 of data sets and the most important features identified for a CbO ‘Best-of-Breed’. This arti44 cle also presents, for the first time, the ‘partial-closure’ canonicity test. 45 2014 Published by Elsevier Inc. 46

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast algorithm for computing fixpoints of Galois connections induced by object-attribute relational data

Fixpoints of Galois connections induced by object-attribute data tables represent important patterns that can be found in relational data. Such patterns are used in several data mining disciplines including formal concept analysis, frequent itemset and association rule mining, and Boolean factor analysis. In this paper we propose efficient algorithm for listing all fixpoints of Galois connectio...

متن کامل

Galois Connections with Hedges

We introduce (fuzzy) Galois connections with hedges. Fuzzy Galois connections are basic structures behind so-called formal concept analysis of data with fuzzy attributes. Introducing hedges to Galois connections means introducing two parameters. The parameters influence the size of the set of all the fixpoints of a Galois connection. In the sense of formal concept analysis, the fixpoints, calle...

متن کامل

A Robust Feedforward Active Noise Control System with a Variable Step-Size FxLMS Algorithm: Designing a New Online Secondary Path Modelling Method

Several approaches have been introduced in literature for active noise control (ANC)systems. Since Filtered-x-Least Mean Square (FxLMS) algorithm appears to be the best choice as acontroller filter. Researchers tend to improve performance of ANC systems by enhancing andmodifying this algorithm. This paper proposes a new version of FxLMS algorithm. In many ANCapplications an online secondary pat...

متن کامل

A Hybrid Neural Network Approach for Kinematic Modeling of a Novel 6-UPS Parallel Human-Like Mastication Robot

Introduction we aimed to introduce a 6-universal-prismatic-spherical (UPS) parallel mechanism for the human jaw motion and theoretically evaluate its kinematic problem. We proposed a strategy to provide a fast and accurate solution to the kinematic problem. The proposed strategy could accelerate the process of solution-finding for the direct kinematic problem by reducing the number of required ...

متن کامل

An Efficient Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems Based on TRIZ

An efficient assignment and scheduling of tasks is one of the key elements in effective utilization of heterogeneous multiprocessor systems. The task scheduling problem has been proven to be NP-hard is the reason why we used meta-heuristic methods for finding a suboptimal schedule. In this paper we proposed a new approach using TRIZ (specially 40 inventive principles). The basic idea of thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Sci.

دوره 295  شماره 

صفحات  -

تاریخ انتشار 2015