Meta-knowledge Management in Multistrategy Process-oriented Knowledge Discovery Systems

نویسندگان

  • Mykola Pechenizkiy
  • Alexey Tsymbal
  • Seppo Puuronen
چکیده

Current electronic data repositories are growing quickly and contain big amount of data from commercial, scientific, and other domain areas. The capabilities for collecting and storing all kinds of data exceed the abilities to analyze, summarize, and extract knowledge from this data. Knowledge discovery systems (KDSs) use achievements from many technical areas, including databases, Data Mining (DM), statistics, AI, machine learning, pattern recognition, high performance computing, management information systems (MIS), decision support systems, and knowledge-based systems. Knowledge discovery is an innovative approach to information management and is associated commonly with the nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns and relations in large databases (Fayyad, 1996). Numerous data mining techniques have recently been developed to extract knowledge from large databases. Since present-day KDSs are armed with a number of available techniques to process data; and, potentially, there are many possible combinations of these techniques to construct a DM strategy for mining a current problem. In a real problem-solving situation it is not computationally feasible to apply every DM strategy. Therefore, dynamic selection of data mining methods in knowledge discovery systems has been under active study (see, for example, (Tsymbal, 2002)). However, at least two contexts of dynamic selection can be distinguished. First, the so-called multi-classifier systems that apply different ensemble techniques (Dietterich, 1997). Their general idea is usually to select one classifier on the dynamic basis taking into account the local performance (e.g. generalisation accuracy) in the instance space. Second, multistrategy learning that applies a strategy selection approach which takes into account the classification problemrelated characteristics (meta-data). We are interested in the second context in this study. Selection of the most appropriate DM technique or a group of the most appropriate techniques is usually not straightforward. Many empirical studies are aimed

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Relationship between Knowledge Management and the Process of Entrepreneurship in Sport Organizations

In the current competitive world, organizations can reach competitive advantage which support entrepreneurship by providing the required tools. One of the most important tools for developing entrepreneurship which was neglected in previous studies is organizational knowledge management. The present paper aims to shed light on the role and importance of knowledge management in sport entreprene...

متن کامل

Multistrategy Data Exploration Using the INLEN System: Recent Advances

Recent advances in the development of the INLEN system for multistrategy data exploration are briefly reviewed. These advances include the development of a meta-level language for data mining and knowledge discovery, called knowledge generation language (KGL), and the employment of a new type of attributes, called structured attributes. These features are illustrated by an example concerned wit...

متن کامل

A Methodology and Life Cycle Model for Data Mining and Knowledge Discovery in Precision Agriculture

This paper presents a methodology for data mining and knowledge discovery in large, distributed and heterogeneous databases. In order to obtain potentially interesting patterns, relationships, and rules from such large and heterogeneous data collections, it is essential that a methodology be developed to take advantage of the suite of existing methods and tools available for data mining and kno...

متن کامل

A Multistrategy Learning Approach to Flexible Knowledge Organization and Discovery

1 Also with Lockheed Martin Federal Systems, Gaithersburg, MD. 2 Also with Science Applications International Corp., Tysons Corner, VA. Abstract Properly organizing knowledge so that it can be managed often requires the acquisition of patterns and relations from large, distributed, heterogeneous databases. The employment of an intelligent and automated KDD (Knowledge Discovery in Databases) pro...

متن کامل

AqBC: A Multistrategy Approach for Constructive Induction

In order to obtain potentially interesting patterns and relations from large, distributed, heterogeneous databases, it is essential to employ an intelligent and automated KDD (Knowledge Discovery in Databases) process. One of the most important methodologies is an integration of diverse learning strategies that cooperatively performs a variety of techniques and achieves high quality knowledge. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005