Describing Data Table with Best Decision

Authors

  • Ants Torim
  • Rein Kuusik
Abstract

We propose a new way of describing data tables that is inspired by decision trees. Our goal is to summarize an entire data table with one “average” object, called the best decision. The best decision is defined here as the decision that achieves the greatest value of a weight function. We first review a computationally simple weight function for defining the best decision, which does not account for the dependencies between the attributes. We then define a decision as a branch in a decision tree and introduce a weight function that takes those dependencies into account. Because the search space for such decisions grows factorially with the number of attributes, efficient pruning techniques are necessary. We define three pruning techniques that can be applied in combination, and we present empirical data to demonstrate their effectiveness.

Keywords: data mining, decision tree, best decision, optimization, algorithms
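The contrast between the two kinds of weight function can be illustrated with a small sketch. The abstract does not give the actual formulas, so both weights below are assumptions chosen only for illustration: the independent weight sums the frequency of the most common value of each attribute, and the branch weight sums the number of rows that still match after each step along a decision-tree branch. The names `TABLE`, `independent_best_decision` and `best_branch` are hypothetical, and the search is exhaustive, with none of the paper's three pruning techniques.

```python
from collections import Counter

# Hypothetical toy data table: each row is an object, each column an attribute.
TABLE = [
    ("a", "y"), ("a", "y"), ("a", "y"),
    ("b", "x"), ("b", "x"),
    ("c", "x"), ("c", "x"),
]

def independent_best_decision(rows):
    """Assumed simple weight: pick the most frequent value of each
    attribute on its own and sum those frequencies."""
    decision, weight = [], 0
    for attr in range(len(rows[0])):
        value, freq = Counter(row[attr] for row in rows).most_common(1)[0]
        decision.append(value)
        weight += freq
    return tuple(decision), weight

def best_branch(rows, remaining=None):
    """Assumed dependency-aware weight: a decision is a branch, i.e. an
    ordered list of (attribute, value) tests, and its weight is the sum
    of the row counts that survive each test. Exhaustive search, so the
    number of attribute orderings grows factorially."""
    if remaining is None:
        remaining = frozenset(range(len(rows[0])))
    if not remaining:
        return [], 0
    best_path, best_weight = [], -1
    for attr in sorted(remaining):
        counts = Counter(row[attr] for row in rows)
        for value, freq in sorted(counts.items()):
            subset = [row for row in rows if row[attr] == value]
            path, weight = best_branch(subset, remaining - {attr})
            if freq + weight > best_weight:
                best_path, best_weight = [(attr, value)] + path, freq + weight
    return best_path, best_weight
```

On this toy table the independent weight selects the combination `("a", "x")`, which matches no row at all, while the branch search returns a decision that describes actual objects. This is the kind of attribute dependency the second weight function is meant to capture.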

Similar resources

Decision Algorithms: A Survey of Rough Set - Theoretic Methods

In this paper we present some strategies for the synthesis of decision algorithms that we have studied. These strategies are used by systems of communicating agents and lead from the original (input) data table to a decision algorithm. The agents work with parts of the data and compete for the decision algorithm with the best quality of object classification. We give examples of techniques for sear...

Basic influence diagrams and the liberal stable semantics

This paper is concerned with the general problem of constructing decision tables and more specifically, with the identification of all possible outcomes of decisions. We introduce and propose basic influence diagrams as a simple way of describing problems of decision making under strict uncertainty. We then establish a correspondence between basic influence diagrams and symmetric generalised as...

Preprocessing Input Data for Machine Learning by FCA

The paper presents a use of formal concept analysis in preprocessing input data for machine learning. Two preprocessing methods are presented. The first consists of extending the set of attributes describing objects in the input data table with new attributes, and the second consists of replacing the attributes with new attributes. In both methods the new attributes are defined by certa...

Decision table for classifying point sources based on FIRST and 2MASS databases

With the availability of multiwavelength, multiscale and multiepoch astronomical catalogues, the number of features available to describe astronomical objects has increased. The better the features we select to classify objects, the higher the classification accuracy. In this paper, we have used data sets of stars and quasars from the near-infrared and radio bands. Then best-first search method was applie...

A Comparative Study on Remote Tracking of Parkinsons Disease Progression Using Data Mining Methods

In recent years, applications of data mining methods have become more popular in many fields of medical diagnosis and evaluation. Data mining methods are appropriate tools for discovering and extracting available knowledge in medical databases. In this study, we divided 11 data mining algorithms into five groups which are applied to a dataset of patient’s clinical variables data with Par...

Publication date: 2005