(Semi-)Predictive Discretization During Model Selection

نویسنده

  • Arto Klami
چکیده

Data discretization is needed for various reasons. One reason is that there are many machine learning algorithms that can only be applied to discrete data. In order to use those algorithms, we need to discretize the data. We might also want to do that for solely computational reasons; some problems are easier to compute for discrete variables. Finally, if we know that our data is discrete, but we only have noisy continuous measurements, we would naturally want to discretize the data to correspond with the underlying discrete values.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predictive Discretization During Model Selection

We present an approach to discretizing multivariate continuous data while learning the structure of a graphical model. We derive a joint scoring function from the principle of predictive accuracy, which inherently ensures the optimal trade-off between goodness of fit and model complexity including the number of discretization levels. Using the socalled finest grid implied by the data, our scori...

متن کامل

Solving Linear Semi-Infinite Programming Problems Using Recurrent Neural Networks

‎Linear semi-infinite programming problem is an important class of optimization problems which deals with infinite constraints‎. ‎In this paper‎, ‎to solve this problem‎, ‎we combine a discretization method and a neural network method‎. ‎By a simple discretization of the infinite constraints,we convert the linear semi-infinite programming problem into linear programming problem‎. ‎Then‎, ‎we use...

متن کامل

A New Hybrid Framework for Filter based Feature Selection using Information Gain and Symmetric Uncertainty (TECHNICAL NOTE)

Feature selection is a pre-processing technique used for eliminating the irrelevant and redundant features which results in enhancing the performance of the classifiers. When a dataset contains more irrelevant and redundant features, it fails to increase the accuracy and also reduces the performance of the classifiers. To avoid them, this paper presents a new hybrid feature selection method usi...

متن کامل

Snapshot Location in Proper Orthogonal Decomposition for Linear and Semi-linear Parabolic Partial Differential Equations

It is well-known that the performance of POD and POD-DEIM methods depends on the selection of the snapshot locations. In this work, we consider the selections of the locations for POD and POD-DEIM snapshots for spatially semi-discretized linear or semi-linear parabolic PDEs. We present an approach that for a fixed number of snapshots the optimal locations may be selected such that the global di...

متن کامل

The Role of Discretization Parameters in Sequence Rule Evolution

As raw data become available in ever-increasing amounts, there is a need for automated methods that extract comprehensible knowledge from the data. In our previous work we have applied evolutionary algorithms to the problem of mining predictive rules from time series. In this paper we investigate the effect of discretization on the predictive power of the evolved rules. We compare the effects o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004