Keyword selection method for characterizing

نویسندگان

  • Krista Lagus
  • Samuel Kaski
چکیده

Characterization of subsets of data is a recurring problem in data mining. We propose a keyword selection method that can be used for obtaining characterizations of clusters of data whenever textual descriptions can be associated with the data. Several methods that cluster data sets or form projections of data provide an order or distance measure of the clusters. If such an ordering of the clusters exists or can be deduced, the method utilizes the order to improve the characterizations. The proposed method may be applied , for example, to characterizing graph-ical displays of collections of data ordered e.g. with the SOM algorithm. The method is validated using a collection of 10,000 sci-entiic abstracts from the INSPEC database organized on a WEBSOM document map.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Keyword Selection Method for Characterizing Text Document Maps

Characterization of subsets of data is a recurring problem in data mining. We propose a keyword selection method that can be used for obtaining characterizations of clusters of data whenever textual descriptions can be associated with the data. Several methods that cluster data sets or form projections of data provide an order or distance measure of the clusters. If such an ordering of the clus...

متن کامل

Characterizing compromise solutions for investors with uncertain risk preferences

The optimum portfolio selection for an investor with particular preferences was proven to lie on the normalized efficient frontier between two bounds defined by the Ballestero (1998) bounding theorem. A deeper understanding is possible if the decision-maker is provided with visual and quantitative techniques. Here, we derive useful insights as a way to support investor?s decision-making through...

متن کامل

Characterizing compromise solutions for investors with uncertain risk preferences

The optimum portfolio selection for an investor with particular preferences was proven to lie on the normalized efficient frontier between two bounds defined by the Ballestero (1998) bounding theorem. A deeper understanding is possible if the decision-maker is provided with visual and quantitative techniques. Here, we derive useful insights as a way to support investor?s decision-making through...

متن کامل

A Review on Feature Selection MethodsforHigh Dimensional Data

Feature selection has become an important task for effective application of data mining techniquesin real-world high dimensional datasets. It is a process that selects a subset of original features by removing irrelevant and redundant features on the basis of the evaluation criteria without loss of information content. A feature selection method helps to reduce computational complexity of learn...

متن کامل

The Effect of Using Keyword Method on EFL Learners' Learning and Retrieving English Verb Types

This study used keyword method during encoding information in transferring information from short term memory to make the retrieval easier. For this purpose, 50 adult female elementary students were chosen to participate in this study. This study required two groups of learners (control and experimental groups). The experimental group enjoyed some special flashcards which each of them involved ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008