Multi-class Support Vector Machine Active Learning for Music Annotation

نویسندگان

  • Gang Chen
  • Tian-jiang Wang
  • Li-yu Gong
  • Perfecto Herrera
چکیده

Music annotation is an important research topic in the multimedia area. One of the challenges in music annotation is how to reduce the human effort in labeling music files for building reliable classification models. In the past, there have been many studies on applying support vector machine active learning methods to automatic multimedia data annotation, which try to select the most informative examples for labeling manually. Most of these studies focused on selecting a single unlabeled example in each iteration process for binary classification. As a result, the model has to be retrained after each labeled example is solicited, and the user is likely to lose patience after a few rounds of labeling. In this paper, we present a novel multi-class active learning algorithm that can select multiple music examples for labeling in each iteration process. The key of the multi-sample selection for multi-class active learning is how to reduce the redundancy and avoid selecting the outliers among the selected examples such that each example provides unique information for model updating. To this end, we propose the distance diversity and set density in the support vector machine feature space as the measurement of the scatter of the selected sample set. Experiment results on two music data sets demonstrate the effectiveness of our method. Moreover, although our criterion is designed for music annotation, it can be used in a general frame work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fault diagnosis in a distillation column using a support vector machine based classifier

Fault diagnosis has always been an essential aspect of control system design. This is necessary due to the growing demand for increased performance and safety of industrial systems is discussed. Support vector machine classifier is a new technique based on statistical learning theory and is designed to reduce structural bias. Support vector machine classification in many applications in v...

متن کامل

Support Vector Machine Active Learning for Music Mood Tagging

Active learning is a subfield of machine learning based on the idea that the accuracy of an algorithm can be improved with fewer training samples if it is allowed to choose the data from which it learns. We present the results for Support Vector Machine (SVM) active learning experiments for music mood tagging based on a multi-sample selection strategy that chooses samples according to their pro...

متن کامل

Supervised Machine Learning based Medical Image Annotation and Retrieval

This paper presents the approaches and experimental results of image annotation and retrieval in our first participation of ImageCLEFmed 2005. In this work, we investigate a supervised learning approach to associate low-level global image features with their high level visual and/or semantic categories for image annotation and retrieval. For automatic image annotation, we represent input images...

متن کامل

Feature Selection Using Multi Objective Genetic Algorithm with Support Vector Machine

Different approaches have been proposed for feature selection to obtain suitable features subset among all features. These methods search feature space for feature subsets which satisfies some criteria or optimizes several objective functions. The objective functions are divided into two main groups: filter and wrapper methods.  In filter methods, features subsets are selected due to some measu...

متن کامل

Active Learning with Support Vector Machines

This thesis examines the use of support vector machines for active learning using linear, polynomial and radial basis function kernels. In our experiments we used named entity recognition which was treated as a binary task and as a multiclass task and we also tackled shallow parsing. We report savings in annotation costs ranging from 80% to 95% depending on the task. We observed that the distri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008