A Generic System for audio indexing: application to speech/music segmentation and music genre recognition

Author

G. Peeters

Abstract

In this paper we present a generic system for audio indexing (classification/segmentation) and apply it to two common problems: speech/music segmentation and music genre recognition. We first present some requirements for the design of a generic system. Its training part is based on a succession of four steps: feature extraction, feature selection, feature space transform, and statistical modeling. We then propose several approaches for the indexing part, depending on the local or global characteristics of the indexes to be found. In particular, we propose the use of segment-statistical models. The system is then applied to the two problems. The first is the speech/music segmentation of a radio stream. The application is developed in a real industrial framework using real-world categories and data. The performance obtained for the pure speech/music classes is good. However, when the non-pure categories (mixed, bed) are also considered, the performance of the system drops. The second problem is music genre recognition. Since the indexes to be found are global, “segment-statistical models” are used, leading to results close to the state of the art.
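
As a rough illustration of the four training steps named in the abstract, the following Python sketch chains feature extraction, feature selection, a feature space transform, and statistical modeling, and summarizes each segment by the mean and variance of its frame-level features. Every specific choice here is an assumption made for the sketch and is not taken from the paper: energy and zero-crossing rate as features, a Fisher-style ratio for selection, z-scoring as the transform, diagonal Gaussians as the class models, and synthetic "tone" and "noise" signals standing in for real audio classes. All function names are hypothetical.

```python
import math
import random
import statistics

def frame_features(signal, frame_len=256):
    """Step 1 (feature extraction): per-frame energy and zero-crossing rate."""
    feats = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        zcr = sum(1 for a, b in zip(frame, frame[1:]) if a * b < 0) / (frame_len - 1)
        feats.append([energy, zcr])
    return feats

def segment_stats(frames):
    """Describe a whole segment by the mean and variance of each frame feature."""
    cols = list(zip(*frames))
    return [statistics.fmean(c) for c in cols] + [statistics.pvariance(c) for c in cols]

def fisher_scores(X, y):
    """Step 2 (feature selection): between-class over within-class variance."""
    scores = []
    for j in range(len(X[0])):
        col = [row[j] for row in X]
        overall = statistics.fmean(col)
        between = within = 0.0
        for c in sorted(set(y)):
            vals = [row[j] for row, lab in zip(X, y) if lab == c]
            between += len(vals) * (statistics.fmean(vals) - overall) ** 2
            within += len(vals) * statistics.pvariance(vals)
        scores.append(between / (within + 1e-12))
    return scores

def train(X, y, keep=2):
    scores = fisher_scores(X, y)
    selected = sorted(range(len(scores)), key=lambda j: -scores[j])[:keep]
    Xs = [[row[j] for j in selected] for row in X]
    # Step 3 (feature space transform): z-score each kept feature (assumed choice).
    mu = [statistics.fmean(c) for c in zip(*Xs)]
    sd = [statistics.pstdev(c) or 1.0 for c in zip(*Xs)]
    Xt = [[(v - m) / s for v, m, s in zip(row, mu, sd)] for row in Xs]
    # Step 4 (statistical modeling): one diagonal Gaussian per class.
    models = {}
    for c in sorted(set(y)):
        rows = [r for r, lab in zip(Xt, y) if lab == c]
        means = [statistics.fmean(col) for col in zip(*rows)]
        variances = [statistics.pvariance(col) + 1e-6 for col in zip(*rows)]
        models[c] = (means, variances)
    return {"selected": selected, "mu": mu, "sd": sd, "models": models}

def classify(model, x):
    """Indexing: pick the class whose Gaussian gives the highest log-likelihood."""
    z = [(x[j] - m) / s for j, m, s in zip(model["selected"], model["mu"], model["sd"])]
    def loglik(means, variances):
        return sum(-0.5 * (math.log(2 * math.pi * v) + (a - m) ** 2 / v)
                   for a, m, v in zip(z, means, variances))
    return max(model["models"], key=lambda c: loglik(*model["models"][c]))

def make_signal(kind, rng, n=2048):
    """Synthetic stand-ins for two audio classes (not real speech or music)."""
    if kind == "tone":
        return [math.sin(2 * math.pi * 5 * t / n) + 0.01 * rng.gauss(0, 1) for t in range(n)]
    return [rng.gauss(0, 1) for _ in range(n)]

rng = random.Random(0)
labels = ["tone", "noise"] * 10
X = [segment_stats(frame_features(make_signal(k, rng))) for k in labels]
model = train(X, labels)
print(classify(model, segment_stats(frame_features(make_signal("noise", rng)))))
```

Because each segment is reduced to a single vector of statistics before classification, the decision is made once per segment, which is the sense in which such a model suits global indexes like genre. A frame-by-frame decision would instead call `classify` on each frame's features.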

Similar resources

A Generic Training and Classification System for MIREX08 Classification Tasks: Audio Music Mood, Audio Genre, Audio Artist and Audio Tag

This extended abstract details a submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2008 for the training and classification tasks audio music mood, audio genre, audio artist and audio tag. The same system has been submitted for the various tasks without any adaptations to the specific problems. The system named ircamclassification is a generic system which performs batch...

Automatic Music Genre Recognition

Nowadays, automatic analysis of music signals has gained considerable importance due to the growing amount of music data found on the Web. Music genre classification is one of the interesting research areas in music information retrieval systems. In this paper several techniques were implemented and evaluated for music genre classification including feature extraction, feature selection and m...

Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion

Recently, audio segmentation has attracted research interest because of its usefulness in several applications like audio indexing and retrieval, subtitling, monitoring of acoustic scenes, etc. Moreover, a previous audio segmentation stage may be useful to improve the robustness of speech technologies like automatic speech recognition and speaker diarization. In this article, we present the eva...

MIREX-09 “Music Mood, Mixed-Genre, Latin-Genre and Classical Composer Classification” Tasks: ircamclassification08 Submission

This extended abstract details a submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2009 for the training and classification tasks “Music Mood, Mixed-Genre, Latin-Genre and Classical Composer Classification”. Ircam has submitted two systems: ircamclassification08 (GP) which is the same system as the one submitted for MIREX-08 and ircamclassification09 (BP) which is a...

Robust singing detection in speech/music discriminator design

In this paper, an approach for robust singing signal detection in speech/music discrimination is proposed and applied to applications of audio indexing. Conventional approaches in speech/music discrimination can provide reasonable performance with regular music signals but often perform poorly with singing segments. This is due mainly to the fact that speech and singing signals are extremely cl...

Publication date: 2007
