A Generic System for audio indexing: application to speech/music segmentation and music genre recognition
نویسنده
چکیده
In this paper we present a generic system for audio indexing (classification/ segmentation) and apply it to two usual problems: speech/ music segmentation and music genre recognition. We first present some requirements for the design of a generic system. The training part of it is based on a succession of four steps: feature extraction, feature selection, feature space transform and statistical modeling. We then propose several approaches for the indexing part depending of the local/ global characteristics of the indexes to be found. In particular we propose the use of segment-statistical models. The system is then applied to two usual problems. The first one is the speech/ music segmentation of a radio stream. The application is developed in a real industrial framework using real world categories and data. The performances obtained for the pure speech/ music classes problem are good. However when considering also the non-pure categories (mixed, bed) the performances of the system drop. The second problem is the music genre recognition. Since the indexes to be found are global, “segment-statistical models” are used leading to results close to the state of the art.
منابع مشابه
A Generic Training and Classification System Formirex08 Classification Tasks: Audio Music Mood, Audio Genre, Audio Artist and Audio Tag
This extended abstract details a submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2008 for the training and classification tasks audio music mood, audio genre, audio artist and audio tag. The same system has been submitted for the various tasks without any adaptations to the specific problems. The system named ircamclassification is a generic system which performs batch...
متن کاملشناسایی خودکار سبک موسیقی
Nowadays, automatic analysis of music signals has gained a considerable importance due to the growing amount of music data found on the Web. Music genre classification is one of the interesting research areas in music information retrieval systems. In this paper several techniques were implemented and evaluated for music genre classification including feature extraction, feature selection and m...
متن کاملAudio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion
Recently, audio segmentation has attracted research interest because of its usefulness in several applications like audio indexing and retrieval, subtitling, monitoring of acoustic scenes, etc. Moreover, a previous audio segmentation stage may be useful to improve the robustness of speech technologies like automatic speech recognition and speaker diarization. In this article, we present the eva...
متن کاملMirex-09 “music Mood, Mixed-genre, Latin-genre and Classical Composer Classification” Tasks: Ircamclassification08 Submission
This extended abstract details a submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2009 for the training and classification tasks “Music Mood, MixedGenre, Latin-Genre and Classical Composer classification” tasks. Ircam has submitted two systems: ircamclassification08 (GP) which is the same system as the one submitted for MIREX-08 and ircamclassification09 (BP) which is a...
متن کاملRobust singing detection in speech/music discriminator design
In this paper, an approach for robust signing signal detection in speech/music discrimination is proposed and applied to applications of audio indexing. Conventional approaches in speech/music discrimination can provide reasonable performance with regular music signals but often perform poorly with singing segments. This is due mainly to the fact that speech and singing signals are extremely cl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007