Unsupervised Classification of Sound for Multimedia Indexing
نویسندگان
چکیده
Segmenting audio streams in a signi cant manner and clustering sound segments objectively, is a signi cant challenge due to the nature of audio data. This paper presents some preliminary work on clustering sound segments based on frequency and harmonic characteristics. New metrics for comparing the similarity of sound segments are also devised.
منابع مشابه
On supervision and statistical learning for semantic multimedia analysis
Media analysis for video indexing is witnessing an increasing influence of statistical techniques. Examples of these techniques include the use of generative models as well as discriminant techniques for video structuring, classification, summarization, indexing, and retrieval. There is increasing emphasis on reducing the amount of supervision and user interaction needed to construct and utiliz...
متن کاملUnsupervised Clustering of Heart Sound Recordings for Cardiac Auscultation Database Indexing
This study proposes an unsupervised framework for classifying heart sound data. Its goal is to cluster unknown heart sound recordings, such that each cluster contains sound recordings belonging to the same heart diseases or normal heart beat category. The proposed framework is more flexible than the conventional supervised classification of heart sounds by the case when heart sound data belong ...
متن کاملOn the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks
A video‘s soundtrack is usually highly correlated to its content. Hence, audio-based techniques have recently emerged as a means for video concept detection complementary to visual analysis. Most state-of-the-art approaches rely on manual definition of predefined sound concepts such as “engine sounds”, “outdoor/indoor sounds”. These approaches come with three major drawbacks: manual definitions...
متن کاملDeep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning
Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...
متن کاملUnsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification
In this paper, we attempt to represent audio as a sequence of acoustic units using unsupervised learning and use them for multi-class classification. We expect the acoustic units to represent sounds or sound sequences to automatically create a sound alphabet. We use audio from multi-class Youtube-quality multimedia data to converge on a set of sound units, such that each audio file is represent...
متن کامل