Drum Detection from Polyphonic Audio via Detailed Analysis of the Time Frequency Domain
نویسنده
چکیده
This publication presents a method for the automatic detection and classification of three distinct drum instruments in real world musical signals. The regarded instruments are kick, snare and hi-hat as agreed by the participants of the contest category Audio Drum Detection within the 2nd Annual Music Information Retrieval Evaluation eXchange (MIREX 2005). There are two challenging issues inherent to drum sound recognition in polyphonic music. The first problem is that the drum sound itself can vary greatly within the same instrument class, due to playing techniques, recording situation and electronic effects. The second apparent problem is the interference and masking with all other instruments sounding simultaneously with the drum in a musical signal, making it difficult to reliably detect occurrences of a certain drum type. The method outlined here achieves a solution to these problems by extending a source separation approach described in earlier publications with spectrogram templates and a more elaborate classification approach. Performance results of the system are given by the outcomes of the Audio Drum Detection contest within the MIREX 2005.
منابع مشابه
Drum Source Separation using Percussive Feature Detection and Spectral Modulation
We present a method for the separation and resynthesis of drum sources from single channel polyphonic mixtures. The frequency domain technique involves identifying the presence of a drum using a novel percussive feature detection function, after which the short-time magnitude spectrum is estimated and scaled according to an estimated time-amplitude function derived from the percussive measure. ...
متن کاملExtraction of Drum Tracks from Polyphonic Music Using Independent Subspace Analysis
The analysis and separation of audio signals into their original components is an important prerequisite to automatic transcription of music, extraction of metadata from audio data, and speaker separation in video conferencing. In this paper, a method for the separation of drum tracks from polyphonic music is proposed. It consists of an Independent Component Analysis and a subsequent partitioni...
متن کاملDrum Sound Detection in Polyphonic Music with Hidden Markov Models
This paper proposes a method for transcribing drums from polyphonic music using a network of connected hidden Markov models (HMMs). The task is to detect the temporal locations of unpitched percussive sounds (such as bass drum or hi-hat) and recognise the instruments played. Contrary to many earlier methods, a separate sound event segmentation is not done, but connected HMMs are used to perform...
متن کاملAutomatic Drum Transcription Using Bi-Directional Recurrent Neural Networks
Automatic drum transcription (ADT) systems attempt to generate a symbolic music notation for percussive instruments in audio recordings. Neural networks have already been shown to perform well in fields related to ADT such as source separation and onset detection due to their utilisation of time-series data in classification. We propose the use of neural networks for ADT in order to exploit the...
متن کاملDrum Transcription Based on Independent Subspace Analysis
In automatic music transcription, metadata extraction from recorded audio data or speaker separation in video conferencing, it is a significant prerequisite task to analyze and separate the audio signal into their original source components. In this report, I study and analyze a set of methods of the extraction of percussive instruments metadata from polyphonic music. It mainly focuses on the s...
متن کامل