Melody Extraction from Polyphonic Audio Signal MIREX 2009
Abstract
This paper describes the algorithm submitted to the MIREX 2009 “Audio Melody Extraction” task, which extracts the predominant melody pitch from a polyphonic audio signal in three steps. In the first step, transient analysis is performed on the polyphonic signal to determine the analysis frame length, and a fixed number of pitch candidates is then obtained by ranking the weights of the harmonic structure of the windowed signal. In the second step, a single dominant pitch sequence (the melody line) is selected from the many possible pitch sequences based on the following properties of a melody line: (1) the nominal dynamic range of a singing vibrato is ±60 to 200 cents, whereas it is only ±20 to 30 cents for instruments; (2) melody transitions are typically limited to one octave; (3) a rest during singing is often longer than 50 ms. In the third step, a smoothing process refines the estimated pitch sequence.
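The second and third steps can be illustrated with a minimal Python sketch. This is not the submitted implementation: the abstract does not say how the melody line is selected or smoothed, so the dynamic-programming search, the median filter, the use of `0.0` to mark unvoiced frames, and all function and parameter names below are assumptions made for illustration. Property (1), the vibrato range, is not modeled here.

```python
import math
from statistics import median

OCTAVE_CENTS = 1200.0


def cents_between(f1, f2):
    """Interval from f1 to f2 in cents (both in Hz, both > 0)."""
    return 1200.0 * math.log2(f2 / f1)


def select_melody(candidates, max_jump_cents=OCTAVE_CENTS):
    """Pick one pitch per frame by dynamic programming (assumed method).

    candidates: list of frames, each a non-empty list of (freq_hz, weight).
    Maximizes the summed weight while forbidding frame-to-frame jumps
    larger than max_jump_cents (property 2 in the abstract).
    """
    score = [w for _, w in candidates[0]]
    back = []
    for prev, cur in zip(candidates, candidates[1:]):
        new_score, pointers = [], []
        for f_cur, w_cur in cur:
            best, best_i = float("-inf"), 0
            for i, (f_prev, _) in enumerate(prev):
                jump = abs(cents_between(f_prev, f_cur))
                if jump <= max_jump_cents and score[i] > best:
                    best, best_i = score[i], i
            new_score.append(best + w_cur)  # stays -inf if unreachable
            pointers.append(best_i)
        score = new_score
        back.append(pointers)
    # Backtrack from the best final candidate.
    idx = max(range(len(score)), key=score.__getitem__)
    path = [idx]
    for pointers in reversed(back):
        idx = pointers[idx]
        path.append(idx)
    path.reverse()
    return [candidates[t][i][0] for t, i in enumerate(path)]


def bridge_short_rests(pitches, hop_ms, min_rest_ms=50.0):
    """Fill interior unvoiced runs (0.0) shorter than min_rest_ms.

    One reading of property 3: a gap shorter than 50 ms is unlikely
    to be a real rest, so it is filled with the preceding pitch.
    """
    out = list(pitches)
    t = 0
    while t < len(out):
        if out[t] == 0.0:
            start = t
            while t < len(out) and out[t] == 0.0:
                t += 1
            interior = start > 0 and t < len(out)
            if interior and (t - start) * hop_ms < min_rest_ms:
                out[start:t] = [out[start - 1]] * (t - start)
        else:
            t += 1
    return out


def median_smooth(pitches, width=3):
    """Median filter as a stand-in for the unspecified smoothing step."""
    half = width // 2
    return [median(pitches[max(0, t - half): t + half + 1])
            for t in range(len(pitches))]


frames = [
    [(220.0, 1.0), (880.0, 0.4)],
    [(222.0, 0.9), (900.0, 0.5)],
    [(1000.0, 1.2), (225.0, 0.8)],  # strongest candidate is > 1 octave away
    [(221.0, 1.0), (890.0, 0.3)],
]
melody = select_melody(frames)
```

In this toy input, picking the heaviest candidate in each frame independently would jump to 1000 Hz in the third frame; the one-octave constraint keeps the selected line near 220 Hz instead.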
Similar resources
Melody Extraction in Music Audio Signals by Melodic Component Enhancement and Pitch Tracking
This extended abstract is for the “Audio Melody Extraction” contest of MIREX 2009. We describe an algorithm that estimates the melody line from a music audio signal. The algorithm comprises two stages: melodic component enhancement and melody line tracking. Only a few researchers have used this approach because melody enhancement is difficult. Our enhancement algorithm focuses on temp...
Melody Extraction based on Harmonic Coded Structure
This paper considers a melody extraction algorithm that estimates the melody in polyphonic audio using the harmonic coded structure (HCS) to model the melody in the minimum mean-square-error (MMSE) sense. The HCS consists of harmonically modulated sinusoids whose amplitudes are defined by a set of codewords. The considered algorithm performs melody extraction in two steps: i) pitch-candidate estimation and i...
Melody Extraction from Polyphonic Audio Based on Particle Filter
This paper considers a particle-filter-based algorithm to extract the melody from polyphonic audio in the short-time Fourier transform (STFT) domain. The extraction focuses on overcoming the difficulties caused by harmonic and percussive sound interference, possible octave mismatch, and dynamic variation in the melody. The main idea of the algorithm is to consider probabilistic relations betwee...
Mid-Level Music Melody Representation of Polyphonic Audio for Query-by-Humming System
Recently, great attention has been paid to content-based multimedia retrieval, which enables users to find and locate audio-visual materials according to the intrinsic characteristics of the target. Query-by-humming (QBH) is one such application: it performs retrieval based on a major characteristic of music, namely the "melody". There has been some research on QBH systems, most of which are to retrieve m...
Extraction of the Melody Pitch Contour from Polyphonic Audio
MIREX 2005 is the second evaluation of algorithms related to music information retrieval (MIR). This document describes our submission to the MIREX audio melody extraction contest addressing the task of identifying the melody pitch contour from polyphonic musical audio. We use mainly a data-driven approach – implementing standard audio signal processing techniques like Fourier analysis, instant...