A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals
Abstract
In this paper, we describe the concept of music scene description and address the problem of detecting melody and bass lines in real-world audio signals containing the sounds of various instruments. Most previous pitch-estimation methods have had difficulty dealing with such complex music signals because these methods were designed to deal with mixtures of only a few sounds. To enable estimation of the fundamental frequency (F0) of the melody and bass lines, we propose a predominant-F0 estimation method called PreFEst that does not rely on the unreliable fundamental component and obtains the most predominant F0 supported by harmonics within an intentionally limited frequency range. This method estimates the relative dominance of every possible F0 (represented as a probability density function of the F0) by using MAP (maximum a posteriori probability) estimation and considers the F0's temporal continuity by using a multiple-agent architecture. Experimental results with a set of ten music excerpts from compact-disc recordings showed that a real-time system implementing this method was able to detect melody and bass lines about 80% of the time these existed. © 2004 Elsevier B.V. All rights reserved.
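The idea of estimating "the relative dominance of every possible F0" can be illustrated with a small sketch. The code below is not the PreFEst implementation: it assumes a hypothetical tone model (Gaussian bumps at the first four harmonics on a log-frequency axis in cents, with 1/h amplitudes) and fits the mixture weights with a plain maximum-likelihood EM loop, without the prior that a MAP estimate would add. The names `tone_model` and `estimate_f0_weights` and all parameter values are illustrative assumptions.

```python
import math


def tone_model(f0_cents, axis, n_harmonics=4, sigma=20.0):
    """Hypothetical tone model: Gaussian bumps at the harmonics of f0.

    `axis` is a list of log-frequency positions in cents. The result is
    normalized so that it sums to 1 over the (limited) axis.
    """
    values = []
    for x in axis:
        v = 0.0
        for h in range(1, n_harmonics + 1):
            centre = f0_cents + 1200.0 * math.log2(h)  # h-th harmonic in cents
            v += (1.0 / h) * math.exp(-0.5 * ((x - centre) / sigma) ** 2)
        values.append(v)
    total = sum(values)
    return [v / total for v in values]


def estimate_f0_weights(spectrum, axis, candidates, n_iter=30):
    """Estimate one weight per candidate F0 with EM (weights sum to 1).

    The observed spectrum is treated as a probability distribution and
    modelled as a weighted mixture of tone models, one per candidate F0.
    This is maximum likelihood only; no prior is applied.
    """
    total = sum(spectrum)
    observed = [s / total for s in spectrum]
    models = [tone_model(c, axis) for c in candidates]
    weights = [1.0 / len(candidates)] * len(candidates)
    for _ in range(n_iter):
        new_weights = [0.0] * len(candidates)
        for i, p in enumerate(observed):
            mix = sum(w * m[i] for w, m in zip(weights, models))
            if mix <= 0.0:
                continue  # no candidate explains this bin
            for k, (w, m) in enumerate(zip(weights, models)):
                # E-step responsibility, accumulated for the M-step
                new_weights[k] += p * w * m[i] / mix
        norm = sum(new_weights)
        weights = [nw / norm for nw in new_weights]
    return weights
```

Under these assumptions, the candidate with the largest weight plays the role of the predominant F0 for one frame. Tracking that peak over time, which the abstract assigns to a multiple-agent architecture, is outside the scope of this sketch.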
Similar resources
A Real-time Music Scene Description System: Detecting Melody and Bass Lines in Audio Signals
This paper describes a predominant-pitch estimation method that enables us to build a real-time system detecting melody and bass lines as a subsystem of our music scene description system. The purpose of this study is to build such a real-time system that is practical from the engineering viewpoint, that gives suggestions to the modeling of music understanding, and that is useful in various appl...
F0 Estimation of Melody and Bass Lines in Real-world Musical Audio Signals
This paper describes a method for estimating the fundamental frequency (F0) of melody and bass lines in monaural audio signals containing sounds of various instruments. Most previous methods assumed mixtures of only a few sounds and had great difficulty dealing with audio signals sampled from compact discs. Our method does not rely on the unreliable F0 component and obtains the most predominant F...
A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings
This paper describes a robust method for estimating the fundamental frequency (F0) of melody and bass lines in monaural real-world musical audio signals containing sounds of various instruments. Most previous F0-estimation methods had great difficulty dealing with such complex audio signals because they were designed to deal with mixtures of only a few sounds. To make it possible to estimate the...
A Predominant-F0 Estimation Method for Real-world Musical Audio Signals
In this paper we describe a robust method, called PreFEst, for estimating the fundamental frequency (F0) of melody and bass lines in monaural audio signals containing sounds of various instruments. Most previous F0-estimation methods have difficulty dealing with such complex audio signals because they are designed for mixtures of only a few sounds. Without assuming the number of sound sources, ...
A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals
In this paper I introduce a method, called PreFEst, for estimating the fundamental frequency (F0) of simultaneous sounds in monaural polyphonic audio signals. Most previous F0-estimation methods have had difficulty dealing with such complex audio signals because these methods were designed to deal with mixtures of only a few sounds. Without assuming the number of sound sources, PreFEst can esti...
Journal:
- Speech Communication
Volume 43, Issue
Pages -
Publication date: 2004