A real-time music-scene-description system: predominant-F0 estimation for detecting melody and bass lines in real-world audio signals

نویسنده

  • Masataka Goto
چکیده

In this paper, we describe the concept of music scene description and address the problem of detecting melody and bass lines in real-world audio signals containing the sounds of various instruments. Most previous pitch-estimation methods have had difficulty dealing with such complex music signals because these methods were designed to deal with mixtures of only a few sounds. To enable estimation of the fundamental frequency (F0) of the melody and bass lines, we propose a predominant-F0 estimation method called PreFEst that does not rely on the unreliable fundamental component and obtains the most predominant F0 supported by harmonics within an intentionally limited frequency range. This method estimates the relative dominance of every possible F0 (represented as a probability density function of the F0) by using MAP (maximum a posteriori probability) estimation and considers the F0 s temporal continuity by using a multiple-agent architecture. Experimental results with a set of ten music excerpts from compact-disc recordings showed that a real-time system implementing this method was able to detect melody and bass lines about 80% of the time these existed. 2004 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Real-time Music Scene Description System: Detecting Melody and Bass Lines in Audio Signals

This paper describes a predominant-pitch estimation method that enables us to build a realtime system detecting melody and bass lines as a subsystem of our music scene description system. The purpose of this study is to build such a real-time system that is practical from the engineering viewpoint, that gives suggestions to the modeling of music understanding, and that is useful in various appl...

متن کامل

F0 Estimation of Melody and Bass Lines in Real-world Musical Audio Signals

This paper describes a method for estimating the fundamental frequency (F0) of melody and bass lines in monaural audio signals containing sounds of various instruments. Most previous methods premised mixtures of a few sounds and had great difficulty dealing with audio signals sampled from compact discs. Our method does not rely on the unreliable F0’s component and obtains the most predominant F...

متن کامل

A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings

This paper describes a robust method for estimating the fundamental frequency (F0) of melody and bass lines in monaural realworld musical audio signals containing sounds of various instruments. Most previous F0-estimation methods had great difficulty dealing with such complex audio signals because they were designed to deal with mixtures of only a few sounds. To make it possible to estimate the...

متن کامل

A Predominant-F0 Estimation Method for Real-world Musical Audio Signals

In this paper we describe a robust method, called PreFEst, for estimating the fundamental frequency (F0) of melody and bass lines in monaural audio signals containing sounds of various instruments. Most previous F0-estimation methods have difficulty dealing with such complex audio signals because they are designed for mixtures of only a few sounds. Without assuming the number of sound sources, ...

متن کامل

A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals

In this paper I introduce a method, called PreFEst, for estimating the fundamental frequency (F0) of simultaneous sounds in monaural polyphonic audio signals. Most previous F0-estimation methods have had difficulty dealing with such complex audio signals because these methods were designed to deal with mixtures of only a few sounds. Without assuming the number of sound sources, PreFEst can esti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Speech Communication

دوره 43  شماره 

صفحات  -

تاریخ انتشار 2004