Improved Pvsola Time-stretching and Pitch-shifting for Polyphonic Audio

نویسندگان

  • Sebastian Kraft
  • Martin Holters
  • Adrian von dem Knesebeck
  • Udo Zölzer
چکیده

An advanced phase vocoder technique for high quality audio pitch shifting and time stretching is described. Its main concept is based on the PVSOLA time stretching algorithm which is already known to give good results on monophonic speech. Some enhancements are proposed to add the ability to process polyphonic material at equal quality by distinguishing between sinusoidal and noisy frequency components. Furthermore, the latency is reduced to get closer to a real time implementation. The new algorithm is embedded into a flexible pitch shifting and time stretching framework by adding transient detection and resampling. A subjective listening test is used to evaluate the new algorithm and to verify the improvements.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Alias-free, Multiresolution Sinusoidal Modeling for Polyphonic, Wideband Audio

In this paper, we describe an improved method of generating more accurate sinusoidal parameters famplitude, frequency, phaseg from a wideband polyphonic audio source in a multiresolution, nonaliased fashion. This significantly improves upon previous work of sinusoidal modeling that assumes a single-pitched monophonic source, such as speech or an individual musical instrument. In addition to a m...

متن کامل

Adaptive Harmonization and Pitch Correction of Polyphonic Audio Using Spectral Clustering

There are several well known harmonization and pitch correction techniques that can be applied to monophonic sound sources. They are based on automatic pitch detection and frequency shifting without time stretching. In many applications it is desired to apply such effects on the dominant melodic instrument of a polyphonic audio mixture. However, applying them directly to the mixture results in ...

متن کامل

Audio Pitch Shifting Using the Constant-Q Transform

Pitch shifting of polyphonic music is usually performed by manipulating the time-frequency representation of the input signal. Most approaches proposed in the past are based on the Fourier transform although its linear frequency bin spacing is known to be inadequate to some degree for analyzing and processing music signals. Recently invertible constant-Q transforms (CQT) featuring high Q-factor...

متن کامل

Pitch Shifting of Audio Signals Using the Constant-q Transform

Pitch-scale modifications of polyphonic music are usually performed by manipulating the time-frequency representation of the input signal. Most approaches proposed in the past are thereby based on the Fourier transform although its linear frequency bin spacing is known to be inadequate to some degree for analysing and processing music signals. Recently invertible constant-Q transforms (CQT) fea...

متن کامل

Time Stretching & Pitch Shifting with the Web Audio API: Where are we at?

Audio time stretching and pitch shifting are operations that all major commercial and/or open source Digital Audio Workstations, DJ Mixing Software and Live Coding Suites offer. These operations allow users to change the duration of audio files while maintaining the pitch and vice-versa. Such operations enable DJs to speed up or slow down songs in order to mix them by aligning the beats. Unfort...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012