Improved PVSOLA Time-Stretching and Pitch-Shifting for Polyphonic Audio
Authors
Abstract
An advanced phase vocoder technique for high-quality audio pitch shifting and time stretching is described. Its main concept is based on the PVSOLA time-stretching algorithm, which is already known to give good results on monophonic speech. Enhancements are proposed that allow polyphonic material to be processed at equal quality by distinguishing between sinusoidal and noisy frequency components. Furthermore, the latency is reduced to bring the algorithm closer to a real-time implementation. The new algorithm is embedded into a flexible pitch-shifting and time-stretching framework by adding transient detection and resampling. A subjective listening test is used to evaluate the new algorithm and to verify the improvements.
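The abstract does not spell out how time stretching and resampling are combined. A common arrangement, assumed here and not confirmed by the paper, is to time-stretch by the pitch ratio and then resample by its inverse, so that the pitch changes while the duration is set by the stretch factor alone. The function names `pitch_shift_plan` and `resample_linear` are hypothetical, and linear interpolation stands in for a proper band-limited resampler.

```python
def pitch_shift_plan(semitones, stretch=1.0):
    """Return (time_stretch_factor, resample_factor).

    Assumed convention: a factor > 1 lengthens the signal, and the
    resample factor is output_length / input_length.
    """
    ratio = 2.0 ** (semitones / 12.0)  # frequency ratio of the pitch shift
    ts_factor = stretch * ratio        # stretch done by the time-stretching stage
    rs_factor = 1.0 / ratio            # resampling shortens the signal, raising pitch
    return ts_factor, rs_factor


def resample_linear(x, factor):
    """Resample a list of samples by linear interpolation (illustration only)."""
    n_out = max(1, round(len(x) * factor))
    out = []
    for i in range(n_out):
        pos = i / factor                 # fractional read position in the input
        j = int(pos)
        frac = pos - j
        a = x[min(j, len(x) - 1)]
        b = x[min(j + 1, len(x) - 1)]    # clamp at the last sample
        out.append(a + (b - a) * frac)
    return out


if __name__ == "__main__":
    ts, rs = pitch_shift_plan(semitones=12, stretch=1.5)
    print(ts, rs, ts * rs)  # the product equals the requested stretch factor
```

In this sketch the product of the two factors always equals the requested stretch, which is why the resampling stage can change pitch without affecting the final duration.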
Similar resources
Alias-free, Multiresolution Sinusoidal Modeling for Polyphonic, Wideband Audio
In this paper, we describe an improved method of generating more accurate sinusoidal parameters {amplitude, frequency, phase} from a wideband polyphonic audio source in a multiresolution, nonaliased fashion. This significantly improves upon previous work of sinusoidal modeling that assumes a single-pitched monophonic source, such as speech or an individual musical instrument. In addition to a m...
Adaptive Harmonization and Pitch Correction of Polyphonic Audio Using Spectral Clustering
There are several well known harmonization and pitch correction techniques that can be applied to monophonic sound sources. They are based on automatic pitch detection and frequency shifting without time stretching. In many applications it is desired to apply such effects on the dominant melodic instrument of a polyphonic audio mixture. However, applying them directly to the mixture results in ...
Audio Pitch Shifting Using the Constant-Q Transform
Pitch shifting of polyphonic music is usually performed by manipulating the time-frequency representation of the input signal. Most approaches proposed in the past are based on the Fourier transform although its linear frequency bin spacing is known to be inadequate to some degree for analyzing and processing music signals. Recently invertible constant-Q transforms (CQT) featuring high Q-factor...
Pitch Shifting of Audio Signals Using the Constant-Q Transform
Pitch-scale modifications of polyphonic music are usually performed by manipulating the time-frequency representation of the input signal. Most approaches proposed in the past are thereby based on the Fourier transform although its linear frequency bin spacing is known to be inadequate to some degree for analysing and processing music signals. Recently invertible constant-Q transforms (CQT) fea...
Time Stretching & Pitch Shifting with the Web Audio API: Where are we at?
Audio time stretching and pitch shifting are operations that all major commercial and/or open source Digital Audio Workstations, DJ Mixing Software and Live Coding Suites offer. These operations allow users to change the duration of audio files while maintaining the pitch and vice-versa. Such operations enable DJs to speed up or slow down songs in order to mix them by aligning the beats. Unfort...