Pitch-synchronous time-scaling for high-frequency excitation regeneration

نویسندگان

  • João P. Cabral
  • Luís C. Oliveira
چکیده

The goal of bandwidth extension of speech (BWE) is to extrapolate the missing low or high frequency components of the wide-band speech (50-8000 Hz) based entirely on information contained in a narrow-band signal (300-3400 Hz). In this paper we propose a new method for high-frequency regeneration of the excitation signal, using the correlation between the shape of the glottal flow waveform and the spectrum of the voice source. The high-band excitation is generated by performing a pitch-synchronous time-scale (PSTS) transformation on the linear prediction narrow-band residual to generate an high-pass signal that retains the periodic characteristics of the original signal but with a larger open quotient. This method is easy to implement and does not introduce discontinuities in the spectrum of the regenerated excitation. It can be used in applications for BWE where no side information is transmitted or for low bit coding of wide-band speech.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals

Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...

متن کامل

High-Quality Speech Modification Based on Pitch- Synchronous Harmonic and Non-harmonic Modeling of Speech

In this paper, we propose a high-quality speech modification method based on pitch-synchronous harmonic and non-harmonic modeling of speech. In the proposed method, the harmonic and non-harmonic parts of speech are modeled by the sum of sinusoids with frequencies corresponding to pitch multiples and with randomized frequencies, respectively. Then, harmonic and nonharmonic parts are synthesized ...

متن کامل

Issues in high quality LPC analysis and synthesis

This paper deals with careful non-real-time LPC analysis. A baseline system is first described. lt uses a pitch-synchronous covariancemethod analysis with a laryngograph signal providing the pitch synchrony. Work to improve the voicing decision and F0 determination and to find a better voiced excitation waveform is described. Setting a lower Iimit on the value of B 1 is found to be useful. Buzz...

متن کامل

Time -frequency analysis of vocal source signal for speaker recognition

This paper investigates the importance of spectrotemporal characteristics of the source excitation signal for speaker recognition. We propose an effective feature extraction technique for obtaining essential timefrequency information from the linear prediction (LP) residual signal, which are closely related to the glottal excitation of individual speaker. With pitch synchronous analysis, wavele...

متن کامل

Pitch-synchronous time-scaling for prosodic and voice quality transformations

Current time-domain pitch modification techniques have well known limitations for large variations of the original fundamental frequency. This paper proposes a technique for changing the pitch and duration of a speech signal based on time-scaling the linear prediction (LP) residual. The resulting speech signal achieves better quality than the traditional LP-PSOLA method for large fundamental fr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005