Analysis by synthesis speech coding with generalized pitch prediction

نویسندگان

  • Paul Mermelstein
  • Yasheng Qian
چکیده

A new analysis-by-synthesis speech coding structure is presented for high-quality speech coding in the 4 to 8 kb/s range. CELP with generalized pitch prediction (GPP-CELP) di ers from classical code-excited linear prediction (CELP) in that for voiced segments it is the speech signal that is decomposed into a component predictable with the aid of the adaptive codebook (ACB) and a nonpredictable aperiodic component, not the LPC residual. The spectrum of the aperiodic component is estimated by linear-prediction analysis. An approximation to the aperiodic component is synthesized from a stochastic codebook of sparse pulse sequences and its spectrum is shaped by the LPC synthesis lter. The ACB contains samples of the past reconstructed signal, low-passed to increase the pitch prediction gain. For voiced segments the new structure yields higher pitch prediction gain and lower linearprediction gain than classical CELP. Subjective and objective comparisons reveal signi cant advantages for GPP-CELP over classical CELP.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stability and performance analysis of pitch filters in speech coders

This paper analyzes the stability and performance of pitch filters in speech coding when pitch prediction is combined with formant prediction. A computationally simple stability test based on a sufficient condition is formulated for pitch synthesis filters. For typical orders of pitch filters, this sufficient test is very tight. Based on the test, a simple stabilization technique that minimizes...

متن کامل

Low complexity VQ for multi-tap pitch predictor coding

Pitch predictors are successfully used in Linear Prediction Analysis-by-Synthesis (LPAS) coders to model periodicity in speech. The various structures of pitch predictors are investigated and used in LPAS coders. In most of the low bit-rate LPAS coder design, single-tap or three-tap pitch are commonly used. Higher prediction gain can be achieved by using additional taps. 5-tap pitch predictor i...

متن کامل

Analysis-by-synthesis low-rate multimode harmonic speech coding

This paper presents an analysis-by-synthesis multimode harmonic coder (AbS-MHC) that employs new techniques to improve both the speech model accuracy and the parameter estimation robustness in the low rate harmonic coding framework. To improve the speech model accuracy, an enhanced frequency domain transition model is used in conjunction with the sinusoidal model based harmonic coding of voiced...

متن کامل

Pitch Prediction Filters in Speech Coding RAVI

Prediction error filters which combine short-time prediction (formant prediction) with long-time prediction (pitch prediction) in a cascade connection are examined. A number of different solution methods (autocorrelation, covariance, Burg) and implementations (transversal and lattice) are considered. It is found that the F-P cascade (formant filter before the pitch filter) outperforms the P-F c...

متن کامل

Speech Coding at 4.8 kb/s with an Improved Pitch Filter

The reconstructed speech quality in a low bit-rate CELP coder is very dependent on the performance of the pitch filter. In this paper, we present an improved pitch filter, a fractional pseudethree-tap pitch synthesis filter, which performs better than a conventional one-tap pitch filter. We discuss the frequency response of the improved pitch filter. We explore stability issues for three-tap pi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999