Optimal transform for segmented parametric speech coding

Authors

  • Damith J. Mudugamuwa
  • Alan B. Bradley
Abstract

In voice coding applications where there is no constraint on the encoding delay, such as store and forward message systems or voice storage, segment coding techniques can be used to achieve a reduction in data rate without compromising the level of distortion. For low data rate linear predictive coding schemes, increasing the encoding delay allows one to exploit any long term temporal stationarities on an interframe basis, thus reducing the transmission bandwidth or storage needs of the speech signal. Transform coding has previously been applied in low data rate speech coding to exploit both the interframe and the intraframe correlation [9][2]. This paper investigates the potential for optimising the transform for segmented parametric representation of speech.
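As a rough illustration of how transform coding can exploit interframe correlation within a segment, the sketch below applies an orthonormal DCT-II along the time axis of one parameter track and keeps only the first few coefficients. The DCT is an assumed stand-in for a fixed transform, not the optimised transform the paper investigates, and the 8-frame track values and the function names (`dct_matrix`, `transform_track`, `inverse_track`) are hypothetical.

```python
import math


def dct_matrix(n):
    """Build the n x n orthonormal DCT-II matrix (row k = k-th basis vector)."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * t + 1) * k / (2 * n))
                     for t in range(n)])
    return rows


def transform_track(track):
    """Forward transform of one parameter track across the frames of a segment."""
    n = len(track)
    c = dct_matrix(n)
    return [sum(c[k][t] * track[t] for t in range(n)) for k in range(n)]


def inverse_track(coeffs):
    """Inverse transform; for an orthonormal matrix this is the transpose."""
    n = len(coeffs)
    c = dct_matrix(n)
    return [sum(c[k][t] * coeffs[k] for k in range(n)) for t in range(n)]


# Hypothetical segment: one spectral parameter over 8 consecutive frames.
# The values change slowly from frame to frame (high interframe correlation).
track = [0.30, 0.31, 0.33, 0.34, 0.36, 0.37, 0.39, 0.40]

coeffs = transform_track(track)

# Keep the first two coefficients and zero the rest (assumed truncation rule).
kept = coeffs[:2] + [0.0] * (len(coeffs) - 2)
approx = inverse_track(kept)

total_energy = sum(x * x for x in coeffs)
kept_energy = sum(x * x for x in kept)
max_err = max(abs(a - b) for a, b in zip(track, approx))

print("fraction of energy kept:", kept_energy / total_energy)
print("max reconstruction error:", max_err)
```

Because the track varies slowly, most of its energy falls into the lowest coefficients, so fewer values need to be quantised per segment. A data-dependent transform, which is the direction the abstract points to, would replace `dct_matrix` with a basis derived from the statistics of the parameters.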

Similar resources

Adaptive transformation for segmented parametric speech coding

In voice coding applications where there is no constraint on the encoding delay, segment coding techniques can be used to achieve a reduction in data rate. For low data rate linear predictive coding schemes, increasing the encoding delay allows one to exploit any long term temporal stationarities on an interframe basis, thus reducing the transmission bandwidth or storage needs of the speech sig...

Improvements to the Switched Parametric & Transform Audio Coder

In this paper, we introduce improvements to previous sines + transients + noise audio modeling systems, including new sinusoidal trajectory selection and quantization procedures. In previous work [1], the audio is first segmented into transient and non-transient regions. The transient region is modeled using traditional transform coding techniques, while the non-transient regions are modeled us...

Efficient Block Coding of Images Using Gaussian Mixture Models

An efficient method for block coding of speech was presented by Rao and Subramaniam in [7]. An adaptation of this method for the use in image coding is presented in this paper. The probability density function (PDF) of the image blocks is estimated and modelled as multivariate Gaussian mixtures using the k-means and Expectation-Maximisation (EM) algorithms. This parametric model is incorporated...

Error Protection and Concealment for HILN MPEG-4 Parametric Audio Coding

The HILN (Harmonic and Individual Lines plus Noise) MPEG-4 parametric audio coding tool allows efficient representation of general audio signals at very low bit rates. Therefore possible applications include transmission over IP or wireless channels which are both characterised by specific transmission error models. On the other hand, since parametric audio coding is a relatively new technique ...

Spectral Coding of Speech LSF Parameters Using Karhunen-Loeve Transform

In this paper, the use of the optimal Karhunen-Loeve (KL) transform for quantization of speech line spectrum frequency (LSF) coefficients is studied. Both scalar quantizer (SQ) and vector quantizer (VQ) schemes are developed to efficiently encode the transform parameters after applying a one- or two-dimensional KL transform. Furthermore, the SQ schemes are also combined with entropy coding by using Hu...

Publication date: 1998