VECTOR QUANTIZATION AND SCALAR LINEAR PREDICTION FOR WAVEFORM CODING OF SPEECH AT 16 kbls
نویسندگان
چکیده
This thesis is an investigation of Vector Quantization, Scalar Linear Prediction and other related signal processing techniques, with the purpose of providing high quality, low delay speech waveform coding at medium data rates (16 kbls). Speech waveform coding systems based on adaptive scalar prediction and adaptive scalar quantization have been used to provide toll quality coded speech at high rates such as 32 kbls (ADPCM). However, the performance of these systems is known to degrade to sub-toll quality at 16 kbls, due to excessive quantization noise. Vector Quantization (VQ) is well known to provide a significant reduction in quantization noise over scalar quantization; in fact VQ can be shown to have a theoretically optimal rate-distortion performance at very large vector dimensions. This suggests that the performance of 16 kbls ADPCM may be sigmf5cantly improved by replacing the scalar quantizer with a vector quantizer. The resulting configuration, called Vector ADPCM, has an inherently high complexity; however, techniques are described which reduce the complexity to the level where implementation with commercially available digital hardware is feasible. Vector ADPCM is found to provide a 3-dB performance improvement over scalar ADPCM, with a 15 times increase in complexity, while still maintaining an encodingldecoding delay of less than 2 milliseconds. Adaptive Postfiltering significantly improves the subjective quality of the coded speech. Informal listening tests indicate that the coded speech is of very good communications quality.
منابع مشابه
A Continuously Adaptive Vector Predictive Coder (avpc) for Speech Encoding
In this work we present a waveform speech c~ ding system including vector quantization. This system can be seen as a vector version of the scalar ADPCM speech coder. In such system the speech samples are grouped in vectors that are coded by a vector quantizer when its prediction subtraction has been made. This is obtained in the coder by a vector Due to the non-statio narity of the speech signa...
متن کاملAPVQ encoder applied to wideband speech coding
This paper describes a coding scheme for broadband speech (sampling frequency 16KHz). We present a wideband speech encoder called APVQ (Adaptive Predictive Vector Quantization). It combines Subband Coding, Vector Quantization and Adaptive Prediction as it is represented in Fig.1. Speech signal is split in 16 subbands by means of a QMF filter bank and so every subband is 500Hz wide. This APVQ en...
متن کاملQuantization techniques pdf
This paper proposes some fast and simple quantization techniques to display.The main reason for adopting different techniques in vector quantizers VQ is to design an optimal quantizer. Since the actual probability distributions of image.Abstract: Image Compression is a technique for competently coding digital. Vector Quantization VQ is a block-coding technique that quantizes blocks of data.Defi...
متن کاملEfficient vector quantization of LPC parameters at 24 bits/frame
Abstruct-Linear predictive coding (LPC) parameters are widely used in various speech processing applications for representing the spectral envelope information of speech. For low bit rate speech-coding applications, it is important to quantize these parameters accurately using as few bits as possible. Though the vector quantizers are more efficient than the scalar quantizers, their use for accu...
متن کاملCOMPANDED LATTICE VQ FOR EFFICIENT PARAMETRIC LPC QUANTIZATION (ThuPmPO1)
Source coding based on Gaussian Mixture Models (GMM) has been recently proposed for LPC quantization. We address in this paper the related problem of designing efficient codebooks for Gaussian vector sources. A new technique of ellipsoidal lattice vector quantization (VQ) is described, based on 1) scalar companding optimized for Gaussian random variables and 2) rectangular lattice codebooks wit...
متن کامل