Coding of Excitation Signals In a Waveform Interpolation Speech Coder

نویسنده

  • Mohammad M. A. Khan
چکیده

The goal of this thesis is to improve the quality of the Waveform Interpolation (WI) coded speech at 4.25 kbps. The quality improvement is focused on the efficient coding scheme of voiced speech segments, while keeping the basic coding format intact. In the WI paradigm voiced speech is modelled as a concatenation of the Slowly Evolving pitch-cycle Waveforms (SEW). Vector quantization is the optimal approach to encode the SEW magnitude at low bit rates, but its complexity imposes a formidable barrier. Product code vector quantizers (PC-VQ) are a family of structured VQs that circumvent the complexity obstacle. The performance of product code VQs can be traded off against their storage and encoding complexity. This thesis introduces split/shape-gain VQ—a hybrid product code VQ, as an approach to quantize the SEW magnitude. The amplitude spectrum of the SEW is split into three non-overlapping subbands. The gains of the three subbands form the gain vector which are quantized using the conventional Generalized Lloyd Algorithm (GLA). Each shape vector obtained by normalizing each subband by its corresponding coded gain is quantized using a dimension conversion VQ along with a perceptually based bit allocation strategy and a perceptually weighted distortion measure. At the receiver, the discontinuity of the gain contour at the boundary of subbands introduces buzziness in the reconstructed speech. This problem is tackled by smoothing the gain versus frequency contour using a piecewise monotonic cubic interpolant. Simulation results indicate that the new method improves speech quality significantly. The necessity of SEW phase information in the WI coder is also investigated in this thesis. Informal subjective test results demonstrate that transmission of SEW magnitude encoded by split/shape-gain VQ and inclusion of a fixed phase spectrum drawn from a voiced segment of a high-pitched male speaker obviates the need to send phase information.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Pitch Pulse Evolution Model for a Dual Excitation Linear Predictive Speech Coder

This paper introduces a new technique to model the excitation waveform for a linear predictive speech coder The target appli cation is high quality speech coding for rates near kb s Our pitch pulse evolution model decomposes the excitation into two separate but simultaneous signals the evolving pitch pulse com ponent and the unvoiced noise like contribution A number of formulations for decompos...

متن کامل

A Pitch Pulse Evolution Model for a Dual ExcitationLinear Predictive Speech

This paper introduces a new technique to model the excitation waveform for a linear predictive speech coder. The target application is high quality speech coding for rates near 4 kb/s. Our pitch pulse evolution model decomposes the excitation into two separate but simultaneous signals: the evolving pitch pulse component and the unvoiced, noise-like contribution. A number of formulations for dec...

متن کامل

Very low rate speech coding using temporal decomposition and waveform interpolation

In very low rate coding the aim is to accurately represent speech characteristics as efficiently as possible. High coding gains for the spectral features can be achieved through the use of temporal decomposition. Waveform interpolation coders accurately represent the excitation using characteristic waveforms (CWs) extracted at a constant rate. In this paper, the two approaches are combined into...

متن کامل

A Low-complexity Improved WI Speech Coding at 2kbps

The waveform interpolation (WI) speech coding presents a good performance at low bit rate. However, the algorithm has a very high complexity in computation. In this paper, a low-complexity improved waveform interpolation speech coder at 2kbps is proposed. The improved coding scheme has greatly reduced the computational complexity and improved the reconstructed speech quality by using various te...

متن کامل

Wideband Speech Coding at 4 kbps using Waveform Interpolation

In this paper we present a new low rate, wideband speech coder operating at 4 kbps and based on Waveform Interpolation (WI). An outline of WI speech coding is provided together with a description of its adaptation to wideband speech. Particular emphasis is placed on the quantisation of the WI parameters. Included is a detailed analysis of the quantisation requirements for the Line Spectral Freq...

متن کامل

A new low bit rate speech coder based on intraframe waveform interpolation

A new characteristic waveform (CW) interpolation coder is proposed in this paper. In the proposed coder, two characteristic waveforms are extracted from LPC residual signal at each frame. The Waveform Interpolation (WI) is operated within the frame. In the novel WI, variable dimension vector quantization (VDVQ) and power vector quantization are proposed and the low frequency band (LFB) and high...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001