The multimodal multipulse excitation vocoder
نویسندگان
چکیده
This paper presents a new high-quality, variable-rate vocoder in which the average bit-rate is parametrically controllable. The new vocoder is intended for use with data-voice simultaneous channel (DVSC) applications, in which the speech data is transmitted simultaneously with video and other types of data. The vocoder presented in this paper achieves state-of-the-art quality at several different bit-rates between 5.5 Kbps and 10 Kbps. Further, it achieves this performance at acceptable levels of complexity and delay.
منابع مشابه
Maximum-take-precedence ACELP: a low complexity search method
The ACELP method makes use of multipulse structure to represent the excitation pulses of residual signal. With the purpose of computational complexity reduction, this paper provides the Maximum-TakePrecedence ACELP (MTP-ACELP) search method under the acceptable degradation in performance. Because the maximum of target signal is preferentially compensated, the degradation of performance would be...
متن کاملSelection of excitation vectors for the CELP coders
In this paper, we investigate several algorithms that construct the input for the synthesis filter in the CELP coder, we present them under the same formalism, and we compare their performances. We model the excitation vector by a linear combination of K signals, which are issued from K codebooks and multiplied by K associated gains. We demonstrate that this generalized form incorporates severa...
متن کاملA Glottal Vocoder Employing Vector Quantization
This paper describes a speech coder for low bit rates using a parametric representation of voiced excitation waveforms (Glottal ARX) and standard LPC for unvoiced. For efficient compression purposes the excitation and spectrum parameters are quantized with vector quantization (VQ). This has resulted in a glottal vocoder operating at 1320 bits/s and sounding more natural than a standard LPC voco...
متن کاملEfficient multipulse approximation of speech excitation using the most singular manifold
We propose a novel approach to find the locations of the multipulse sequence that approximates the speech source excitation. This approach is based on the notion of Most Singular Manifold (MSM) which is associated to the set of less predictable events. The MSM is formed by identifying (directly from the speech waveform) multiscale singularities which may correspond to significant impulsive exci...
متن کاملMultipulse Sequences for Residual Signal Modeling
In source-filter models of speech production, the residual signal what remains after passing the speech signal through the inverse filter contains important information for the generation of naturally sounding re-synthesized speech. Typically, the voiced regions of residual signals are regarded as a mixture of glottal pulse and noise. This paper introduces a novel approach to represent the nois...
متن کامل