Vocal tract and area function estimation with both lip and glottal losses
نویسندگان
چکیده
Traditional algorithms simplify the lattice recursion for evaluation of the PARCOR’s by localizing the loss in vocal tract at one of its ends, the lips or the glottis. In this paper we present a framework for mapping to pseudo areas the VT transfer function with no rigid constraints on the losses in system, thereby allowing losses to be present at both the lips and glottis. This method allows us to calculate the reflection coefficients at both the glottis (rG) and the lips (rLip). The area functions obtained from these new PARCOR’s, have better temporal (inter-frame) and spatial (intra-frame) predictability.
منابع مشابه
Estimating the vocal-tract area function and the derivative of the glottal wave from a speech signal
We present a new method for estimating the vocal-tract area functions from speech signals. First, we point out and correct a long-standing sign error in some literature related to the derivation of the acoustic reflection coefficients of the vocal tract from a speech signal. Next, to eliminate the influence of the glottal wave on the estimation of the vocal-tract filter, we estimate the vocal-t...
متن کاملShape parameter estimate for a glottal model without time position
From a recorded speech signal, we propose to estimate a shape parameter of a glottal model without estimating his time position. Indeed, the literature usually propose to estimate the time position first (ex. by detecting Glottal Closure Instants). The vocal-tract filter estimate is expressed as a minimum-phase envelope estimation after removing the glottal model and a standard lips radiation m...
متن کاملThe Effect of Glottal Opening on the Acoustic Response of the Vocal Tract
In this study, a geometrically simplified physical model of the vocal tract, larynx and sub-glottal tract is used in the presence of an external applied sound field to quantify the changes to the acoustic response of the vocal tract with changing glottal width. The applied sound field is tonal, generated by a loudspeaker, at frequencies ranging from 100 Hz to 2 kHz. The acoustic response of the...
متن کاملUsual to Particular Phonatory Situations Studied with High-speed Videoendoscopy
Current high-speed videoendoscopy (HSV) make it possible to obtain 4000 images of the larynx per second. By this process, the analysis of the vocal folds can provide significant information. This is also possible to estimate the area of the glottis. All this information is useful for the study of the various phonatory modes, but also for glottal flow estimation which allows the improvement of o...
متن کاملContinuous Voice Morphing Using Separated Vocal Tract Area Functions and Glottal Source Waves
This paper presents a flexible voice morphing method, which is based on a conversion using a linear combination of the vocal tract area functions estimated from speech signals. The method focuses on the continuity of the phonological identity of the overall interpolated area. The main features of the method are 1) to separate characteristics of the vocal tract resonances from those of glottal s...
متن کامل