Estimation of Articulatory Parameters

نویسنده

  • Li Deng
چکیده

This paper presents our research work of estimating articulatory states and model parameters from speech acoustics. This work represents a prerequisite for a speech recognition system based on articulatory dynamical models. The model parameter estimation was based on the vowels obtained from 590 sentences of 59 speakers from TIMIT speech database, whereas the state estimation experiments have been done using the vowels from 100 sentences of 10 speakers from TIMIT. The 10 English vowels investigated have the following transcription: /AA/, /AE/, /AH/, /AO/, /EH/, /EY/, /IH/, /IY/, /UH/ and /UW/. For each vowel, articulatory dynamical models were created using secondorder dynamical systems. The states of these models represent the positions of articulators such as lips, tongue and pharynx. We used 8 state parameters to represent these articulators and 3 formant frequencies to represent the acoustic observation vectors. The nonlinear relationship between articulatory state vector and speech acoustics has been approximated using a piecewise linear approximation on small regions in the articulatory space. This linearization was performed using a codebook of 610,000 pairs of articulatory and acoustic vectors created using the Metropolis algorithm. The whole codebook was then linearized on about 7,500 regions using a vector quantization method. The articulatory model parameters and states were estimated using the EM (expectation-maximization) algorithm and Kalman ltering techniques. The results obtained in estimating the articulatory parameters encourage us to apply this technique to automatic speech recognition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acoustic-to-articulatory inversion using a speaker-normalized HMM-based speech production model

Acoustic-to-articulatory inverse mapping is a difficult problem because of its non-linear and oneto-many characteristics. We have previously developed a speech inversion method using a hidden Markov model (HMM)-based speech production model which takes into account the phonemespecific dynamic constraints of articulatory parameters. We found that the constraint significantly decreases the estima...

متن کامل

Articulatory synthesis using corpus-based estimation of line spectrum pairs

An attempt to define a new articulatory synthesis method, in which the speech signal is generated through a statistical estimation of its relation with articulatory parameters, is presented. A corpus containing acoustic material and simultaneous recordings of the tongue and facial movements was used to train and test the articulatory synthesis of VCV words and short sentences. Tongue and facial...

متن کامل

Articulatory synthesis using corpus-based e

An attempt to define a new articulatory synthesis method, in which the speech signal is generated through a statistical estimation of its relation with articulatory parameters, is presented. A corpus containing acoustic material and simultaneous recordings of the tongue and facial movements was used to train and test the articulatory synthesis of VCV words and short sentences. Tongue and facial...

متن کامل

The Accurate Estimation of Articulatory Synthesiser Parameters through Reducing the Degree of Saturation

A new method is proposed to correctly estimate the parameters of an articulatory speech synthesiser using a MLP neural network. This is achieved through modifying the statistical characteristic of the acoustic input pattern vectors in order to prevent the activation level of the hidden nodes from approaching saturation. The technique results in considerably faster neural learning and a more acc...

متن کامل

Estimation of Articulatory Synthesiser Parameters from Pseudo-formants

The articulatory speech synthesiser is likely to be the ultimate solution to the synthesis of natural sounding, intelligible speech [I]. Yet, the problem of estimating articulatory parameters, from a given speech signal, remains a challenge although remarkable attempts have been reported within the literature towards this end. [ 2-61. This paper presents a new technique for the accurate estimat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998