A data reduction method to estimate vowel distributions and its use in comparing two formant estimation methods

نویسندگان

  • Tadashi Sakata
  • Yuichi Ueda
چکیده

Speech features such as formants of vowels uttered by many talkers are considered to form a normal distribution in each phoneme on a feature space. However, those features may apparently show the different dispersions peculiar to the estimation methods. Therefore, if the correct distributions can be found by a credible method, it will make clear the definition of feature estimation errors so that the comparative evaluations between the feature estimation methods will become possible. In this paper, we first propose the data reduction method to estimate true formant distributions of vowels. In the method, we apply the principal component analysis to the formant data of each vowel on a F1-F2 space to search an average value and a three-sigma ellipse. If the average and the ellipse are searched iteratively after removing the outside data of the ellipse regarded as errors, they finally converge. The proportion of the data samples within the final ellipse to all data will be different in the formant estimation methods. We consider that the estimation method of larger proportion is higher in the accuracy because of the high trust. IFC (Inverse Filter Control) method, in which formants are estimated from zero-crossing information, has been compared with LPC method under the above criteria. As a result of the analysis using vowels in words, it has been shown that the IFC method is superior to the LPC in the proportion and the ratio of area in the final ellipse to that in initial one. The proportions of data within the final ellipses are 90-96% in IFC and 84-95% in LPC, which are obtained from five Japanese vowels uttered by 20 males. The intuition obtained by observing the states of distributions supports the numerals in the analysis. Based on the results, we conclude that the formant estimation using zero-crossing information (IFC) is more effective than that by spectral shapes (LPC).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Production of English Lexical Stress by Persian EFL Learners

This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...

متن کامل

Study and Comparison of Formant Characteristics of Persian Vowels in 4-7-year-old Children Using Cochlear Implants and Those Using Hearing Aids

Background and Objective: One of the most important physical properties of vowels is their formant structure. One of the most obvious speech errors in hearing-impaired children is vowel errors. The present study aimed to determine and compare the formant structure of Persian vowels in deaf and cochlear implant children in the age range of 4-7 years. Materials and Methods: This descriptive-anal...

متن کامل

Investigating Interaural Frequency-Place Mismatches via Bimodal Vowel Integration

For patients having residual hearing in one ear and a cochlear implant (CI) in the opposite ear, interaural place-pitch mismatches might be partly responsible for the large variability in individual benefit. Behavioral pitch-matching between the two ears has been suggested as a way to individualize the fitting of the frequency-to-electrode map but is rather tedious and unreliable. Here, an alte...

متن کامل

Modelling of Lithuanian Speech Diphthongs

The goal of the paper is to get a method of Lithuanian speech diphthong modelling. We use a formant-based synthesizer for this modelling. The second order quasipolynomial has been chosen as the formant model in time domain. A general diphthong model is a multi-input and single-output (MISO) system, that consists of two parts where the first part corresponds to the first vowel of the diphthong a...

متن کامل

بررسی ساختار سازه‌ای واکه‌های زبان فارسی در بزرگ‌سالان دوزبانه آذری فارسی

Objective: Vowels are the center of syllables while formant structures are one of the most important acoustic characteristics of speech sounds that help in their articulatory and perceptual aspects. Formants represent the shape and size of the vocal tract. There exist trivial differences between the vocal tracts of different people due to which the formant structures of a vowel in one person ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010