Automatic Pronunciation Assessment for Mandarin Chinese: Approaches and System Overview
نویسندگان
چکیده
This paper presents the algorithms used in a prototypical software system for automatic pronunciation assessment of Mandarin Chinese. The system uses forced alignment of HMM (Hidden Markov Models) to identify each syllable and the corresponding log probability for phoneme assessment, through a ranking-based confidence measure. The pitch vector of each syllable is then sent to a GMM (Gaussian Mixture Model) for tone recognition and assessment. We also compute the similarity of scores for intensity and rhythm between the target and test utterances. All four scores for phoneme, tone, intensity, and rhythm are parametric functions with certain free parameters. The overall scoring function was then formulated as a linear combination of these four scoring functions of phoneme, tone, intensity, and rhythm. Since there are both linear and nonlinear parameters involved in the overall scoring function, we employ the downhill Simplex search to fine-tune these parameters in order to approximate the scoring results obtained from a human expert. The experimental results demonstrate that the system can give consistent scores that are close to those of a human’s subjective evaluation.
منابع مشابه
Automatic Mandarin pronunciation scoring for native learners with dialect accent
This paper studies pronunciation scoring algorithm in CALL system aiming at teaching native Chinese learn standard Mandarin. Most of the pronunciation scoring algorithms focus on non-native environment, which may not be suitable for native speakers. We bring up a new algorithm based on traditional posterior log-likelihood algorithm by weighting the initial part of Mandarin syllables, where fina...
متن کاملiCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent
We present iCALL, a speech corpus designed to evaluate Mandarin Chinese pronunciation patterns of non-native speakers of European descent, developed at the Institute for Infocomm Research (IR) in Singapore. To the best of our knowledge, iCALL is larger than any reported non-native corpora to date in terms of utterance number, duration, and number of speakers: iCALL consists of 90,841 utterances...
متن کاملPronunciation variation modeling for Mandarin with accent
In order to solve the problem of the performance decrease when state-of-art automatic speech recognition (ASR) system facing accent speech, we propose the Pronunciation Variation Model (PVM). Two approaches are proposed to construct the PVM in this paper. 6.38% and 7.78% relative error rate reduction is achieved for Shanghai and Wuhan accent mandarin, respectively. The experiment on these two t...
متن کاملExplicit Pronunciation Training Using Automatic Speech Recognition Technology
A system is described, provisionally named Pronto, which uses automatic speech recognition (ASR) for training pronunciation of second languages in adult learners. The first version of Pronto was developed for native speakers of American English learning Spanish and for Mandarin Chinese speakers learning English. Pronto grows out of work in the Indiana Speech Training Aid (ISTRA) research progra...
متن کاملAutomatic Pronunciation Assessment for Mandarin Proficiency Test Based on HMM
Objective pronunciation assessment plays a very important role in the Mandarin Proficiency Test. But it still has a long way to go before it reaches the level of success. In this paper, the novel Mandarin objective pronunciation assessment pronunciation of is proposed. The standard of Mandarin pronunciation is divided into six levels. The mandarin pronunciation is divided into consonant, vowel ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJCLCLP
دوره 12 شماره
صفحات -
تاریخ انتشار 2007