Improving pronunciation modeling for non-native speech recognition
نویسندگان
چکیده
In this paper, three different approaches to pronunciation modeling are investigated. Two existing pronunciation modeling approaches, namely the pronunciation dictionary and n-best rescoring approach are modified to work with little amount of non-native speech. We also propose a speaker clustering approach, which capable of grouping the speakers based on their pronunciation habits. Given some speech, the approach can also be used for pronunciation adaptation. This approach is called latent pronunciation analysis. The results show that conventional pronunciation dictionary perform slightly better than n-best list rescoring, while the latent pronunciation analysis has shown to be beneficial for speaker clustering, and it can produce nearly the same improvement as the pronunciation dictionary approach, without the need to know the origin of the speaker.
منابع مشابه
Non-native Pronunciation Variation Modeling for Automatic Speech Recognition
Communication using speech is inherently natural, with this ability of communication unconsciously acquired in a step-by-step manner throughout life. In order to explore the benefits of speech communication in devices, there have been many research works performed over the past several decades. As a result, automatic speech recognition (ASR) systems have been deployed in a range of applications...
متن کاملFrame-Level Selective Decoding Using Native and Non-native Acoustic Models for Robust Speech Recognition to Native and Non-native Speech
v Regarded as a mismatch problem between the training and test conditions § Training condition: native speech § Testing condition: non-native speech § Widely used methods in speaker or environment adaptation v Research works dedicated to non-native ASR § Acoustic modeling § Pronunciation modeling § Language modeling § Hybrid modeling § Many researches uses a small amount of non-native speech Wh...
متن کاملIntegration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition
To recognize non-native speech, larger acoustic/linguistic distortions must be handled adequately in acoustic modeling, language modeling, lexical modeling, and/or decoding strategy. In this paper, a novel method to enhance MLLR adaptation of acoustic models for non-native speech recognition is proposed. In the case of native speech recognition, MLLR speaker adaptation was successfully introduc...
متن کاملModeling context and language variation for non-native speech recognition
Non-native speakers often face difficulty in pronouncing like the native speakers. This paper proposes to model pronunciation variation in non-native speaker’s speech using only acoustics models, without the need for the corpus. Variation in term of context and language will be modeled. The combination of both modeling resulted in the reduction of absolute WER as much as 16% and 6% for native V...
متن کاملSPEECH RECOGNITION BY GOATS, WOLVES, SHEEP and ... NON-NATIVES
This paper gives an overview of current understanding of acoustic-phonetic issues arising when trying to recognize speech from non-native speakers. Regional accents can be modeled by systematic shifts in pronunciation. These can often better be represented by multiple models, than by pronunciation variants in the dictionary. The problem of non-native speech is much more difficult because it is ...
متن کامل