Non-native English speech recognition using bilingual English lexicon and acoustic models
نویسندگان
چکیده
This paper proposes an English speech recognition system which can recognize both non-native (i.e. Japanese) and native English speakers’ pronunciation of English speech. The system uses a bilingual pronunciation lexicon in which each word has both English and Japanese phoneme transcriptions. The Japanese transcription is constructed considering typical Japanese pronunciation of English. Japanese and English acoustic models are used in recognizing both transcriptions, and the highest-likelihood word sequence obtained in combining with native Englishand Japanese-pronounced words is the recognition result. Continuous speech recognition experiments show that the proposed system greatly improves Japanese-English speech recognition performance while maintaining the same performance level as that of a purely native English recognition system.
منابع مشابه
Speaker adaptation for non-native speakers using bilingual English lexicon and acoustic models
This paper proposes a supervised speaker adaptation method that is effective for both non-native (i.e. Japanese) and native English speakers’ pronunciation of English speech. This method uses English and Japanese phoneme acoustic models and a pronunciation lexicon in which each word has both English and Japanese phoneme transcriptions. The same utterances are used for adaptation of both acousti...
متن کاملMultilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints
In this paper we present an automated approach for non-native speech recognition. We introduce a new phonetic confusion concept that associates sequences of native language (NL) phones to spoken language (SL) phones. Phonetic confusion rules are automatically extracted from a non-native speech database for a given NL and SL using both NL’s and SL’s ASR systems. These rules are used to modify th...
متن کاملEnglish-Spanish Bilingual Alphabet for Embedded Speech Recognition
This article introduces the phonetic alphabet that has been used to train acoustic models with a mixture of Spanish language and American English data, with the purpose of improving the speech recognition performance, when using Spanish, for speakers that are fluent in both languages, as is very frequently the case in the USA Spanish speaking population. We target a decoder that can be used in ...
متن کاملImproving ASR performance on non-native speech using multilingual and crosslingual information
This paper presents our latest investigation of automatic speech recognition (ASR) on non-native speech. We first report on a non-native speech corpus an extension of the GlobalPhone database which contains English with Bulgarian, Chinese, German and Indian accent and German with Chinese accent. In this case, English is the spoken language (L2) and Bulgarian, Chinese, German and Indian are the ...
متن کاملUsing English Acoustic Models for Hindi Automatic Speech Recognition
Bilingual speakers of Hindi and English often mix English and Hindi together in their everyday conversations. This motivates us to build a mix language Hindi-English recognizer. For this purpose, we need well-trained English and Hindi recognizers. For training our English recognizer we have at our disposal many hours of annotated English speech data. For Hindi, however, we have very limited res...
متن کامل