Multi-Dialectical Languages Effect on Speech Recognition
نویسندگان
چکیده
Research has shown that automatic speech recognition (ASR) performance typically decreases when evaluated on a dialectal variation of the same language that was not used for training its models. Similarly, models simultaneously trained on a group of dialects tend to underperform when compared to dialect-specific models. When trying to decide which dialect-specific model (recognizer) to use to decode an utterance (e.g., a voice search query), possible strategies include automatically detecting the spoken dialect or following the user’s language preferences as set in his/her cell phone. In this paper, we observe that user’s voice search queries are usually directed to a dialect-specific recognizer that does not match the user’s current location, and present a study that shows that automatically selecting the recognizer based on the user’s geographical location helps improve the user experience. Keywords—multi-dialectical languages; speech recognition; voice search
منابع مشابه
Acoustic modelling for speech recognition in Indian languages in an agricultural commodities task domain
In developing speech recognition based services for any task domain, it is necessary to account for the support of an increasing number of languages over the life of the service. This paper considers a small vocabulary speech recognition task in multiple Indian languages. To configure a multi-lingual system in this task domain, an experimental study is presented using data from two linguistical...
متن کاملEfficient Acoustic Modeling Method for Unsupervised Speech Recognition using Multi-Task Deep Neural Network
This paper proposes a method of acoustic modeling for zero-resourced languages speech recognition under mismatch conditions. In those languages, very limited or no transcribed speech is available for traditional monolingual speech recognition. Conventional methods such as IPA based universal acoustic modeling has been proved to be effective under matched acoustic conditions (similar speaking st...
متن کاملMulti-lingual speech recognition system for speech-to-speech translation
This paper describes the speech recognition module of the speech-to-speech translation system being currently developed at ATR. It is a multi-lingual large vocabulary continuous speech recognition system supporting Japanese, English and Chinese languages. A corpusbased statistical approach was adopted for the system design. The database we collected consists of more than 600 000 sentences cover...
متن کاملMulti-lingual Fingerspelling Recognition in a Kiosk for the Handicapped
This paper presents the design and evaluation of a multi-lingual fingerspelling recognition module that is designed for an information terminal. Through the use of multimodal input and output methods, the information terminal acts as a communication medium between deaf and blind people. The system converts fingerspelled words to speech and vice versa using fingerspelling recognition, fingerspel...
متن کاملTowards Language-Universal End-to-End Speech Recognition
Building speech recognizers in multiple languages typically involves replicating a monolingual training recipe for each language, or utilizing a multi-task learning approach where models for different languages have separate output labels but share some internal parameters. In this work, we exploit recent progress in end-to-end speech recognition to create a single multilingual speech recogniti...
متن کامل