Using English Acoustic Models for Hindi Automatic Speech Recognition
نویسندگان
چکیده
Bilingual speakers of Hindi and English often mix English and Hindi together in their everyday conversations. This motivates us to build a mix language Hindi-English recognizer. For this purpose, we need well-trained English and Hindi recognizers. For training our English recognizer we have at our disposal many hours of annotated English speech data. For Hindi, however, we have very limited resources. Therefore, in this paper we are proposing methods for rapid development of a Hindi speech recognizer using (i) trained English acoustic models to replace Hindi acoustic models; and (ii) adapting Hindi acoustic models from English acoustic models using Maximum Likelihood Linear Regression. We propose using data-driven methods for both substitution and adaptation. Our proposed recognizer has an accuracy of 96% for recognizing isolated Hindi words.
منابع مشابه
Cross Lingual Modelling Experiments for Indonesian
The extension of Large Vocabulary Continuous Speech Recognition (LVCSR) to resource poor languages such as Indonesian is hindered by the lack of transcribed acoustic data and appropriate pronunciation lexicons. Research has generally been directed toward establishing robust cross-lingual acoustic models, with the assumption that phonetic lexicons are readily available. This is not the case for ...
متن کاملHindi Speech Recognition and Online Speaker Adaptation
Speaker Adaptation is a technique which is used to improve the recognition accuracy of Automatic Speech Recognition (ASR) systems. Here, we report a study of the impact of online speaker adaptation on the performance of a speaker independent, continuous speech recognition system for Hindi language. The speaker adaptation is performed using the Maximum Likelihood Linear Regression (MLLR) transfo...
متن کاملAllophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملMultilingual speech recognition: a unified approach
In this paper, we present a unified approach for hidden markov model based multilingual speech recognition. The proposed approach could be used across acoustically similar as well as diverse languages. We use an automatic phone mapping algorithm to map phones across languages and reduce the effective number of phones in the multililingual acoustic model. We experimentally verify the effectivene...
متن کاملContinuous Hindi Speech Recognition using Monophone based Acoustic Modeling
Speech is a natural way of communication and it provides an intuitive user interface to machines. Although the performance of automatic speech recognition (ASR) system is far from perfect. The overall performance of any speech recognition system is highly depends on the acoustic modeling. Hence generation of an accurate and robust acoustic model holds the key to satisfactory recognition perform...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012