Isolated-word Error Correction for Partially Phonemic Languages using Phonetic Cues

Authors

  • Bhupesh Bansal
  • Monojit Choudhury
  • Pradipta Ranjan Ray
  • Sudeshna Sarkar
  • Anupam Basu
Abstract

Partially phonemic languages use writing systems that fall between strictly phonemic and non-phonemic orthographies. Phonetic errors are therefore very frequent in such languages. This paper introduces an approach to developing spellcheckers for partially phonemic languages that use grapheme-to-phoneme mapping for isolated-word error correction. Since a complete and accurate grapheme-to-phoneme system is overkill for a spellchecker, the framework handles incomplete phonological information through the use of metaphonemes. The paper also discusses the implementation of a Bengali spellchecker based on this approach and some other issues specific to Bengali spell-checking. The framework is generic and can be used for any partially phonemic language by incorporating language-specific parts such as phonological rules, the keyboard layout, and ranking strategies. The approach is very useful for Indian languages, as most of them are partially phonemic.
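As a rough illustration of the idea in the abstract, the sketch below collapses phonetically confusable graphemes into shared metaphoneme symbols, indexes a lexicon by the resulting key, and proposes lexicon words with the same key as corrections. Everything in it is an assumption made for illustration: the romanized toy mapping `METAPHONEME`, the function names, and the use of plain edit distance for ranking. It is not the paper's rule set, its Bengali implementation, or its ranking strategy.

```python
from collections import defaultdict

# Hypothetical toy mapping (not from the paper): graphemes that are easily
# confused phonetically collapse to a single metaphoneme symbol.
METAPHONEME = {
    "sh": "S", "s": "S",
    "ee": "I", "i": "I",
    "oo": "U", "u": "U",
}


def to_metaphonemes(word):
    """Map a word to its metaphoneme key using greedy longest-match."""
    out = []
    i = 0
    while i < len(word):
        for size in (2, 1):  # try two-letter graphemes before single letters
            chunk = word[i:i + size]
            if len(chunk) == size and chunk in METAPHONEME:
                out.append(METAPHONEME[chunk])
                i += size
                break
        else:
            # Unmapped characters pass through unchanged.
            out.append(word[i])
            i += 1
    return "".join(out)


def edit_distance(a, b):
    """Standard Levenshtein distance, used here as a stand-in ranking."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def build_index(lexicon):
    """Group lexicon words by their metaphoneme key."""
    index = defaultdict(list)
    for word in lexicon:
        index[to_metaphonemes(word)].append(word)
    return index


def suggest(word, index):
    """Return lexicon words sharing the input's metaphoneme key, closest first."""
    candidates = index.get(to_metaphonemes(word), [])
    return sorted(candidates, key=lambda c: (edit_distance(word, c), c))


if __name__ == "__main__":
    toy_index = build_index(["desh", "nodi", "bhasha"])
    for typo in ("des", "nodee", "bhasa"):
        print(typo, "->", suggest(typo, toy_index))
```

Because the lookup key only needs to be consistent, not phonetically exact, a coarse mapping like this can stand in for a full grapheme-to-phoneme system; a realistic checker would also need candidates for non-phonetic (for example keyboard-related) errors, which this sketch does not attempt.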


Similar resources

The Effect of Using Phonetic Websites on Iranian EFL Learners’ Word Level Pronunciation

Computer-assisted language learning (CALL) is reaching an uppermost position in the pedagogical field of English as a Second or Foreign Language (ESL/EFL). The present study was carried out to study the effect of using phonetic websites on Iranian EFL students’ pronunciation and knowledge of phonemic symbols. Participants of the study included 30 EFL female pre-intermediate students studyin...


Error correction for speaker-independent isolated word recognition through likelihood compensation using phonetic bigram

We propose an error correction technique for speaker-independent isolated word recognition by compensating for a word's likelihood. Likelihood is compensated for by likelihood calculated by a phonetic bigram. The phonetic bigram is a phoneme model expressing frame correlation within an utterance. A speaker-independent isolated word recognition experiment showed that our proposed technique reduce...


Differences in the Association between Segment and Language: Early Bilinguals Pattern with Monolinguals and Are Less Accurate than Late Bilinguals

Early bilinguals often show as much sensitivity to L2-specific contrasts as monolingual speakers of the L2, but most work on cross-language speech perception has focused on isolated segments, and typically only on neighboring vowels or stop contrasts. In tasks that include sounds in context, listeners' success is more variable, so segment discrimination in isolation may not adequately represent...


Design and implementation of Persian spelling detection and correction system based on Semantic

The Persian language has special features (grapheme, homophone, and multi-shape clinging characters) on electronic devices. Furthermore, the design and implementation of NLP tools for Persian are more challenging than for other languages (e.g. English or German). Spelling tools are widely used for editing user texts such as emails and text in editors. Also, developing Persian tools will provide Persian progr...


Analysis of phonetic transcriptions for Danish automatic speech recognition

Automatic speech recognition (ASR) relies on three resources: audio, orthographic transcriptions and a pronunciation dictionary. The dictionary or lexicon maps orthographic words to sequences of phones or phonemes that represent the pronunciation of the corresponding word. The quality of a speech recognition system depends heavily on the dictionary and the transcriptions therein. This paper pre...




Publication date: 2004