An Amharic speech corpus for large vocabulary continuous speech recognition

Authors

  • Solomon Teferra Abate
  • Wolfgang Menzel
  • Bairu Tafila
Abstract

Amharic has a rich morphology, which gives rise to many word forms.

Phonetics: Amharic has a set of 38 phones: seven vowels and thirty-one consonants.

Consonants, by manner of articulation, voicing, and place of articulation:

Manner        Voicing       Lab     Dent        Pal      Vel                Glo
Stops         Voiceless     [p]     [t]         [tʃ]     [k]                [ʔ]
              Voiced        [b]     [d]         [dʒ]     [g]
              Glottalized   [pʼ]    [tʼ]        [tʃʼ]    [q]
              Rounded                                    [kʷ], [gʷ], [qʷ]
Fricatives    Voiceless     [f]     [s]         [ʃ]                         [h]
              Voiced                [z]         [ʒ]
              Glottalized           [sʼ]
              Rounded                                                       [hʷ]
Nasals        Voiced        [m]     [n]         [ɲ]
Liquids       Voiced                [l], [r]
Semivowels    Voiced        [w]                 [j]

Similar resources

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB), one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, IRIB's archive is one of the richest in Iran, containing a huge amount of multimedia data. Monitoring this massive volume of data, and browsing and retrieval of this archive, is one of the key issues for this broadcasting...

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

Development of Large Vocabulary Continuous Speech Recognition Using Phonetically Structured Speech Corpus

This paper presents the results of acoustic modeling used in a Large Vocabulary Continuous Speech Recognition (LVCSR) system designed with the use of a phonetically controlled large vocabulary corpus. Evaluation experiments showed that relatively good speech recognition results may be obtained with adequate training material, taking into account: a) the presence of lexical stress; b) speech sty...

First steps in building a large vocabulary continuous speech recognition system for Vietnamese

This paper presents an overview of our activities for building a Large Vocabulary Continuous Speech Recognition (LVCSR) system for Vietnamese implemented at CLIPS-IMAG Laboratory (France) and International Research Center MICA (Vietnam). Firstly, a new methodology for fast text corpora acquisition for minority languages which has been applied to Vietnamese is proposed. Secondly, the first resul...

Publication date: 2005