Persian continuous speech recognition

Context dependent modeling in continuous speech recognition based on a Persian phonetic decision tree

Journal: The Modares Journal of Electrical Engineering 2003

Seyed Hosein Shams Seyed Mohammad Ahadi

Context-dependent modeling is a well-known approach to increasing modeling accuracy in continuous speech recognition. The most common way to implement this approach is via triphone modeling. Nevertheless, the large number of such models causes several problems in model training, and robust training of such models is often hard to achieve. One approach to solving this problem is via param...

Full text

Recognition of continuous persian speech using a medium-sized vocabulary speech corpus

1999

S. M. Ahadi

Speech recognition in Persian (Farsi) has recently been addressed by a few native speaking researchers and some approaches to isolated word and phoneme recognition have been reported. A main bottleneck in this research field is the lack of a recognition-specific speech corpus. In this work, a phonetically balanced speech database of Persian has been modified and used in continuous speech recogn...

Full text

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Journal: Signal and Data Processing 2021

Bastanfard, Azam; Ghoreishi, Sayed Akbar; Veisi, Hadi

Islamic Republic of Iran Broadcasting (IRIB), as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, IRIB's archive is one of the richest archives in Iran, containing a huge amount of multimedia data. Monitoring this massive volume of data, and browsing and retrieving from this archive, is one of the key issues for this broadcasting...

Full text

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Journal: Iranian Journal of Electrical and Electronic Engineering 2016

Bashirpour, M.; Geravanchizadeh, M.

Automatic recognition of emotional states from speech in noisy conditions has become an important research topic in emotional speech recognition in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

Full text

Building and Incorporating Language Models for Persian Continuous Speech Recognition Systems

2006

Mohammad Bahrani Hossein Sameti Nazila Hafezi H. Movassagh

This paper describes building statistical language models for Persian from a corpus and incorporating them into a Persian continuous speech recognition (CSR) system. We used the Persian Text Corpus to build the language models. First, we preprocessed the corpus texts by correcting inconsistent orthography of words. Also, the number of POS tags was decreased by clustering POS tag...

Full text

Fast estimation of warping factors in vocal tract length normalization using the score obtained from gender detection modeling

Journal: Signal and Data Processing 2016

Almasganj, Farshad; Reza, Shaghayegh; Shekofteh, Yasser; Sarraf Rezaei, Iman; Gholipour, Hassan; Kabudian, Jahanshah; Goodarzi, Mohammad Mohsen

The performance of automatic speech recognition (ASR) systems is adversely affected by variations in speakers, audio channels and environmental conditions. Making these systems robust to these variations is still a big challenge. One of the main sources of variation among speakers is the difference in their vocal tract length (VTL). Vocal Tract Length Normalization (VTLN) is an effe...

Full text

Effects of ageing on speed and temporal resolution of speech stimuli in older adults

Journal: Medical Journal of the Islamic Republic of Iran

Zahra Jafari (Rehabilitation Research Center (RRC), Department of Basic Sciences in Rehabilitation, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran); Shaghayegh Omidvar (Rehabilitation Research Center, Department of Audiology, School of Rehabilitation, Tehran University of Medical Sciences, Tehran, Iran); Fateme Jafarloo (Rehabilitation Research Center, Department of Audiology, School of Rehabilitation, Tehran University of Medical Sciences, Tehran, Iran)

Background: According to previous studies, most speech recognition disorders in older adults result from deficits in audibility and auditory temporal resolution. This paper studies the effect of ageing on time-compressed speech and auditory temporal resolution by word recognition in continuous and interrupted noise. Methods: A time-compressed speech test (TCST) was conducte...

Full text

Continuous speech recognition

Journal: IEEE Signal Processing Magazine 1995

Full text

The effect of early bilingualism on auditory temporal processing ability using time-compressed Persian speech test

Journal: Auditory and Vestibular Research

Ensiyeh Rahmani (Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran); Farnoush Jarollahi (Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran); Agha Fatemeh Hosseini (Department of Biostatistics, School of Health, Iran University of Medical Sciences, Tehran, Iran); Mahnaz Soleymani (Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran)

Background and aim: Bilingualism is an important phenomenon with different effects on each aspect of language processing. Auditory temporal processing is a major component of auditory processing ability. Since the brain processes of bilingual and monolingual individuals differ, and no studies have yet been conducted on the effect of temporal processing on speech recognition performance of az...

Full text

Towards automatic learning in LVCSR: rapid development of a Persian broadcast transcription system

2008

Christian Gollan Hermann Ney

We present a new method for automatic learning and refining of pronunciations for large vocabulary continuous speech recognition which starts from a small amount of transcribed data and uses automatic transcription techniques for additional untranscribed speech data. The recognition performance of speech recognition systems usually depends on the available amount and quality of the transcribed ...

Full text