The Effects of Background Noise on the Performance of an Automatic Speech Recogniser

نویسندگان

Jason Littlefield

Ahmad Hashemi-Sakhtsari

چکیده

Ambient or environmental noise is a major factor that affects the performance of an automatic speech recogniser. Large vocabulary, speaker-dependent, continuous speech recognisers are commercially available. Speech recognisers perform well in a quiet environment, but poorly in a noisy environment. Speaker-dependent speech recognisers require training prior to them being tested, where the level of background noise in both phases affects the performance of the recogniser. This study aims to determine whether the best performance of a speech recogniser occurs when the levels of background noise during the training and test phases are the same, and how the performance is affected when the levels of background noise during the training and test phases are different. The relationship between the performance of the speech recogniser and upgrading the computer speed and amount of memory as well as software version was also investigated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

Investigation of Language Structure by Means of Language Models Incorporating Breathing and Articulatory Noise

In our experiment we used a bigram language model and a standard speech recogniser to test if linguistic information is related to the position of silence, articulatory noise, background noise, laughing and breathing in spontaneous speech. We observed that for silence and articulatory noise the acoustic modelling is more important than linguistic information represented in the bigrams of a lang...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

A Noise Robust Multilingual Reference Recogniser Based on Speechdat(II)

An important aspect of noise robustness of automatic speech recognisers (ASR) is the proper handling of non-speech acoustic events. The present paper describes further improvements of an already existing reference recogniser towards achieving such kind of robustness. The reference recogniser applied is the COST 249 SpeechDat reference recogniser, which is a fully automatic, language-independent...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2003

The Effects of Background Noise on the Performance of an Automatic Speech Recogniser

نویسندگان

چکیده

منابع مشابه

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Investigation of Language Structure by Means of Language Models Incorporating Breathing and Articulatory Noise

Improving the performance of MFCC for Persian robust speech recognition

A Noise Robust Multilingual Reference Recogniser Based on Speechdat(II)

عنوان ژورنال:

اشتراک گذاری