Is Syllable Stress Information Robust for ASR in Adverse Conditions?

نویسندگان

  • Bogdan Ludusan
  • Stefan Ziegler
  • Guillaume Gravier
چکیده

This paper presents a study on the robustness of stress information for automatic speech recognition in the presence of noise. The syllable stress, extracted from the speech signal, was integrated in the recognition process by means of a previously proposed decoding method. Experiments were conducted for several signal-to-noise ratio conditions and the results show that stress information is robust in the presence of medium to low noise. This was found to be true both when syllable boundary information was used for stress detection and when this information was not available. Furthermore, the obtained relative improvement increased with a decrease in signal quality, indicating that the stressed parts of the signal can be considered islands of reliability.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Syllable, Articulatory-feature, and Stress-accent Model of Speech Recognition

Current-generation automatic speech recognition (ASR) systems assume that words are readily decomposable into constituent phonetic components (\phonemes"). A detailed linguistic dissection of state-of-the-art speech recognition systems indicates that the conventional phonemic \beads-on-a-string" approach is of limited utility, particularly with respect to informal, conversational material. The ...

متن کامل

Word segmentation in Persian continuous speech using F0 contour

Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...

متن کامل

Throat Microphone Signals for Syllable Recognition Using Linear Prediction Cepstrum

The performance of standard Automatic Speech Recognition (ASR) systems using the Normal Microphone (NM) degrades even if the ambience is slightly noisy. In contrast to the NM speech, the Throat Microphone (TM) speech is unaffected by such an ambience. This paper explores the feasibility of using the TM speech for developing robust ASR systems in these conditions. This ASR system may also be use...

متن کامل

A comparison of LPC and FFT-based acoustic features for noise robust ASR

Within the context of robust acoustic features for automatic speech recognition (ASR), we evaluated mel-frequency cepstral coefficients (MFCCs) derived from two spectral representation techniques, i.e. the fast Fourier transform (FFT) and linear pre­ dictive coding (LPC). ASR systems based on the two feature types were tested on a digit recognition task using continuous density hidden Markov ph...

متن کامل

Investigating the Role of Three Species of Arbuscular Mycorrhizal Fungi on Growth, Acid Phosphatase Enzyme Activity and Phenolic Compounds in Zinnia Plant under Drought Stress Conditions

This experiment was conducted to study the effects of three identified isolates of Arbuscular mycorrhizal fungi (AMF) on growth, acid phosphatase enzyme activity and phenolic compounds (phenol, flavonoid and anthocyanin) of zinnia plants (Zinnia elegans L.var. Magellan Red) under water stress conditions. A factorial (two factors) experiment was planned based on a completely randomized design (C...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013