A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation

Authors

  • Kalle J. Palomäki
  • Guy J. Brown
  • DeLiang Wang
Abstract

In this study we describe a binaural auditory model for recognition of speech in the presence of spatially separated noise intrusions, under small-room reverberation conditions. The principle underlying the model is to identify time–frequency regions which constitute reliable evidence of the speech signal. This is achieved both by determining the spatial location of the speech source, and by grouping the reliable regions according to common azimuth. Reliable time–frequency regions are passed to a missing data speech recogniser, which performs decoding based on this partial description of the speech signal. In order to obtain robust estimates of spatial location in reverberant conditions, we incorporate some aspects of precedence effect processing into the auditory model. We show that the binaural auditory model improves speech recognition performance in small-room reverberation conditions in the presence of spatially separated noise, particularly for conditions in which the spatial separation is 20° or larger. We also demonstrate that the binaural system outperforms a single channel approach, notably in cases where the target speech and noise intrusion have substantial spectral overlap. © 2004 Elsevier B.V. All rights reserved.
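
The idea of grouping time–frequency regions by common azimuth can be illustrated with a minimal sketch. It is not the authors' model: it omits the auditory filterbank, the precedence effect processing and the missing data recogniser, and it stands in a plain cross-correlation lag estimate for the spatial cue. The function names `best_lag` and `azimuth_mask`, the array layout, and the `max_lag` and `tol` parameters are all assumptions made for illustration.

```python
import numpy as np


def best_lag(left, right, max_lag):
    """Return the lag (in samples) that maximises the cross-correlation.

    A positive result means the left frame is a delayed copy of the right one.
    """
    n = len(left)
    lags = np.arange(-max_lag, max_lag + 1)
    scores = []
    for lag in lags:
        if lag >= 0:
            a, b = left[lag:], right[:n - lag]
        else:
            a, b = left[:n + lag], right[-lag:]
        scores.append(np.dot(a, b))
    return int(lags[int(np.argmax(scores))])


def azimuth_mask(left, right, target_lag, max_lag=16, tol=1):
    """Mark time-frequency units whose interaural lag matches the target.

    left, right: arrays of shape (channels, frames, samples), assumed to be
    per-channel, per-frame signals from the two ears (hypothetical layout).
    Returns a boolean array of shape (channels, frames); True = "reliable".
    """
    channels, frames, _ = left.shape
    mask = np.zeros((channels, frames), dtype=bool)
    for c in range(channels):
        for f in range(frames):
            lag = best_lag(left[c, f], right[c, f], max_lag)
            mask[c, f] = abs(lag - target_lag) <= tol
    return mask
```

In a missing data setting, a mask of this kind would tell the recogniser which spectral features to treat as reliable evidence of the target speech and which to treat as missing; how the mask is actually derived and used in the paper is described only in the full text.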

Similar resources

A Binaural Model for Missing Data Speech Recognition in Noisy and Reverberant Conditions

We describe a binaural auditory model for speech recognition, which is robust in the presence of reverberation and spatially separated noise intrusions. The principle underlying the model is to identify time-frequency regions which constitute reliable evidence of the speech signal. This is achieved both by determining the spatial location of the speech source, and by applying a simple model of ...

A Binaural Auditory Model for Missing Data Recognition of Speech in Noise

We describe a binaural auditory model for speech recognition, which is robust in the presence of reverberation and spatially separated noise intrusions. The principle underlying the model is to identify time-frequency regions which constitute reliable evidence of the speech signal. This is achieved both by determining the spatial location of the speech source, and by applying a simple model of ...

A binaural microscopic model based on a modulation filter bank for predicting speech intelligibility in normal-hearing listeners

In this study, a binaural microscopic model for the prediction of speech intelligibility based on the modulation filter bank is introduced. So far, the spectral criteria such as the STI and SII or other analytical methods have been used in the binaural models to determine the binaural intelligibility. In the proposed model, unlike all models of binaural intelligibility prediction, an automatic ...

Speech intelligibility prediction in reverberation: Towards an integrated model of speech transmission, spatial unmasking, and binaural de-reverberation.

Room acoustic indicators of intelligibility have focused on the effects of temporal smearing of speech by reverberation and masking by diffuse ambient noise. In the presence of a discrete noise source, these indicators neglect the binaural listener's ability to separate target speech from noise. Lavandier and Culling [(2010). J. Acoust. Soc. Am. 127, 387-399] proposed a model that incorporates ...

Binaural Signal Processing for Enhanced Speech Recognition Robustness in Complex Listening Environments

This paper addresses the problem of automatic speech recognition (ASR) in the presence of room reverberation, speaker movements and highly non-stationary background noise on the basis of binaural microphone recordings. Investigations are conducted for Track 1 of the 2nd CHiME Speech Separation and Recognition Challenge, posing a small-vocabulary task that requires the recognition of a short key...

Journal:
  • Speech Communication

Volume 43  Issue

Pages  -

Publication date: 2004