An improved speech transmission index for intelligibility prediction
نویسندگان
چکیده
The speech transmission index (STI) is a well known measure of intelligibility, most suited to the evaluation of speech intelligibility in rooms, with stimuli subjected to additive noise and reverberance. However, STI and its many variations do not effectively represent the intelligibility of stimuli containing non-linear distortions such as those resulting from processing by enhancement algorithms. In this paper, we revisit the STI approach and propose a variation which processes the modulation envelope in short-time segments, requiring only an assumption of quasi-stationarity (rather than the stationarity assumption of STI) of the modulation signal. Results presented in this work show that the proposed approach improves the measures correlation to subjective intelligibility scores compared to traditional STI for a range of noise types and subjected to different enhancement approaches. The approach is also shown to have higher correlation than other coherence, correlation and distance measures tested, but is unsuited to the evaluation of stimuli heavily distorted with (for example) masking based processing, where an alternative approach such as STOI is recommended. 2014 Elsevier B.V. All rights reserved.
منابع مشابه
Acoustic Study of an Auditorium by the Determination of Reverberation Time and Speech Transmission Index
The quality of the communication between teachers and students and ultimately, of classroom education itself, is closely linked to the acoustic quality of the auditorium. This acoustic quality can be characterized based on the reverberation time (RT), speech transmission index (STI) and the sound insulation. In this context, an acoustic study was conducted in an auditorium located in the Higher...
متن کاملPredicting speech intelligibility in conditions with nonlinearly processed noisy speech
The speech-based envelope power spectrum model (sEPSM; [1]) was proposed in order to overcome the limitations of the classical speech transmission index (STI) and speech intelligibility index (SII). The sEPSM applies the signal-tonoise ratio in the envelope domain (SNRenv), which was demonstrated to successfully predict speech intelligibility in conditions with nonlinearly processed noisy speec...
متن کاملImproving the prediction power of the speech transmission index to account for non-linear distortions introduced by noise-reduction algorithms
Although the speech transmission index (STI) has been shown to predict successfully the effects of linear distortions introduced by filtering and additive noise, it does not account for non-linear distortions present in noise-suppressed speech. In this study, the normalized covariance metric (NCM), a STIbased intelligibility measure, was modified to reduce the effects of non-linear distortions ...
متن کاملObjective prediction of speech intelligibility at high ambient noise levels using the speech transmission index
In many cases the intelligibility of speech in noise may be assumed independent of the absolute sound level; the speech-to-noise ratio (SNR) primarily determines intelligibility. However, at high sound levels, speech intelligibility is found to decrease. Subjective Speech Reception Threshold (SRT) measurements were performed at various speech and noise levels, and with various noise spectra. De...
متن کاملEvaluating a distortion-weighted glimpsing metric for predicting binaural speech intelligibility in rooms
A distortion-weighted glimpse proportion metric (BiDWGP) for predicting binaural speech intelligibility were evaluated in simulated anechoic and reverberant conditions, with and without a noise masker. The predictive performance of BiDWGP was compared to four reference binaural intelligibility metrics, which were extended from the Speech Intelligibility Index (SII) and the Speech Transmission I...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Speech Communication
دوره 65 شماره
صفحات -
تاریخ انتشار 2014