Perceptual speech enhancement exploiting temporal masking properties of human auditory system
نویسندگان
چکیده
The use of simultaneous masking in speech enhancement has shown promise for a range of noise types. In this paper, a new speech enhancement algorithm based on a short-term temporal masking threshold to noise ratio (MNR) is presented. A novel functional model for forward masking based on three parameters is incorporated into a speech enhancement framework based on speech boosting. The performance of the speech enhancement algorithm using the proposed forward masking model was compared with seven other speech enhancement methods over 12 different noise types and four SNRs. Objective evaluation using PESQ revealed that using the proposed forward masking model, the speech enhancement algorithm outperforms the other algorithms by 6–20% depending on the SNR. Moreover, subjective evaluation using 16 listeners confirmed the objective test results. 2009 Elsevier B.V. All rights reserved.
منابع مشابه
A Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System
We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-tonoise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized tim...
متن کاملSpeech Enhancement Using Masking Properties in Adverse Environments
In this paper, we propose a speech enhancement method by exploiting masking properties of human auditory system. The masking properties are exploited to calculate a masking threshold. The spectral components which lie above the threshold are audible to human listeners. These audible spectral components in the proposed method are suppressed as a predefined attenuation factor of the original nois...
متن کاملPerceptual Speech Enhancement Using a Hilbert Transform Based Time-Frequency Representation of Speech
A new Time-Frequency (TF) representation of speech signal is introduced and used for speech enhancement. TF representation and speech enhancement algorithm are both based on perceptual properties of human auditory system in which the concept of band analysis is exploited. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing Critical Bands (CB) ...
متن کاملSpeech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters
A speech enhancement technique based on the temporal masking properties of the human auditory system is presented. The noisy signal is divided into a number of sub-bands with fractional bark accuracy, and the sub-band signals are individually and adaptively weighted in the time domain according to a short-term temporal masking threshold to noise ratio estimate in each subband. Objective measure...
متن کاملSpeech Enhancement Using Adaptive Kalman Filter Combined With Perceptual Weighting Filter
The speech enhancement is one of the important techniques used to improve the quality of a speech signal i.e. degraded by noise. Speech enhancement using conventional kalman filter require calculating the parameters of AR (auto-regressive) model, and performing a lot of matrix operations, which is non-adaptive. In this paper the proposed method i.e. adaptive kalman filter combined with perceptu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Speech Communication
دوره 52 شماره
صفحات -
تاریخ انتشار 2010