Noise Robust Speech Parameterization using Relative Spectra and Auditory Filterbank

نویسندگان

  • Youssef Zouhir
  • Kaïs Ouni
چکیده

In the present study, a new feature extraction method based on relative spectra and gammachirp auditory filterbank is proposed for robust noisy speech recognition. The relative spectra filtering are applied to the log of the output of the gammachirp filterbank which incorporates the properties of the cochlear filter in order to remove uncorrelated additive noise components. The performances of this method have been evaluated on the isolated speech word corrupted by real-world noisy environments using the continuous Gausian-Mixture density Hidden Markov Model. The evaluation of the experimental results shows that the proposed method achieves best recognition rates compared to the conventional techniques like Perceptual Linear Prediction (PLP), Linear Predictive Cepstral Coefficients (LPCC) and Mel-Frequency Cepstral Coefficients (MFCC).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique

In this paper we introduce a robust feature extractor, dubbed as robust compressive gammachirp filterbank cepstral coefficients (RCGCC), based on an asymmetric and level-dependent compressive gammachirp filterbank and a sigmoid shape weighting rule for the enhancement of speech spectra in the auditory domain. The goal of this work is to improve the robustness of speech recognition systems in ad...

متن کامل

On the relevance of auditory-based Gabor features for deep learning in robust speech recognition

Previous studies support the idea of merging auditory-based Gabor features with deep learning architectures to achieve robust automatic speech recognition, however, the cause behind the gain of such combination is still unknown. We believe these representations provide the deep learning decoder with more discriminable cues. Our aim with this paper is to validate this hypothesis by performing ex...

متن کامل

Perceptual Domain Based Speech and Audio Coder

This paper applies a new auditory filterbank to wide band speech and audio coding. The coding algorithm is capable of producing high quality coded speech and audio, which account for temporal as well as spectral details. The analysis and synthesis are performed using a critical-bandrate auditory filterbank with superior auditory masking properties. The outputs of the analysis filters are proces...

متن کامل

A generalized framework for compensation of mel-filterbank outputs in feature extraction for robust ASR

This paper describes a novel and efficient noise-robust frontend that utilizes a set of Mel-filterbank output compensation methods, together with cumulative distribution mapping of cepstral coefficients, for noisy speech recognition. The proposed compensation framework includes the use of noise spectral subtraction, spectral flooring and log Mel-filterbank output weighting. Recognition experime...

متن کامل

Amplitude Modulation Maps for Robust Speech Recognition

Two recognition tasks are discussed in which pre-processing based on amplitude modulation (AM) maps is compared with other feature extraction strategies. In the first task we show how the AM map representation can be used to segregate voiced speech signals from one another. The second shows how the AM representation can be used for robust digit recognition in additive noise. Natural vowels from...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015