A Post-Filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement

نویسندگان

  • Yi-Chiao Wu
  • Hsin-Te Hwang
  • Syu-Siang Wang
  • Chin-Cheng Hsu
  • Yu Tsao
  • Hsin-Min Wang
چکیده

This paper presents a novel difference compensation postfiltering approach based on the locally linear embedding (LLE) algorithm for speech enhancement (SE). The main goal of the proposed post-filtering approach is to further suppress residual noises in SE-processed signals to attain improved speech quality and intelligibility. The proposed system can be divided into offline and online stages. In the offline stage, we prepare paired differences: the estimated difference of {SE-processed speech; noisy speech} and the ground-truth difference of {clean speech; noisy speech}. In the online stage, on the basis of estimated difference of a test utterance, we first predict the corresponding ground-truth difference based on the LLE algorithm, and then compensate the noisy speech with the predicted difference. In this study, we integrate a deep denoising autoencoder (DDAE) SE method with the proposed LLE-based difference compensation post-filtering approach. The experiment results reveal that the proposed post-filtering approach obviously enhanced the speech quality and intelligibility of the DDAE-based SE-processed speech in different noise types and signal-to-noise-ratio levels.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Enhancement by Modified Convex Combination of Fractional Adaptive Filtering

This paper presents new adaptive filtering techniques used in speech enhancement system. Adaptive filtering schemes are subjected to different trade-offs regarding their steady-state misadjustment, speed of convergence, and tracking performance. Fractional Least-Mean-Square (FLMS) is a new adaptive algorithm which has better performance than the conventional LMS algorithm. Normalization of LMS ...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering

Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

New Hybrid Sub - band Speech Enhancement Systems incorporating Neural Networks and post - Weiner Filtering

In this paper, two new hybrid sub-band systems are proposed which aim to combine neural network sub-band processing with post-Wiener filtering for adaptive speech-enhancement processing of noisy signals. The proposed hybrid architectures comprise an early auditory-processing modelling inspired Multi-Microphone Sub-band Adaptive (MMSBA) system incorporating neural-network based non-linear sub-ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017