Speech Separation with Dereverberation-based Pre-processing Incorporating Visual Cues
نویسندگان
چکیده
separation with dereverberation-based pre-processing incorporating visual cues.
منابع مشابه
Speech Dereverberation by Blind Adaptive MIMO Filtering Exploiting Nongaussianity, Nonwhiteness, and Nonstationarity
In this paper, we present a class of novel algorithms for blind dereverberation of speech signals based on TRINICON, a general framework for broadband adaptive MIMO signal processing. In order to exploit all fundamental stochastic signal properties of speech for the dereverberation/deconvolution process and to avoid any whitening artifacts known from previous approaches, we propose the incorpor...
متن کاملA General Framework for Incorporating Time-Frequency Domain Sparsity in Multichannel Speech Dereverberation
Blind multichannel speech dereverberation methods based on multichannel linear prediction (MCLP) estimate the dereverberated speech component without any knowledge of the room acoustics by estimating and subtracting the undesired reverberant component from the reference microphone signal. In this paper we present a general framework for incorporating sparsity in the time-frequency domain into M...
متن کاملClassification based binaural dereverberation
Reverberation has a detrimental effect on speech perception both in terms of quality as well as intelligibility, as late reflections smear temporal and spectral cues. The ideal binary mask, which is an established computational approach to sound separation, was recently extended to remove reverberation. Experiments with both normal hearing and hearing impaired listeners have shown significant i...
متن کاملSpeech Recognition by Dereverberation Method Based on Multi-channel LMS Algorithm in Noisy Reverberant Environment
1 Introduction In a distant-talking environment, channel distortion drastically degrades speech recognition performance because of mismatches between the training and test environments. The current approaches focusing on robustness issues for automatic speech recognition (ASR) in noisy reverberant environments can be classified as speech enhancement, robust feature extraction, or model adaptati...
متن کاملStatistical Model of Speech Signals Based on Composite Autoregressive System with Application to Blind Source Separation
This paper presents a new statistical model for speech signals, which consists of a time-invariant dictionary incorporating a set of the power spectral densities of excitation signals and a set of all-pole filters where the gain of each pair of excitation and filter elements is allowed to vary over time. We use this model to develop a combined blind separation and dereverberation method for spe...
متن کامل