Speech separation by simulating the cocktail party effect with a neural network controlled Wiener filter
نویسندگان
چکیده
A novel speech separation structure which simulates the cocktail party e ect using a modi ed iterative Wiener lter and a multi-layer perceptron neural network is presented. The neural network is used as a speaker recognition system to control the iterative Wiener lter. The neural network is a modi ed perceptron with a hidden layer using feature data extracted from LPC cepstral analysis. The proposed technique has been successfully used for speech separation when the interference is competing speech or broad band noise.
منابع مشابه
Deep Transform: Cocktail Party Source Separation via Probabilistic Re-Synthesis
In cocktail party listening scenarios, the human brain is able to separate competing speech signals. However, the signal processing implemented by the brain to perform cocktail party listening is not well understood. Here, we trained two separate convolutive autoencoder deep neural networks (DNN) to separate monaural and binaural mixtures of two concurrent speech streams. We then used these DNN...
متن کاملCocktail Party Processing via Structured Prediction
While human listeners excel at selectively attending to a conversation in a cocktail party, machine performance is still far inferior by comparison. We show that the cocktail party problem, or the speech separation problem, can be effectively approached via structured prediction. To account for temporal dynamics in speech, we employ conditional random fields (CRFs) to classify speech dominance ...
متن کاملDeep Transform: Cocktail Party Source Separation via Complex Convolution in a Deep Neural Network
Convolutional deep neural networks (DNN) are state of the art in many engineering problems but have not yet addressed the issue of how to deal with complex spectrograms. Here, we use circular statistics to provide a convenient probabilistic estimate of spectrogram phase in a complex convolutional DNN. In a typical cocktail party source separation scenario, we trained a convolutional DNN to re-s...
متن کاملProbabilistic Binary-Mask Cocktail-Party Source Separation in a Convolutional Deep Neural Network
Separation of competing speech is a key challenge in signal processing and a feat routinely performed by the human auditory brain. A long standing benchmark of the spectrogram approach to source separation is known as the ideal binary mask. Here, we train a convolutional deep neural network, on a twospeaker cocktail party problem, to make probabilistic predictions about binary masks. Our result...
متن کاملMultichannel signal separation for cocktail party speech recognition: a dynamic recurrent network
This paper addresses the method of multichannel signal separation with its application to cocktail party speech recognition. First, we present a fundamental principle for multichannel signal separation which describes what spatial independence criterion results in. Second, for practical implementation of the signal separation lter, we consider a dynamic recurrent network and develop a new simpl...
متن کامل