Building a Binaural Source Separator
Authors
Abstract
We propose a set of cues, and a strategy for combining them, that a binaural machine could use to perform source separation. Our previous work used a single cue, interaural phase difference (IPD), to segment the time-frequency plane with an EM algorithm. We see this as a first step towards a larger, more complete system that exploits more of the cues available to a listener from the stereo mixture, such as interaural level difference (ILD), monaural cues, and reliability cues. These cues could be integrated with one another by extending the existing probabilistic framework.
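To make the two interaural cues named in the abstract concrete, here is a minimal sketch of how IPD and ILD could be computed per time-frequency cell from a stereo pair. It is not the authors' implementation: the function names `stft` and `interaural_cues`, the Hann window, the frame parameters `n_fft` and `hop`, and the left-minus-right sign convention are all assumptions made for illustration, and the EM segmentation step is not shown.

```python
import numpy as np

def stft(x, n_fft=512, hop=128):
    """Hann-windowed short-time Fourier transform, shape (frames, bins)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)

def interaural_cues(left, right, n_fft=512, hop=128, eps=1e-12):
    """Return (ipd, ild) for each time-frequency cell.

    ipd: phase of left minus phase of right, in radians, wrapped to (-pi, pi]
         (assumed sign convention).
    ild: left-to-right magnitude ratio in dB (assumed sign convention).
    """
    L = stft(np.asarray(left, dtype=float), n_fft, hop)
    R = stft(np.asarray(right, dtype=float), n_fft, hop)
    # The angle of L * conj(R) is the wrapped phase difference.
    ipd = np.angle(L * np.conj(R))
    # eps avoids log of zero in silent cells.
    ild = 20.0 * np.log10((np.abs(L) + eps) / (np.abs(R) + eps))
    return ipd, ild
```

In a system of the kind the abstract describes, arrays like `ipd` and `ild` would be the observations that a probabilistic model assigns to sources. Because IPD is wrapped, it becomes ambiguous at higher frequencies for a given delay, which is one reason combining it with other cues can help.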
Similar resources
Studies on binaural and monaural signal analysis methods and applications
Author: Sampo Vesa. Title: Studies on Binaural and Monaural Signal Analysis: Methods and Applications. Sound signals can contain a lot of information about the environment and the sound sources present in it. This thesis presents novel contributions to the analysis of binaural and monaural sound signals. Some new applications are introduced in this work, but the emphasis is on analysis methods. Th...
A Binaural Microscopic Model Based on a Modulation Filter Bank for Predicting Speech Intelligibility in Normal-Hearing Listeners
This study introduces a binaural microscopic model, based on a modulation filter bank, for predicting speech intelligibility. So far, binaural models have used spectral criteria such as the STI and SII, or other analytical methods, to determine binaural intelligibility. In the proposed model, unlike all models of binaural intelligibility prediction, an automatic ...
Authenticity and Naturalness of Binaural Reproduction via Headphones regarding Different Equalization Methods
Not only a suitable localization performance, but also the plausibility and authenticity of the played scene are major criteria for a successful binaural reproduction. It is therefore important to examine whether the binaural reproduction can be perceptually distinguished from a real source. The aim of the presented investigation is to analyze the quality and reliability of binaural reproductio...
Localizing nearby sound sources in a classroom: binaural room impulse responses.
Binaural room impulse responses (BRIRs) were measured in a classroom for sources at different azimuths and distances (up to 1 m) relative to a manikin located in four positions in a classroom. When the listener is far from all walls, reverberant energy distorts signal magnitude and phase independently at each frequency, altering monaural spectral cues, interaural phase differences, and interaur...
Structuring time domain blind source separation algorithms for CASA integration
Most algorithms based on Computational Auditory Scene Analysis (CASA) for binaural speech separation cannot inhibit sources that have already been localized and have been present in the auditory scene for a long time. The major drawback is that the auditory cues of weaker and new sources are subject to interference from already localized and perceived signals and the separation performance is ...