Zero-crossing-based ratio masking for sound segregation
نویسندگان
چکیده
This paper presents a new method of zero-crossing based binaural mask estimation for sound segregation under the condition that multiple sound sources are present simultaneously. The masking is determined by the estimated sound source directions using the spatial cues such as inter-aural time differences (ITDs) and inter-aural intensity differences (IIDs). In the suggested method, the estimation of ITDs is utilizing the statistical properties of zero-crossings detected from binaural filter-bank outputs. We also consider the estimation of ITDs with the aid of IID samples to cope with the phase ambiguities of ITD samples in high frequencies. For the masking method, we consider to use the target-to-total power ratio in each segment of the timefrequency domain. We show that this power ratio is optimal from the view point of reconstructing the target speech signal. As a result, the proposed method is able to provide an accurate estimate of sound source directions and also a good masking scheme for speech segregation while offering significantly less computational complexity compared to cross-correlation-based methods.
منابع مشابه
Zero-Crossing Based Time-Frequency Masking for Sound Segregation
This paper presents a new method of zero-crossing based binaural mask estimation for sound segregation under the condition that multiple sound sources are present simultaneously. The masking is determined by the estimated sound source directions using the spatial cues such as inter-aural time differences (ITDs) and inter-aural intensity differences (IIDs). In the suggested method, the estimatio...
متن کاملSpatial Hearing Algorithms Based on Binaural Zero-Crossings: Sound Source Localization, Segregation, and Dereverberation
This thesis concerns a new zero-crossing-based binaural model for spatial hearing. Conventional binaural model computes cross-correlations of binaural signals for the estimation of the interaural time difference which is a primary spatial cue. However, the cross-correlationbased binaural processing model requires high computational complexity and suffers from inaccuracies in localizing sound so...
متن کاملSound segregation based on binaural zero-crossings
This paper presents a new method of sound segregation based on zero-crossings generated from binaural filter-bank outputs. In our approach, sound source directions are identified using the spatial cues such as inter-aural time differences (ITDs) and inter-aural intensity differences (IIDs). The estimation of ITDs is performed using zero-crossings generated from binaural filter-bank outputs to g...
متن کاملUsing a Cascade of Asymmetric Resonators with Fast-Acting Compression as a Cochlear Model for Machine-Hearing Applications
Every day, machines process many thousands of hours of audio signals through a realistic cochlear model. They extract features, inform classifiers and recommenders, and identify copyrighted material. The machine-hearing approach to such tasks has taken root in recent years, because hearingbased approaches perform better than we can do with more conventional sound-analysis approaches. We use a b...
متن کاملA Pole–Zero Filter Cascade Provides Good Fits to Human Masking Data and to Basilar Membrane and Neural Data
A cascade of two-pole–two-zero filters with level-dependent pole and zero dampings, with few parameters, can provide a good match to human psychophysical and physiological data. The model has been fitted to data on detection threshold for tones in notched-noise masking, including bandwidth and filter shape changes over a wide range of levels, and has been shown to provide better fits with fewer...
متن کامل