Interaural Time Difference Estimation Using Generalized Cross- correlation with Maximum Likelihood Weighting in Reverberant Environments
نویسندگان
چکیده
In this paper, an interaural time difference (ITD) estimation method is proposed for binaural speech separation in reverberant environments. First, the auditory signals are represented in the time-frequency (T-F) domain, and the ITD for each T-F bin is then estimated using generalized cross-correlation (GCC) with a maximum likelihood (ML) weighting function. In particular, the ML weighting function is designed to reduce the reverberation effect. Then, a mask is estimated by comparing the estimated ITD with the ITD corresponding to the location of the pre-defined target speech source. Finally, the target speech is separated by applying the mask to the auditory signals. It is shown that the proposed ITD estimation method outperforms a conventional cross-correlation-based ITD estimation method under reverberant conditions in terms of the signal-to-noise ratio (SNR) and signal-to-distortion ratio (SDR) of the separated speech signals.
منابع مشابه
Source Localisation Mapping using Weighted Interaural Cross-Correlation
Computational sound localisation using binaural microphones requires accurate Interaural Time Difference (ITD) estimation in order to infer an angle of incidence. The Phase Transform weighting function, used in generalized cross correlation calculations for microphone arrays is investigated here for application to ITD estimation in binaural head recordings. Empirical measurements using the meth...
متن کاملA probability model for interaural phase difference
In this paper, we derive a probability model for interaural phase differences at individual spectrogram points. Such a model can combine observations across arbitrary time and frequency regions in a structured way and does not make any assumptions about the characteristics of the sound sources. In experiments with speech from twenty speakers in simulated reverberant environments, this probabili...
متن کاملRobust Tde-based Doa Estimation for Compact Audio Arrays
Cross-correlation based time delay estimates (TDE) can be used for direction-of-arrival (DOA) estimation with an acoustic array in not-too-reverberant environments. In order to benefit from the computational efficiency of TDE-based DOA estimation, and concentrating on applications that use a compact microphone array and low sampling frequency, we use a combination of approaches to make TDE-base...
متن کاملSource Localization in Reverberant Environments : Part II - Statistical Analysis
The main di culty in building robust practical systems for acoustical source localization using microphone arrays, is the e ects of room-reverberation. In this paper, a statistical analysis is presented of the in uence of room reverberation on source localization techniques. Using a statistical reverberation model presented in a companion paper, the Cram erRao lower bound for time-deley estimat...
متن کاملPerformance of GCC- and AMDF-Based Time-Delay Estimation in Practical Reverberant Environments
Recently, there has been an increased interest in the use of the time-delay estimation (TDE) technique to locate and track acoustic sources in a reverberant environment. Typically, the delay estimate is obtained through identifying the extremumof the generalized cross-correlation (GCC) function or the average magnitude difference function (AMDF). These estimators are well studied and their stat...
متن کامل