Interaural Time Difference Estimation Using Generalized Cross- correlation with Maximum Likelihood Weighting in Reverberant Environments

نویسندگان

  • Ji Hun Park
  • Seung Ho Choi
چکیده

In this paper, an interaural time difference (ITD) estimation method is proposed for binaural speech separation in reverberant environments. First, the auditory signals are represented in the time-frequency (T-F) domain, and the ITD for each T-F bin is then estimated using generalized cross-correlation (GCC) with a maximum likelihood (ML) weighting function. In particular, the ML weighting function is designed to reduce the reverberation effect. Then, a mask is estimated by comparing the estimated ITD with the ITD corresponding to the location of the pre-defined target speech source. Finally, the target speech is separated by applying the mask to the auditory signals. It is shown that the proposed ITD estimation method outperforms a conventional cross-correlation-based ITD estimation method under reverberant conditions in terms of the signal-to-noise ratio (SNR) and signal-to-distortion ratio (SDR) of the separated speech signals.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Source Localisation Mapping using Weighted Interaural Cross-Correlation

Computational sound localisation using binaural microphones requires accurate Interaural Time Difference (ITD) estimation in order to infer an angle of incidence. The Phase Transform weighting function, used in generalized cross correlation calculations for microphone arrays is investigated here for application to ITD estimation in binaural head recordings. Empirical measurements using the meth...

متن کامل

A probability model for interaural phase difference

In this paper, we derive a probability model for interaural phase differences at individual spectrogram points. Such a model can combine observations across arbitrary time and frequency regions in a structured way and does not make any assumptions about the characteristics of the sound sources. In experiments with speech from twenty speakers in simulated reverberant environments, this probabili...

متن کامل

Robust Tde-based Doa Estimation for Compact Audio Arrays

Cross-correlation based time delay estimates (TDE) can be used for direction-of-arrival (DOA) estimation with an acoustic array in not-too-reverberant environments. In order to benefit from the computational efficiency of TDE-based DOA estimation, and concentrating on applications that use a compact microphone array and low sampling frequency, we use a combination of approaches to make TDE-base...

متن کامل

Source Localization in Reverberant Environments : Part II - Statistical Analysis

The main di culty in building robust practical systems for acoustical source localization using microphone arrays, is the e ects of room-reverberation. In this paper, a statistical analysis is presented of the in uence of room reverberation on source localization techniques. Using a statistical reverberation model presented in a companion paper, the Cram erRao lower bound for time-deley estimat...

متن کامل

Performance of GCC- and AMDF-Based Time-Delay Estimation in Practical Reverberant Environments

Recently, there has been an increased interest in the use of the time-delay estimation (TDE) technique to locate and track acoustic sources in a reverberant environment. Typically, the delay estimate is obtained through identifying the extremumof the generalized cross-correlation (GCC) function or the average magnitude difference function (AMDF). These estimators are well studied and their stat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014