Using Energy Difference for Speech Separation of Dual - microphone Close - talk System

نویسندگان

  • Yi Jiang
  • Ming Jiang
  • Yuanyuan Zu
  • Zhenming Feng
چکیده

Using the computational auditory scene analysis (CASA) as a framework, a novel speech separation approach based on dual-microphone energy difference (DMED) is proposed for close-talk system. The energy levels of the two microphones are calculated in time-frequency (T-F) units. The DMEDs are calculated as the energy level ratio between the two microphones, and used as a cue to estimate the signal to noise ratio (SNR) and ideal binary mask (IBM) for mix-acoustic of the close-to-mouth microphone. The binary masked units are grouped to generate the target speech. Test with speeches and different noises show that the algorithm is more than 95 % accurate. As the T-F units’ length increase, the accuracy increase as well. Using automatic speech recognition (ASR) analysis, we show that the proposed algorithm improves speech quality in actual close talk system. Copyright © 2013 IFSA.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Dual-Microphone Speech Enhancement Algorithm for Close-Talk System

While human listening is robust in complex auditory scenes, current speech enhancement algorithms do not perform well in noisy environments, even close-talk system is used. This paper addresses the robustness in dual microphone embedded close talk system by employing a computational auditory scene analysis (CASA) framework. The energy difference between the two microphones is used as the primar...

متن کامل

Blind Source Separation for Speech Application Under Real Acoustic Environment

A hands-free speech recognition system [1] is essential for the realization of an intuitive, unconstrained, and stress-free human-machine interface, where users can talk naturally because they require no microphone in their hands. In this system, however, since noise and reverberation always degrade speech quality, it is difficult to achieve high recognition performance, compared with the case ...

متن کامل

Real Time Speech Separation by Lateral Inhibition and Masking

In this paper, we propose a simple algorithm to separate a speech signals with the highest energy from a mixture of sound sources . We use two microphones and assume that one speaker is close to one microphone, and the other speaker is close to another microphone. In the system we use the concept of auditory filter banks together with lateral inhibition, intensity interaural difference and mask...

متن کامل

Maximum Confidence Measure Based Interaural Phase Difference Estimation for Noise Masking in Dual-Microphone Robust Speech Recognition

A new one-stage maximum confidence measure (MCM) based interaural phase difference estimation framework for noise masking is proposed to closely integrate the underline speech models into dual-microphone array noise filtering for robust speech recognition. The main ideas are: (1) utilizing both the speech and filler models of the recognizer to feedback confidence measures (CMs) that indicate th...

متن کامل

An analytic modeling approach to enhancing throat microphone speech commands for keyword spotting

This research was carried out on enhancing throat microphone speech for noise-robust speech keyword spotting. The enhancement was performed by mapping the log-energy in the Mel-frequency bands of throat microphone speech to those of the corresponding close-talk microphone speech. An analytic equation detection system, Eureqa, which can infer nonlinear relations directly from observed data, was ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013