binary masking

Stereo-input speech recognition using sparseness-based time-frequency masking in a reverberant environment

2009

Yosuke Izumi Kenta Nishiki Shinji Watanabe Takuya Nishimoto Nobutaka Ono Shigeki Sagayama

We present noise robust automatic speech recognition (ASR) using sparseness-based underdetermined blind source separation (BSS) technique. As a representative underdetermined BSS method, we utilized time-frequency masking in this paper. Although time-frequency masking is able to separate target speech from interferences effectively, one should consider two problems. One is that masking does not...

متن کامل

Musical Sound Separation Based on Binary Time-Frequency Masking

Journal: :EURASIP Journal on Audio, Speech, and Music Processing 2009

متن کامل

Reconstruction techniques for improving the perceptual quality of binary masked speech.

Journal: :The Journal of the Acoustical Society of America 2014

Donald S Williamson Yuxuan Wang DeLiang Wang

This study proposes an approach to improve the perceptual quality of speech separated by binary masking through the use of reconstruction in the time-frequency domain. Non-negative matrix factorization and sparse reconstruction approaches are investigated, both using a linear combination of basis vectors to represent a signal. In this approach, the short-time Fourier transform (STFT) of separat...

متن کامل

Continuous time-frequency masking method for blind speech separation with adaptive choice of threshold parameter using ICA

2006

Zbynek Koldovský Jan Nouza Jan Kolorenc

We propose a novel method for blind speech separation using continuous time-frequency masking. The method is equipped with an adaptive choice of a threshold parameter that is based on utilization of ICA methods. We present a direct application that consists in the speech segregation for automatic transcription of spoken broadcasts disturbed by background music. Experimental results show improve...

متن کامل

Robust Image Registration Using Selective Correlation Coefficient

2000

Yutaka Sato Shun'ichi Kaneko Satoru Igarashi

\Ve propose a new method for robust image registrat,ion called 'Selective Correlation Coefficient (SCC)' in order to search images under illconditioned illuminat,ion or partial occlusion. A correlation mask image is generated for selecting pixels of a image before matching. The mask image can be derived from a binary-coded increment sign image defined from any object image and the template imag...

متن کامل

Effects of Cochlear Hearing Loss on the Benefits of Ideal Binary Masking

2016

Vahid Montazeri Shaikat Hossain Peter F. Assmann

Ideal Binary Masking (IdBM) is considered as the primary goal of computational auditory scene analysis. This binary masking criterion provides a time-frequency representation of noisy speech and retains regions where the speech dominates the noise while discarding regions where the noise is dominant. Several studies have shown the benefits of IdBM for normal hearing and hearing-impaired listene...

متن کامل

Image Enhancement Using an Adaptive Un-sharp Masking Method Considering the Gradient Variation

Journal: International Journal of Engineering 2017

Hamid Hassanpour, Sekineh Asadi, Zahra Mortezaei,

Technical limitations in image capturing usually impose defective, such as contrast degradation. There are different approaches to improve the contrast of an image. Among the exiting approaches, un-sharp masking is a popular method due to its simplicity in implementation and computation. There is an important parameter in un-sharp masking, named gain factor, which affects the quality of the enh...

متن کامل

Face hallucination based on morphological component analysis

Journal: :Signal Processing 2013

Yan Liang Xiaohua Xie Jian-Huang Lai

In this paper, we formulate the face hallucination as an image decomposition problem, and propose a Morphological Component Analysis (MCA) based method for hallucinating a single face image. A novel three-step framework is presented for the proposed method. Firstly, a low-resolution input image is up-sampled via an interpolation. Then, the interpolated image is decomposed into a global high-res...

متن کامل

Speech Enhancement Using Wavelet Coefficients Masking with Local Binary Patterns

2017

Christian Arcos Marley Vellasco Abraham Alcaim

In this paper, we present a wavelet coefficients masking based on Local Binary Patterns (WLBP) approach to enhance the temporal spectra of the wavelet coefficients for speech enhancement. This technique exploits the wavelet denoising scheme, which splits the degraded speech into pyramidal subband components and extracts frequency information without losing temporal information. Speech enhanceme...

متن کامل

Automated Pause Insertion for Improved Intelligibility Under Reverberation

2016

Petko N. Petkov Norbert Braunschweiler Yannis Stylianou

Speech intelligibility in reverberant environments is reduced because of overlap-masking. Signal modification prior to presentation in such listening environments, e.g., with a public announcement system, can be employed to alleviate this problem. Time-scale modifications are particularly effective in reducing the effect of overlap-masking. A method for introducing linguistically-motivated paus...

متن کامل