نتایج جستجو برای: speech presence uncertainty

تعداد نتایج: 803739  

In this paper an estimator for speech enhancement based on Laplacian Mixture Model has been proposed. The proposed method, estimates the complex DFT coefficients of clean speech from noisy speech using the MMSE  estimator, when the clean speech DFT coefficients are supposed mixture of Laplacians and the DFT coefficients of  noise are assumed zero-mean Gaussian distribution. Furthermore, the MMS...

Journal: :Signal Processing 2011
Woojung Lee Ji-Hyun Song Joon-Hyuk Chang

In speech enhancement, soft decision, in which the speech absence probability (SAP) is introduced to modify the spectral gain or update the noise power, is known to be efficient. In many previous works, a fixed a priori probability of speech absence (q) is assumed in estimating the SAP, which is not realistic since speech is quasi-stationary and may not be present in each frequency bin. To addr...

Journal: :IEICE Transactions 2013
Jong-Woong Kim Joon-Hyuk Chang Sang-Won Nam Dong Kook Kim Jong Won Shin

In this paper, we propose a speech-presence uncertainty estimation to improve the global soft decision-based speech enhancement technique by using the spectral gradient scheme. The conventional soft decision-based speech enhancement technique uses a fixed ratio (Q) of the a priori speech-presence and speech-absence probabilities to derive the speech-absence probability (SAP). However, we attemp...

Journal: :EURASIP Journal on Advances in Signal Processing 2010

2010
Katherine Forbes-Riley Diane J. Litman

We investigate whether four metacognitive metrics derived from student correctness and uncertainty values are predictive of student learning in a fully automated spoken dialogue computer tutoring corpus. We previously showed that these metrics predicted learning in a comparable wizarded corpus, where a human wizard performed the speech recognition and correctness and uncertainty annotation. Our...

1999
David Malah Richard V. Cox Anthony J. Accardi

Speech enhancement algorithms which are based on estimating the short-time spectral amplitude of the clean speech have better performance when a soft-decision gain modification, depending on the a priori probability of speech absence, is used. In reported works a fixed probability, q, is assumed. Since speech is non-stationary and may not be present in every frequency bin when voiced, we propos...

Journal: :Speech Communication 2011
Anthony P. Stark Kuldip K. Paliwal

In this paper, we investigate the use of the minimum mean square error (MMSE) spectral energy estimator for use in environmentrobust automatic speech recognition (ASR). In the past, it has been common to use the MMSE log-spectral amplitude estimator for this task. However, this estimator was originally derived under subjective human listening criteria. Therefore its complex suppression rule may...

Journal: :Speech Communication 2012
Kuldip K. Paliwal Belinda Schwerin Kamil K. Wójcicki

In this paper we investigate the enhancement of speech by applying MMSE short-time spectral magnitude estimation in the modulation domain. For this purpose, the traditional analysis-modification-synthesis framework is extended to include modulation domain processing. We compensate the noisy modulation spectrum for additive noise distortion by applying the MMSE short-time spectral magnitude esti...

Journal: :Computer Speech & Language 2010
Yang Shao Soundararajan Srinivasan Zhaozhang Jin DeLiang Wang

A conventional automatic speech recognizer does not perform well in the presence of multiple sound sources, while human listeners are able to segregate and recognize a signal of interest through auditory scene analysis. We present a computational auditory scene analysis system for separating and recognizing target speech in the presence of competing speech or noise. We estimate, in two stages, ...

2011
Chang Huai You Soo Ngee Koh Haizhou Li Susanto Rahardja

This paper considers a single channel speech enhancement algorithm, which is based on our previous work on βorder minimum mean square error (MMSE) spectral estimation. We propose to make β a function of both local and frame signal-to-noise ratios (SNRs) in order to achieve more effective preservation of weak speech components. Moreover, by taking into account the speech-presence uncertainty in ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید