Robust automatic continuous-speech recognition based on a voiced-unvoiced decision
نویسندگان
چکیده
In this paper, the implementation of a robust front-end to be used for a large-vocabulary Continuous Speech Recognition (CSR) system based on a Voiced-Unvoiced (V-U) decision has been addressed. Our approach is based on the separation of the speech signal into voiced and unvoiced components. Consequently, speech enhancement can be achieved through processing of the voiced and the unvoiced components separately. Enhancement of the voiced component is performed using an adaptive comb filtering, whereas the unvoiced component is enhanced using the modified spectral subtraction approach. We proved via experiments that the proposed CSR system is robust in additive noisy environments (SNR down to 0 dB).
منابع مشابه
Comparative experiments to evaluate a voiced-unvoiced-based pre-processing approach to robust automatic speech recognition in low-SNR environments
This paper presents an evaluation of a robust Voiced-Unvoicedbased large-vocabulary Continuous-Speech Recognition (CSR) system in the presence of highly interfering noise. Comparative experiments have indicated that the inclusion of an accurate Voiced-Unvoiced (V-U) classifier in our design of a CSR system improves the performance of such a recognizer, for speech contaminated by both additive G...
متن کاملTowards a robust/fast continuous speech recognition system using a voiced-unvoiced decision
In this paper, we show that the concept of Voiced-Unvoiced (VU) classification of speech sounds can be incorporated not only in speech analysis or speech enhancement processes, but also can be useful for recognition processes. That is, the incorporation of such a classification in a continuous speech recognition (CSR) system not only improves its performance in low SNR environments, but also li...
متن کاملRobust speech recognition using a voiced-unvoiced feature
In this paper, a voiced-unvoiced measure is used as acoustic feature for continuous speech recognition. The voiced-unvoiced measure was combined with the standard Mel Frequency Cepstral Coefficients (MFCC) using linear discriminant analysis (LDA) to choose the most relevant features. Experiments were performed on the SieTill (German digit strings recorded over telephone line) and on the SPINE (...
متن کاملA continuous voicing parameter in the frequency domain
In automatic speech analysis, voicing articulatory is often defined as a binary decision: voiced or unvoiced. Linguists agree that this articulatory should be continuous. In this paper, we present a new approach to compute a continuous voicing indicator of a speech frame. This voicing percentage is then evaluated in both a segmentation process and a speech recognition task. Promising results sh...
متن کاملA Comprehensive Noise Robust Speech Parameterization Algorithm Using Wavelet Packet Decomposition-Based Denoising and Speech Feature Representation Techniques
This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thre...
متن کامل