Robust Multi-Keyword Spotting of Telephone Speech Using Stochastic Matching
نویسندگان
چکیده
In telephone speech recognition, the acoustic mismatch between the training and the test environment often causes severe degradation due to the channel distortion and ambient noise. In this paper, a two-level codebook-based stochastic matching (CBSM) is proposed to deal with the acoustic mismatch. For multi-keyword detection, we define a keyword relation table and a weighting function for reasonable keyword combinations. In the multi-keyword spotting system, 94 right context-dependent INITIAL’s, 37 context-independent FINAL’s and 1 silence model are adopted. In order to evaluate the multi-keyword spotting system, 1275 faculty names and department names are selected as the keywords. Using a testing set of 2400 conversional speech utterances from 8 speakers, the proposed two-level CBSM can reduce the recognition error rate from 36.52% to 13.4%.
منابع مشابه
Multi-keyword spotting of telephone speech using a fuzzy search algorithm and keyword-driven two-level CBSM
In telephone speech recognition, the acoustic mismatch between training and testing environments often causes a severe degradation in the recognition performance. This paper presents a keyword-driven two-level codebook-based stochastic matching (CBSM) algorithm to eliminate the acoustic mismatch. Additionally, in Mandarin speech, it is dicult to correctly recognize the unvoiced part in a sylla...
متن کاملTelephone speech multi-keyword spotting using fuzzy search algorithm and prosodic verification
In this paper a fuzzy search algorithm is proposed to deal with the recognition error for telephone speech. Since the prosodic information is a very special and important feature for Mandarin speech, we integrate the prosodic information into keyword verification. For multi-keyword detection, we define a keyword relation and a weighting function for reasonable keyword combinations. In the keywo...
متن کاملNon-Uniform Boosted MCE Training of Deep Neural Networks for Keyword Spotting
Keyword spotting can be formulated as a non-uniform error automatic speech recognition (ASR) problem. It has been demonstrated [1] that this new formulation with the nonuniform MCE training technique can lead to improved system performance in keyword spotting applications. In this paper, we demonstrate that deep neural networks (DNNs) can be successfully trained on the non-uniform minimum class...
متن کاملRobust Keyword Spotting Using a Multi-Stream Approach
Speech recognition systems are prone to severe degradation in noisy environments due to mismatch between training and testing conditions. A multi-stream approach for keyword spotting is proposed to improve robustness in mismatched conditions. The assumption is that most real world noises are colored and do not affect the full spectrum equally, meaning certain parts of the spectrum can still pro...
متن کاملComparison of keyword spotting methods for searching in speech
This paper presents and discusses keyword spotting methods for searching in speech. In contrast with searching in text, the searching in speech or generally in multimedia data still represents a challenge. The aim of the paper is to present a keyword spotting (KWS) method based on a large vocabulary continuous speech recognition (LVCSR) system, based on phonetics decoder, and keyword spotting u...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009