Robust Keyword Spotting Using a Multi-Stream Approach
نویسندگان
چکیده
Speech recognition systems are prone to severe degradation in noisy environments due to mismatch between training and testing conditions. A multi-stream approach for keyword spotting is proposed to improve robustness in mismatched conditions. The assumption is that most real world noises are colored and do not affect the full spectrum equally, meaning certain parts of the spectrum can still provide reliable information characterizing the utterance. In the proposed method for keyword spotting, the full frequency band is split into several sub-bands, each of which contain both static and delta parameters. Robustness is achieved using only features from sub-bands with highest signal-tonoise ratio (SNR) during recognition, while ignoring sub-bands that are strongly affected by noise. The problem is how to correctly select and combine the useful bands for accurate recognition, without prior knowledge of the noise characteristics. In this paper we propose a new likelihood ratio, used both to select usable bands and provide a confidence measure for robust keyword spotting. Tests carried out using the TiDigits database show a significant improvement in keyword spotting performance compared to a product based approach. In addition, including a non-keyword test set from Resource Management results in a reduction of Equal Error Rate.
منابع مشابه
Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملRobust Multi-Keyword Spotting of Telephone Speech Using Stochastic Matching
In telephone speech recognition, the acoustic mismatch between the training and the test environment often causes severe degradation due to the channel distortion and ambient noise. In this paper, a two-level codebook-based stochastic matching (CBSM) is proposed to deal with the acoustic mismatch. For multi-keyword detection, we define a keyword relation table and a weighting function for reaso...
متن کاملA Survey on Various Word Spotting Techniques for Content Based Document Image Retrieval
Searching documents for information and retrieval of relevant documents is a basic activity. Various tools are readily available for searching and retrieval from digital documents, but not much robust methods are available for retrieval from historic documents and old manuscripts as they are not digitized but available in scanned formats. Conventional way of retrieval from scanned document imag...
متن کاملNoise Robust Keyword Spotting Using Deep Neural Networks For Embedded Platforms
The recent development of embedded platforms along with spectacular growth in communication networking technologies is driving the Internet of things to thrive. More complex tasks are now possible to operate in small devices such as speech recognition and keyword spotting which are in great demand. Traditional voice recognition approaches are already being used in several embedded applications,...
متن کاملKeyword Spotting Using Normalization of Posterior Probability Confidence Measures
Keyword Spotting Using Normalization of Posterior Probability Confidence Measures by Rachna Vijay Vargiya Thesis Advisor: Marius C. Silaghi, Ph.D. Keyword spotting techniques deal with recognition of predefined vocabulary keywords from a voice stream. This research uses HMM based keyword spotting algorithms for this purpose. The three most important componenets of a keyword detection system are...
متن کامل