Hypothesis dependent threshold setting for improved out-of-vocabulary data rejection
نویسندگان
چکیده
An efficient rejection procedure is necessary to reject out-ofvocabulary words and noise tokens that occur in voice activated vocal services. Garbage or filler models are very useful for such a task. However, a post-processing of the recognized hypothesis, based on a likelihood ratio statistic test, can refine the decision and improve performance. These tests can be applied either on acoustic parameters or on phonetic or prosodic parameters that are not taken into account by the HMM-based decoder. This paper focuses on the post-processing procedure and shows that making the likelihood ratio decision threshold dependent on the recognized hypothesis largely improves the efficiency of the rejection procedure. Models and anti-models are one of the keypoints of such an approach. Their training and usage are also discussed, as well as the contextual modeling involved. Finally results are reported on a field database collected from a 2000word directory task using various phonetic and prosodic parameters.
منابع مشابه
A phone-dependent confidence measure for utterance rejection
An acoustic con dence measure for acceptance/rejection of recognition hypotheses for continuous speech utterances is proposed. This measure is useful for rejecting utterances that are out of domain, or contain out-of-vocabulary words or speech dis uencies. A phone-based approach is implemented so that a single global threshold can be applied to hypothesis rejection for any word sequence. Phone ...
متن کاملUse of a confidence measure based on frame level likelihood ratios for the rejection of incorrect data
Interactive vocal services are based on speech recognition systems which must be able to reject efficiently incorrect utterances such as out-of-vocabulary or noise tokens. A possible approach is a post-processing of the hypotheses delivered by the recogniser, based on the computation of a confidence measure (CM). A recognition hypothesis is rejected if its CM is below a chosen threshold. This p...
متن کاملTask-Induced Involvement in L2 Vocabulary Learning: A Case for Listening Comprehension
The study aimed at investigating whether the retention of vocabulary acquired incidentally is dependent upon the amount of task-induced involvement. Immediate and delayed retention of twenty unfamiliar words was examined in three learning tasks( listening comprehension + group discussion, listening comprehension + dictionary checking + summary writing in L1, and listening comprehension + dictio...
متن کاملRejection in speech recognition systems with limited training
In this paper, we propose a new rejection criterion applicable specifically to limited-training speech recognition systems such as Speaker-Dependent (SD) recognition systems. The new criterion uses confidence measures as well as speakerspecific out-of-vocabulary (OOV) models. The OOV models are created from the same training data that is available to create the in-vocabulary (IV) word models. W...
متن کاملRobust rejection for embedded systems
This paper compares three low cost rejection-modeling approaches and their combination under the framework of a voice control task. Whole word acoustic models were created with in-vocabulary training data. Rejection parameters were adjusted with validation data to find optimal rejection performance for out-of-vocabulary commands within a 3% error rate limit for in-vocabulary commands. The rejec...
متن کامل