Improved estimation, evaluation and applications of confidence measures for speech recognition
نویسندگان
چکیده
This paper describes our approach to the estimation of con dence in the words generated by a speech recognition system. We describe the models and the features employed for con dence estimation. In addition we discuss the characteristics of an information-theoretic metric for assessing the performance of the con dence measure. We provide a simple application of con dence measures in which we rank the performance of speakers.
منابع مشابه
Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
متن کاملNew word-level and sentence-level confidence scoring using graph theory calculus and its evaluation on speech understanding
A lot of work has been devoted to the estimation of confidence measures for speech recognizers. In the quite extended case where a word-graph speech recognizer is in use, we will present new confidence measures employing the graph theory that shows us how to estimate some interesting characteristics about the different paths through the graph that constitute the recognition solutions, without t...
متن کاملConfidence Measures for Large Vocabulary Continuous Speech Recognition
Estimation confidence of the output hypothesis of a speech recognizer can be used in many practical applications of speech recognition technology. In this paper, we propose to estimate the confidence of a hypothesized word directly as its posterior probability. There are tow methods to calculate posterior probabilities, one of which is based on word graphs and the other is based on N-best lists...
متن کاملLocal word confidence measure using word graph and n-best list
This paper presents some confidence measures for large vocabulary speech recognition which are based on word graph or N-Best List structures. More and more applications need fast estimation of any measures in order to stay real-time. We propose some simple and fast measures, locally computed, that can be directly used within the first decoding recognition process. We also define some other meas...
متن کاملConfidence measures for large vocabulary continuous speech recognition
In this paper, we present several confidence measures for large vocabulary continuous speech recognition. We propose to estimate the confidence of a hypothesized word directly as its posterior probability, given all acoustic observations of the utterance. These probabilities are computed on word graphs using a forward–backward algorithm. We also study the estimation of posterior probabilities o...
متن کامل