A fuzzy acoustic-phonetic decoder for speech recognition
نویسندگان
چکیده
In this paper, a general framework of acoustic-phonetic modelling is developed. Context sensitive rules are incorporated into a knowledge-based automatic speech recognition (ASR) system and are assessed with control based on fuzzy decision making. The reliability measure is outlined: a tests collection is run and a confusion matrix is built for each rule. During the recognition procedure the fuzzy set of trained values related to the phonetic unit to be recognized is computed, and its membership function is automatically drawn.
منابع مشابه
Rescoring under fuzzy measures with a multilayer neural network in a rule-based speech recognition system
In this paper, a speech rescoring system is developed on a set of phonetic hypotheses produced by a bottom-up knowledge-based decoder. An original method to automatically compute a fuzzy membership function from top-down acoustic rules statistics is compared with a possibilistic measure. To aggregate the fuzzy degrees into a phonetic score, a mutilayer neural network is trained on the results o...
متن کاملEnglish-Spanish Bilingual Alphabet for Embedded Speech Recognition
This article introduces the phonetic alphabet that has been used to train acoustic models with a mixture of Spanish language and American English data, with the purpose of improving the speech recognition performance, when using Spanish, for speakers that are fluent in both languages, as is very frequently the case in the USA Spanish speaking population. We target a decoder that can be used in ...
متن کاملA fuzzy synchronization algorithm for bimodal speech signals
This paper describes a rule{based fuzzy system that estimates the relationship between acoustic and visual speech and uses this estimate for the synchronization of not aligned audio{visual signals. The relations are quantiied by means of a set of rules, which associate typical mouth shapes (visual classes) to speciic acoustic classes. The visual and acoustic classes are learned from training da...
متن کاملSpeech Understanding and Speech Translation in Various Domains by Maximum A-posteriori Semantic Decoding
This paper describes a domain-limited system for speech understanding as well as for speech translation. An integrated semantic decoder directly converts the preprocessed speech signal into its semantic representation by a maximum a-posteriori classification. With the combination of probabilistic knowledge on acoustic, phonetic, syntactic, and semantic levels, the semantic decoder extracts the ...
متن کاملA Stack Decoder for Continous Speech Recognition
We describe the structure, preliminary implementation and performance of an algorithm for doing continuous speech recognition. The algorithm, known as a stack decoder, proceeds by continually evaluating one-word extensions of the most promising partial transcriptions of an input utterance. The output is a list of candidate complete transcriptions, ordered by likelihood under a stochastic model....
متن کامل