New efficient fillers for unlimited word recognition and keyword spotting
Abstract
This paper describes our complete results for improved lexical fillers as well as two new kinds of fillers, gives their results in unlimited speech recognition as well as for keyword spotting, and compares them to the acoustic-phonetic filler in the case of keyword spotting. Tests have been conducted on different vocabularies derived from ATIS and the Wall Street Journal database. Results for keyword spotting show the superiority of the independent lexical phonemic filler, which combines accuracy (92% for a false alarm rate of 1.2 FA/h/kw) with task-independent training. As for new-word detection, the syllabic and the independent lexical fillers perform quite well, and allow relevant detection of the phonetic transcription.
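The operating point quoted in the abstract pairs a detection rate with a false alarm rate expressed in false alarms per hour per keyword (FA/h/kw). A minimal sketch of how these two figures are conventionally computed is given here; the function name and the counts in the usage lines are hypothetical and chosen for illustration only, they are not taken from the paper.

```python
def keyword_spotting_metrics(hits, true_occurrences, false_alarms,
                             hours_of_speech, num_keywords):
    """Return (detection_rate, false_alarms_per_hour_per_keyword).

    Hypothetical helper: the argument names are assumptions for this
    sketch, not identifiers from the paper.
    """
    if true_occurrences <= 0 or hours_of_speech <= 0 or num_keywords <= 0:
        raise ValueError("occurrences, hours and keyword count must be positive")
    # Fraction of actual keyword occurrences that were detected.
    detection_rate = hits / true_occurrences
    # False alarms normalised by test duration and vocabulary size.
    fa_per_hour_per_kw = false_alarms / (hours_of_speech * num_keywords)
    return detection_rate, fa_per_hour_per_kw


# Illustrative counts only (not data from the paper).
rate, fa = keyword_spotting_metrics(
    hits=460, true_occurrences=500,
    false_alarms=120, hours_of_speech=5.0, num_keywords=20,
)
print(f"detection rate = {rate:.0%}, false alarms = {fa:.1f} FA/h/kw")
```

Normalising by both duration and keyword count is what makes the false alarm rate comparable across test sets with different lengths and vocabulary sizes.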
Similar resources
Powerful syllabic fillers for general-task keyword-spotting and unlimited-vocabulary continuous-speech recognition
Since the number of vocabulary words is often very large in both general-task keyword spotting and unlimited-vocabulary continuous-speech recognition, we choose to represent, unlike other teams, vocabulary words and out-vocabulary words with the same set of subword HMMs. Secondly we replace the classical one-phoneme transcription of fillers in the lexicon by a new, more powerful one-syllable tran...
Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword spotting is a well-known method in document image retrieval. In this method, search in document images is based on a query word image. In this paper, an approach for document image retrieval based on keyword spotting is proposed. In the proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method, is used in this paper to imp...
A probabilistic method for keyword retrieval in handwritten document images
Keyword retrieval in handwritten document images (word spotting) is very challenging given that OCR accuracy is not yet adequate for handwritten scripts, especially with large lexicons. Various proposed approaches build indices on information such as image features or OCR scores and have improved the performance of the traditional approach that builds an index on OCR'ed text. In this paper, we impr...
Keyword spotting in unconstrained handwritten Chinese documents using contextual word model
Keywords: keyword spotting; Chinese handwritten documents; word similarity; contextual word model. This paper proposes a method for keyword spotting in off-line Chinese handwritten documents using a contextual word model, which measures the similarity between the query word and every candidate word in the document by combining a character classifier and the geometric context a...
Error spotting using syllabic fillers in spontaneous conversational speech recognition
Spontaneous conversational phone-call speech databases are difficult to recognize because of the large variation of speech rates, of pronunciations as well as noises, of acoustic degradations from the telephone channel, and of an unpredictable non-grammatical language structure including many random phenomena. Each cause of misrecognition can be addressed separately; however there is still no s...