Using keyword spotting to help humans correct captioning faster
Authors
Abstract
Automatic real-time captioning provides immediate, on-demand access to spoken content in lectures or talks, and is a crucial accommodation for deaf and hard of hearing (DHH) people. However, in the presence of specialized content, as in technical talks, automatic speech recognition (ASR) still makes mistakes that may render the output incomprehensible. In this paper, we introduce a new approach that allows audience members or crowd workers to quickly correct errors that they spot in ASR output. Prior approaches required the crowd worker to manually "edit" the ASR hypothesis by selecting and replacing the text, which is not suitable for real-time scenarios. Our approach is faster and allows the worker to simply type corrections for misrecognized words as soon as he or she spots them. The system then finds the most likely position for the correction in the ASR output using keyword search (KWS) and stitches the word into the ASR output. Our work demonstrates the potential of computation to incorporate human input quickly enough to be usable in real-time scenarios, and may be a better method for providing this vital accommodation to DHH people.
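The locate-and-stitch step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify how the keyword search is implemented, so this example swaps in a much simpler technique: a text-only character similarity between the typed correction and short spans of the ASR hypothesis. The function names, the `max_span` window, and the `threshold` value are all hypothetical and chosen for illustration only.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1], ignoring case and spaces.

    Assumption: this is a crude text-only stand-in for a real KWS score.
    """
    a = a.lower().replace(" ", "")
    b = b.lower().replace(" ", "")
    return SequenceMatcher(None, a, b).ratio()


def stitch_correction(hypothesis, correction, max_span=3, threshold=0.5):
    """Replace the hypothesis span that best matches the typed correction.

    hypothesis: list of ASR output words
    correction: word or short phrase typed by the worker
    Returns (new_words, (start, end)), or (copy of hypothesis, None) when
    no span scores at or above the (hypothetical) threshold.
    """
    best_score, best_span = 0.0, None
    for start in range(len(hypothesis)):
        for length in range(1, max_span + 1):
            end = start + length
            if end > len(hypothesis):
                break
            # Score the span of `length` words starting at `start`.
            score = similarity(" ".join(hypothesis[start:end]), correction)
            if score > best_score:
                best_score, best_span = score, (start, end)
    if best_span is None or best_score < threshold:
        return list(hypothesis), None
    start, end = best_span
    return hypothesis[:start] + correction.split() + hypothesis[end:], best_span
```

With the hypothesis "the gaussian mixture modal uses cep strum features", typing "cepstrum" replaces the two-word span "cep strum", and typing "model" afterwards replaces "modal". A correction that resembles nothing in the hypothesis is left unplaced rather than forced into the text.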
Similar resources
Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword spotting is a well-known method in document image retrieval. In this method, search in document images is based on a query word image. In this paper, an approach for document image retrieval based on keyword spotting is proposed. In the proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method, is used in this paper to imp...
Prediction of keyword spotting accuracy based on simulation
This paper proposes a method of predicting the accuracy of keyword spotting in terms of the false alarm (FA) count and the spotting score of correct detections. A new measure F for predicting the FA count is calculated by simulating keyword spotting on phoneme sequences that a phoneme-based language model generates. Another measure C for predicting the spotting score of correct detections is obtained from a produc...
Topic recognition for news speech based on keyword spotting
This paper describes topic identification for Japanese TV news speech based on the keyword spotting technique. Three thousand nouns are selected as keywords that contribute to topic identification, based on the criteria of mutual information and word length. This set of keywords identified the correct topic for 76.3% of articles from newspaper text data. Further, we performed keywor...
Recognition and Rejection Performance in Wordspotting Systems Using Support Vector Machines
Support Vector Machines (SVM) is one such machine learning technique that learns the decision surface through a process of discrimination and has a good generalization capacity [6]. SVMs have been proven to be successful classifiers on several classical pattern recognition problems [9, 11]. In this paper, one of the first applications of the Support Vector Machines (SVM) technique for the problem of...
Performance Evaluation of Non-Keyword Modeling for Vocabulary-Independent Keyword Spotting
In this paper, we develop a keyword spotting system using a vocabulary-independent speech recognition technique, and investigate several non-keyword modeling methods to improve its performance. In order to overcome the weakness of the conventional syllable model, we propose a syllable filler based on the syllable information of keywords and a syllable-like filler model. The former prohibits syllable fill...
Journal title:
Volume / Issue
Pages -
Publication date: 2015