Errors, Repetition, and Contrastive Emphasis in Speech Recognition
نویسندگان
چکیده
In repeating an utterance for the benefit of a listener, talkers can use contrastive stress to mark those parts of an utterance that were misrecognized. In this study, we found that talkers use a consistent set of markers to indicate contrastive stress: Listeners unaware of the nature of the misrecognition can readily identify a contrastively stressed word. Based on an analysis of these markers, we have implemented an automatic algorithm that identifies contrastive stress with about the same accuracy as humans, using amplitude, duration, silence, and pitch cues. This ability to detect contrastive stress can be effectively exploited, as part of error recovery strategies, by a recognition system.
منابع مشابه
An Experimental Study on Dynamic Features of Speech Structure
One of the biggest difficulties in automatic speech recognition (ASR) is how to deal with variations of speech signals caused by non-linguistic information, such as age, gender, etc. Various methods have been proposed to compensate for the variations and one of them is speech structure [1]. Speech structure, which extracts only contrastive features and discards absolute features, is proved to b...
متن کاملCompensating hyperarticulation for automatic speech recognition
This thesis details the effects of hyperarticulation in the context of automatic speech recognition used for human-to-machine interaction. Hyperarticulation can be characterised as a speaking mode exhibiting an exaggerated articulation and occurs as a natural reaction in an effort to resolve recognition errors. Despite the user’s attempt to disambiguate word confusions, hyperarticulation causes...
متن کاملIdentification of Contrast and Its Emphatic Realization in HMM based Speech Synthesis
The work presented in this paper proposes to identify contrast in the form of contrastive word pairs and prosodically signal it with emphatic accents in a Text-to-Speech (TTS) application using a Hidden-Markov-Model (HMM) based speech synthesis system. We first describe a novel method to automatically detect contrastive word pairs using textual features only and report its performance on a corp...
متن کاملTowards a Contrastive Pragmatic Analysis of Congratulation Speech Act in Persian and English
This paper aims at studying the speech act of congratulation in Persian and English with regard to semantic formulas. To gather the semantic formulas related to congratulation, the researchers chose 100 movies (50 in Persian and 50 in English) as the instrument of the study. The only model of cross-cultural comparison was related to that of Elwood (2004). Therefore, we used Elwood’s model as th...
متن کاملA Comparative Study of Feature-Domain Error Concealment Techniques for Distributed Speech Recognition
This paper presents a comparative study of different error concealment (EC) techniques in the context of distributed speech recognition (DSR) that exploits repetition, interpolation or subvector concealment to counteract transmission errors. A number of experiments are conducted and the results demonstrate that repetition is as good as, or even better than, linear interpolation whereas the subv...
متن کامل