A Study of the Effectiveness of Articulatory Strokes for Phonemic Recognition
نویسندگان
چکیده
This paper explores a framework to incorporate articulatory movement information into a classical ASR scheme based on the concept of articulatory stroke. Articulatory stroke is a geometrical segmental unit which corresponds to a target approaching-releasing articulatory gesture. It has been shown that critical and non-critical (i.e., secondary or dummy) articulatory gestures can be classified with about 88% accuracy using the stroke parameters. Phonetic recognition accuracy is also investigated by augmenting the conventional MFCC features with the articulatory stroke features (obtained using the MOCHA corpus). It is found that the phonetic recognition accuracy increases 15% with respect to the best result using the ordinary MFCC parameters only. This provides supporting evidence for the usefulness of the articulatory stroke representation of articulatory movements not only for speech production description but also for automatic speech recognition.
منابع مشابه
Recognition of Sequence of Print and Ink Strokes: Investigation the Effect of Handwriting Pressure, Hue of Ink, Printer and Paper Type
By introducing of digital techniques, forensic document examiners has been encouraged to work with better accuracy in non-destructive ways. The aim of this study was to present a non-destructive, accessible, economic (affordable), user friendly, portable, useful and easy technique for specifying the order of crossing lines of ink stroke and printed text. The intersections of LaserJet and In...
متن کاملبازشناسی برخط حروف مجزای دستنویس فارسی بر اساس تشخیص گروه بدنه اصلی با استفاده از ماشین بردار پشتیبان
In this paper a new method for the online recognition of handwritten Persian characters has been proposed which uses a set of simple features and Support Vector Machine (SVM) as a classifier. The task of preprocessing allows us to equalize feature vectors from different characters. This algorithm is implemented in two steps. In the first step, input character is classified into one of eighteen ...
متن کاملStructural Representation and Matching of Articulatory Speech Structures based on the Evolving Transformation System (ETS) Formalism
A formal structural representation of speech consistent with the principles of combinatorial structure theory is presented in this paper. The representation is developed within the Evolving Transformation System (ETS) formalism and encapsulates speech processes at the articulatory level. We show how the class structure of several consonantal phonemes of English can be expressed with the help of...
متن کاملThe Effect of Transcribing on Beginning Learners’ Phonemic Perception
A large number of studies dealing with phonology have focused their attention on phonological production at the expense of phonological perception which provides the foundation stone for phonological production. This study focuses on phonological perception at phonemic level. The purpose of the study is helping beginning learners improve their perception of the English phonemes which are confus...
متن کاملHierarchical partition of the articulatory state space for overlapping-feature based speech recognition
We describe our recent work on improving an overlapping articulatory feature (sub-phonemic) based speech recognizer with robustness to the requirement of training data. A new decision-tree algorithm is developed and applied to the recognizer design which results in hierarchical partitioning of the articulatory state space. The articulatory states associated with common acoustic correlates, a ph...
متن کامل