Mouth/voice synthesis for lipreading research
Similar resources
Classifying Visemes for Automatic Lipreading
Automatic lipreading is automatic speech recognition that uses only visual information. The relevant data in a video signal is isolated and features are extracted from it. From a sequence of feature vectors, where every vector represents one video image, a sequence of higher level semantic elements is formed. These semantic elements are “visemes”, the visual equivalent of “phonemes”. The develope...
Comparing visual features for lipreading
For automatic lipreading, there are many competing methods for feature extraction. Often, because of the complexity of the task, these methods are tested only on quite restricted datasets, such as the letters of the alphabet or digits, and from only a few speakers. In this paper we compare some of the leading methods for lip feature extraction on the GRID dataset, which uses a co...
Learning Sequential Patterns for Lipreading
This paper presents a machine learning approach to Lip Reading and proposes a novel learning technique called sequential pattern boosting that allows us to efficiently search and combine temporal patterns to form strong spatio-temporal classifiers. Attempts at automatic lip reading need to address the demanding challenge that the problem is inherently temporal in nature. It is crucial to model ...
Real-time lip-tracking for lipreading
This paper presents a new approach to lip tracking for lipreading. Instead of only tracking features on lips, we propose to track lips along with other facial features such as pupils and nostrils. In the new approach, the face is first located in an image using a stochastic skin-color model; the eyes, lip-corners and nostrils are then located and tracked inside the facial region. The new approach ...
Incremental Difference as Feature for Lipreading
This paper presents a method of computing incremental difference features on the basis of scan line projection and scan converting lines for the lipreading problem on a set of isolated word utterances. These features are affine invariants and are found to be effective in identifying similarity between utterances by the speaker in the spatial domain. Keywords: Incremental Difference Feature, Eucli...
Journal
Journal title: The Journal of the Acoustical Society of America
Year: 1977
ISSN: 0001-4966
DOI: 10.1121/1.2015856