Search results for: lip reading

Number of results: 130722

2010
Takeshi Saitoh Ryosuke Konishi

This paper discusses the influence of frame rate on word lip reading. The proposed method applies an active appearance model to extract the face and several lip regions. It then calculates trajectory features and applies DP matching. The target vocabulary was 25 Japanese words, and 250 utterance scenes per speaker were recorded from ten Japanese men. Though many of research...
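The DP matching step mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not say which features or which DP variant the authors use, so the code below assumes classic dynamic time warping with a Euclidean local cost over 2-D trajectory points. The names `dtw_distance`, `classify`, and the toy templates are hypothetical.

```python
import math


def dtw_distance(a, b):
    """Alignment cost between two trajectories (sequences of equal-dimension points)."""
    n, m = len(a), len(b)
    # cost[i][j] = best cost of aligning the first i points of a with the first j of b
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # assumed local cost: Euclidean distance
            cost[i][j] = d + min(cost[i - 1][j],      # a advances, b repeats
                                 cost[i][j - 1],      # b advances, a repeats
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]


def classify(query, templates):
    """Return the word whose template trajectory has the lowest alignment cost."""
    return min(templates, key=lambda word: dtw_distance(query, templates[word]))


# Toy data (not from the paper): the query repeats its first frame,
# as a recording at a higher frame rate might.
templates = {
    "word_a": [(0, 0), (1, 1), (2, 2)],
    "word_b": [(0, 0), (0, 1), (0, 2)],
}
query = [(0, 0), (0, 0), (1, 1), (2, 2)]
best = classify(query, templates)
```

Because the alignment may repeat frames, sequences of different lengths remain comparable, which is one reason this kind of matching is a natural fit when the frame rate varies.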

Journal: CoRR 2018
M. Faisal Sanaullah Manzoor

Human lip-reading is a challenging task. It requires not only knowledge of the underlying language but also visual clues to predict spoken words. Experts need a certain level of experience and an understanding of visual expressions to decode spoken words. Nowadays, with the help of deep learning, it is possible to translate lip sequences into meaningful words. The speech recognition in the noi...

2016
Daehyun Lee Jongmin Lee Kee-Eung Kim

It is well known that automatic lip-reading (ALR), also known as visual speech recognition (VSR), enhances the performance of speech recognition in noisy environments and also has applications of its own. However, ALR is a challenging task due to the variety of lip shapes and the ambiguity of visemes (the basic unit of visual speech information). In this paper, we tackle ALR as a classification task using en...

Journal: CoRR 2017
Hassan Akbari Himani Arora Liangliang Cao Nima Mesgarani

In this study, we propose a deep neural network for reconstructing intelligible speech from silent lip movement videos. We use an auditory spectrogram as the spectral representation of speech, together with its corresponding sound generation method, resulting in more natural-sounding reconstructed speech. Our proposed network consists of an autoencoder to extract bottleneck features from the auditory spectrogra...

2008
Stephen J. Cox Richard Harvey Yuxuan Lan Jacob L. Newman Barry-John Theobald

In speech recognition, the problem of speaker variability has been well studied. Common approaches to dealing with it include normalising for a speaker’s vocal tract length and learning a linear transform that moves the speaker-independent models closer to a new speaker. In pure lip-reading (no audio) the problem has been less well studied. Results are often presented that are based on speak...

2015
Helen L. Bear Richard Harvey Yuxuan Lan

In machine lip-reading there is continued debate and research around the correct classes to be used for recognition. In this paper we use a structured approach for devising speaker-dependent viseme classes, which enables the creation of a set of phoneme-to-viseme maps where each has a different quantity of visemes ranging from two to 45. Viseme classes are based upon the mapping of articulated ...
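A phoneme-to-viseme map of the kind this abstract describes is a many-to-one mapping, which a small sketch can make concrete. The groupings below are illustrative assumptions and not the speaker-dependent classes derived in the paper; the viseme labels and the `to_visemes` helper are hypothetical.

```python
# Hypothetical map: several phonemes that look alike on the lips share one viseme.
PHONEME_TO_VISEME = {
    "p": "V_bilabial", "b": "V_bilabial", "m": "V_bilabial",
    "f": "V_labiodental", "v": "V_labiodental",
    "t": "V_alveolar", "d": "V_alveolar",
    "ae": "V_open",
}


def to_visemes(phonemes, mapping=PHONEME_TO_VISEME):
    """Translate a phoneme sequence into its viseme sequence."""
    return [mapping[p] for p in phonemes]


# "pat", "bat" and "mat" differ in sound but collapse to one viseme sequence.
pat = to_visemes(["p", "ae", "t"])
bat = to_visemes(["b", "ae", "t"])
mat = to_visemes(["m", "ae", "t"])
num_visemes = len(set(PHONEME_TO_VISEME.values()))
```

A map with fewer viseme classes merges more phonemes and so makes more words visually indistinguishable, which is the trade-off at stake when the number of classes is varied.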

Journal: CoRR 2010
Ahmad Basheer Hassanat


Journal: Signal & Image Processing: An International Journal 2018

Journal: IEEE Access 2021

This paper presents a survey of automated lip-reading approaches, focusing mainly on deep learning methodologies, which have proven more fruitful for both feature extraction and classification. The survey also compares the different components that make up such systems, including audio-visual databases, feature extraction, classification networks, and classification schemas. The contributions ...

Journal: Pattern Recognition 2012
Yiu-ming Cheung Xin Liu Xinge You

Lip tracking plays a significant role in a lip reading system. In this paper, we present a local region based approach to lip tracking, which consists of two phases: (i) lip contour extraction for the first lip frame, followed by (ii) lip tracking in the subsequent lip frames. Initially, we construct a localized color active contour model provided that the foreground and background regio...
