Adaptive Semantic-Spatio-Temporal Graph Convolutional Network for Lip Reading
Abstract
The goal of this work is to recognize words, phrases, and sentences being spoken by a talking face without being given the audio. Current deep learning approaches for lip reading focus on exploring the appearance and optical flow information of videos. However, these methods do not fully exploit the characteristics of lip motion. In addition to appearance and optical flow, the mouth contour deformation usually conveys significant information that is complementary to the others. However, the modeling of dynamic mouth contours has received less attention than that of appearance and optical flow. In this work, we propose a novel model of dynamic mouth contours called Adaptive Semantic-Spatio-Temporal Graph Convolution Network (ASST-GCN), to go beyond previous methods by automatically learning both the spatial and temporal information from videos. To combine the complementary information from appearance and mouth contour, a two-stream visual front-end network is proposed. Experimental results demonstrate that the proposed method significantly outperforms the state-of-the-art lip reading methods on several large-scale lip reading benchmarks.
Similar resources
Spatio-temporal Graph Convolutional Neural Network: A Deep Learning Framework for Traffic Forecasting
The goal of traffic forecasting is to predict the future vital indicators (such as speed, volume and density) of the local traffic network in reasonable response time. Due to the dynamics and complexity of traffic network flow, typical simulation experiments and classic statistical methods cannot satisfy the requirements of mid-and-long term forecasting. In this work, we propose a novel deep le...
Spatio-Temporal Adaptive Interlaced
In this paper, we propose two adaptive interlaced-to-progressive conversion techniques in which the adequacy of the estimated motion vector is evaluated. If the motion vector is unlikely to give a good temporal motion compensated interpolation result, spatial interpolation is favored or selected to avoid temporal artifacts. In the first proposed interlaced-to-progressive conversion technique, cal...
Convolutional Learning of Spatio-temporal Features
We address the problem of learning good features for understanding video data. We introduce a model that learns latent representations of image sequences from pairs of successive images. The convolutional architecture of our model allows it to scale to realistic image sizes whilst using a compact parametrization. In experiments on the NORB dataset, we show our model extracts latent “flow fields...
Topology adaptive graph convolutional networks
Convolution acts as a local feature extractor in convolutional neural networks (CNNs). However, the convolution operation is not applicable when the input data is supported on an irregular graph such as with social networks, citation networks, or knowledge graphs. This paper proposes the topology adaptive graph convolutional network (TAGCN), a novel graph convolutional network that generalizes ...
Adaptive Graph Convolutional Neural Networks
Graph Convolutional Neural Networks (Graph CNNs) are generalizations of classical CNNs to handle graph data such as molecular data, point clouds and social networks. Current filters in graph CNNs are built for fixed and shared graph structure. However, for most real data, the graph structures vary in both size and connectivity. The paper proposes a generalized and flexible graph CNN taking dat...
Journal
Journal title: IEEE Transactions on Multimedia
Year: 2022
ISSN: 1520-9210, 1941-0077
DOI: https://doi.org/10.1109/tmm.2021.3102433