Learning body part‐based pose lexicons for semantic action recognition

نویسندگان

چکیده

Semantic action recognition aims to classify actions based on the associated semantics, which can be applied in video captioning and human-machine interaction. In this paper problem is addressed by jointly learning multiple pose lexicons body parts. Specifically, visual models are learnt, one model with part, characterises likelihood of an observed frame being generated from hidden poses. Moreover, lexicon simultaneously learnt along models. One part that establishes a probabilistic mapping between poses semantic parsed textual instructions. To capture temporal relations among parts, transition also measure probability alignment transitioned position another position. The part-based provides novel method cross-modality correlation, other spatial data. Action classification finally formulated as finding maximum posterior given sequences frames follow poses, subject most likely sequences. Experiments were conducted five datasets validate effectiveness proposed method.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Action recognition by learning pose representations

Pose detection is one of the fundamental steps for the recognition of human actions. In this paper we propose a novel trainable detector for recognizing human poses based on the analysis of the skeleton. The main idea is that a skeleton pose can be described by the spatial arrangements of its joints. Starting from this consideration, we propose a trainable pose detector, that can be configured ...

متن کامل

Learning Compact Lexicons for CCG Semantic Parsing

We present methods to control the lexicon size when learning a Combinatory Categorial Grammar semantic parser. Existing methods incrementally expand the lexicon by greedily adding entries, considering a single training datapoint at a time. We propose using corpus-level statistics for lexicon learning decisions. We introduce voting to globally consider adding entries to the lexicon, and pruning ...

متن کامل

Latent Pose Estimator for Continuous Action Recognition

Recently, models based on conditional random fields (CRF) have produced promising results on labeling sequential data in several scientific fields. However, in the vision task of continuous action recognition, the observations of visual features have dimensions as high as hundreds or even thousands. This might pose severe difficulties on parameter estimation and even degrade the performance. To...

متن کامل

Efficient Pose-Based Action Recognition

Action recognition from 3d pose data has gained increasing attention since the data is readily available for depth or RGB-D videos. The most successful approaches so far perform an expensive feature selection or mining approach for training. In this work, we introduce an algorithm that is very efficient for training and testing. The main idea is that rich structured data like 3d pose does not r...

متن کامل

Learning semantic features for action recognition via diffusion maps

Efficient modeling of actions is critical for recognizing human actions. Recently, bag of video words (BoVW) representation, in which features computed around spatiotemporal interest points are quantized into video words based on their appearance similarity, has been widely and successfully explored. The performance of this representation however, is highly sensitive to two main factors: the gr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Iet Computer Vision

سال: 2022

ISSN: ['1751-9632', '1751-9640']

DOI: https://doi.org/10.1049/cvi2.12143