Log-Euclidean bag of words for human action recognition
نویسندگان
چکیده
Representing videos by densely extracted local space-time features has recently become a popular approach for analysing actions. In this paper, we tackle the problem of categorising human actions by devising Bag of Words (BoW) models based on covariance matrices of spatio-temporal features, with the features formed from histograms of optical flow. Since covariance matrices form a special type of Riemannian manifold, the space of Symmetric Positive Definite (SPD) matrices, non-Euclidean geometry should be taken into account while discriminating between covariance matrices. To this end, we propose to embed SPD manifolds to Euclidean spaces via a diffeomorphism and extend the BoW approach to its Riemannian version. The proposed BoW approach takes into account the manifold geometry of SPD matrices during the generation of the codebook and histograms. Experiments on challenging human action datasets show that the proposed method obtains notable improvements in discrimination accuracy, in comparison to several state-of-the-art methods.
منابع مشابه
Identification of Human Actions in Video Database
The Human activity detection is a more highly sought research as they are used for the Surveillance, Healthcare system, content-based search, and interactive game applications. The Human activity recognition is carried out by three main stages namely object segmentation, feature extraction, their representation and human action detection using different algorithms. This paper is an attempt to i...
متن کاملLearning Mid-level Words on Riemannian Manifold for Action Recognition
Human action recognition remains a challenging task due to the various sources of video data and large intraclass variations. It thus becomes one of the key issues in recent research to explore effective and robust representation to handle such challenges. In this paper, we propose a novel representation approach by constructing mid-level words in videos and encoding them on Riemannian manifold...
متن کاملDiscriminant Bag of Words based representation for human action recognition
In this paper we propose a novel framework for human action recognition based on Bag of Words (BoWs) action representation, that unifies discriminative codebook generation and discriminant subspace learning. The proposed framework is able to, naturally, incorporate several (linear or non-linear) discrimination criteria for discriminant BoWs-based action representation. An iterative optimization...
متن کاملHuman Action Recognition Based on Boosting
Human action recognition is an active research field in computer vision and image processing. In this paper we propose a novel method for the task of recognition of human actions in video image sequences. First of all, a video sequence is represented as a collection of spatial-temporal words by extracting space-time interest points, which is used to characterize action. Then visual words are us...
متن کاملEfficient Codebook for Human Activity Recognition in Surveillance Video
M. E. Student, Dept of Computer Science, Annamalai University, India. Associate Professor, Dept of Computer Science, Annamalai University, India. Research Scholar, Dept of Computer Science, Annamalai University, India. [email protected], [email protected], [email protected] ABSTRACT—Automatic human activity recognition methods are useful for many applications such as Video Surveillanc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IET Computer Vision
دوره 9 شماره
صفحات -
تاریخ انتشار 2015