Learning human activities and object affordances from RGB-D videos
نویسندگان
چکیده
منابع مشابه
Learning human activities and object affordances from RGB-D videos
Understanding human activities and object affordances are two very important skills, especially for personal robots which operate in human environments. In this work, we consider the problem of extracting a descriptive labeling of the sequence of sub-activities being performed by a human, and more importantly, of their interactions with the objects in the form of associated affordances. Given a...
متن کاملHuman Activity Learning using Object Affordances from RGB-D Videos
Human activities comprise several sub-activities performed in a sequence and involve interactions with various objects. This makes reasoning about the object affordances a central task for activity recognition. In this work, we consider the problem of jointly labeling the object affordances and human activities from RGBD videos. We frame the problem as a Markov Random Field where the nodes repr...
متن کاملLearning Spatio-Temporal Structure from RGB-D Videos for Human Activity Detection and Anticipation
We consider the problem of detecting past activities as well as anticipating which activity will happen in the future and how. We start by modeling the rich spatio-temporal relations between human poses and objects (called affordances) using a conditional random field (CRF). However, because of the ambiguity in the temporal segmentation of the sub-activities that constitute an activity, in the ...
متن کاملLearning Graph Matching for Object Detection from Rgb-d Images
We propose an optimization method for estimating parameters in graph-theoretical formulations of the matching problem for object detection. Unlike several methods which optimize parameters for graph matching in a way to promote correct correspondences and to restrict wrong ones, our approach aims at improving performance in the more general task of object detection. In our formulation, similari...
متن کاملHuman activity analysis and classification using RGB-D videos
This thesis encompasses parts of the field of computer vision. The main problem dealt with throughout this project is how to automate classification of specific human activities via video streams. The essence of this project is therefore that the developed algorithm shall be able to distinguish these activities from one another. A number of restrictions were imposed on the data-set in order to ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The International Journal of Robotics Research
سال: 2013
ISSN: 0278-3649,1741-3176
DOI: 10.1177/0278364913478446