Articulated Object Detection
نویسندگان
چکیده
We develop an object detection algorithm for difficult-to-detect objects with consumerlevel range cameras. The increasing popularity of such devices made the process of obtaining depth data extremely easy. The availability of depth data motivates the need for algorithms that will be able to process it in meaningful ways, so that it then can be used in applications like robot navigation or object retrieval. However the quality of the point clouds obtained using these cameras often leaves much to be desired, which makes the processing stage very challenging. This project aims at developing a method that is able to process the data in a way that will conceptually alleviate the data issues. In particular we consider the problem of detecting articulated objects like laptops in the office-type, indoor environments. This report has two main contributions. The first is a graph-based scene representation that approximates the captured environment using simple, well described geometric primitives (nodes) and describes the geometric relationships between them (edges). A series of experiments to determine set of stable features to detect these relationships has been performed. A graph created in this way serves as a global scene descriptor,and the object detection can be posed as a sub-graph matching problem. Ways of determining node-to-node correspondence between scene and object of interest graph representations were investigated. Simple method to find this node to node correspondence has been developed, exploiting the fact that most of the scenes can be described by small and simple graphs. The problem has been also posed to fit into existing, advanced framework for graph matching. The second contribution is the way of to reason about the missing data. This project focused on the laptop detection, that can suffer from dramatic data loss, due to the fact that laptop screen tend to reflect structured light pattern. The method described is able to cope with such missing data in a robust way that is also able to reason about the data loss due to other issues, i.e. partial occlusion. The system for laptop detection in cluttered office environment has been developed as a proof of concept for the presented approach. It is argued that the graph based approach is a reliable way of performing object detection, that however suffers greatly from the poor data quality, by performing the tests using synthetically generated data as well. The performance of the detection system is presented both for synthetically generated data and the captured real-world data set. The limitations and possibilities for improvements are discussed.
منابع مشابه
A Structural Filter Approach to Human Detection
Occlusions and articulated poses make human detection much more difficult than common more rigid object detection like face or car. In this paper, a Structural Filter (SF) approach to human detection is presented in order to deal with occlusions and articulated poses. A three-level hierarchical object structure consisting of words, sentences and paragraphs in analog to text grammar is proposed ...
متن کاملA Study on Human Gaze Detection Based on 3D Eye Model
Robust fake iris detection p. 10 A study on fast Iris restoration based on focus checking p. 19 A spatio-temporal metric for dynamic mesh comparison p. 29 Facetoface : an isometric model for facial animation p. 38 Matching two-dimensional articulated shapes using generalized multidimensional scaling p. 48 Further developments in geometrical algorithms for ear biometrics p. 58 Composition of com...
متن کاملArticulated Video Object Extraction by the Combination of Spatial and Temporal Segmentation
Generally moving object extraction schemes rely on either optical flow or frame difference. Though optical flow based methods can deal with moving camera situation with ease, the inconsistency at object boundaries causes extraction with inaccurate object boundaries. While frame difference approach, such as three frame difference, yields accurate object boundaries, it usually cannot deal with mo...
متن کاملRecognizing Articulated Objects Using Invariance
Articulated objects can have many degrees of freedom. Recognizing such an object in a single image can involve a search in a high-dimensional space that involves all these degrees of freedom, in addition to the usual unknown viewpoint. In this paper we use invariance to reduce this search space to a manageable size. Our method avoids feature detection for improved robustness. We apply the metho...
متن کاملFisher Discriminant Analysis (FDA), a supervised feature reduction method in seismic object detection
Automatic processes on seismic data using pattern recognition is one of the interesting fields in geophysical data interpretation. One part is the seismic object detection using different supervised classification methods that finally has an output as a probability cube. Object detection process starts with generating a pickset of two classes labeled as object and non-object and then selecting ...
متن کاملContours Extraction Using Line Detection and Zernike Moment
Most of the contour detection methods suffers from some drawbacks such as noise, occlusion of objects, shifting, scaling and rotation of objects in image which they suppress the recognition accuracy. To solve the problem, this paper utilizes Zernike Moment (ZM) and Pseudo Zernike Moment (PZM) to extract object contour features in all situations such as rotation, scaling and shifting of object i...
متن کامل