3-D Geometry Enhanced Superpixels for RGB-D Data
نویسندگان
چکیده
Abstract. This paper introduces a novel 3-D geometry enhanced superpixels for RGB-D data. First, we reconstruct the 3-D geometry of the scene by projecting the depth map into 3-D coordinates. Then, a distance metric for superpixel clustering is constructed using 3-D geometry and color information. Finally, pixels are iteratively clustered into superpixels using the proposed distance metric. The proposed method is able to distinguish objects in similar colors due to the introduced 3-D geometry. The oversegmentation results on RGB-D pairs in the Middlebury datasets demonstrate that our approach shows better performance than other three state-of-the-art superpixel methods. The proposed superpixels are also evaluated in the application of segmentation, and we achieve the best segmentation results compared with three state-of-theart segmentation methods.
منابع مشابه
Joint 3D Object and Layout Inference from a Single RGB-D Image
Inferring 3D objects and the layout of indoor scenes from a single RGB-D image captured with a Kinect camera is a challenging task. Towards this goal, we propose a high-order graphical model and jointly reason about the layout, objects and superpixels in the image. In contrast to existing holistic approaches, our model leverages detailed 3D geometry using inverse graphics and explicitly enforce...
متن کاملUnsupervised Segmentation of RGB-D Images
While unsupervised segmentation of RGB images has never led to results comparable to supervised segmentation methods, a surprising message of this paper is that unsupervised image segmentation of RGB-D images yields comparable results to supervised segmentation. We propose an unsupervised segmentation algorithm that is carefully crafted to balance the contribution of color and depth features in...
متن کاملHand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable enhances in recognizing hand gestures from still images, there are still many challenges in the classification of hand gestures in videos. The latter comes with more challenges, including higher computational complexity and arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
متن کامل21/2 D Scene Reconstruction of Indoor Scenes from Single RGB-D Images
Using the Manhattan world assumption we propose a new method for global 21/2D geometry estimation of indoor environments from single low quality RGB-D images. This method exploits both color and depth information at the same time and allows to obtain a full representation of an indoor scene from only a single shot of the Kinect sensor. The main novelty of our proposal is that it allows estimati...
متن کاملمدلسازی صفحهای محیطهای داخلی با استفاده از تصاویر RGB-D
In robotic applications and especially 3D map generation of indoor environments, analyzing RGB-D images have become a key problem. The mapping problem is one of the most important problems in creating autonomous mobile robots. Autonomous mobile robots are used in mine excavation, rescue missions in collapsed buildings and even planets’ exploration. Furthermore, indoor mapping is beneficial in f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013