Point Cloud Computing for Rigid and Deformable 3D Object Recognition
نویسنده
چکیده
Machine vision is a technologically and economically important field of computer vision. It eases automatization of inspection and manipulation tasks, which in turn enables cost savings and quality improvement in industrial processes. Usually, 2D or intensity images are used for such applications. However, thanks to several technological advances, nowadays there are sensors available that allow depth or 3D measurements with high resolution, frequency and accuracy at a reasonable cost. Such 3D data enables new applications that are difficult or impossible to implement with 2D images only. This work develops several performant, robust and accurate algorithms for processing such 3D data. The algorithms were developed with the requirements of industrial image processing in mind. They are, however, applicable to other areas such as robotics and reverse engineering as well. Two fundamental challenges are solved in this work: The fast localization of 3D points that neighbor a given query point and the detection of rigid and deformable objects in 3D point clouds or in multimodal data. Additionally, a fast and robust method for refining the position of two point clouds is presented and some fundamental algorithms regarding rotations are discussed. For the detection of nearest neighbors in 3D point clouds, a voxel-based method is introduced that allows almost constant lookup times of O(log(log(N))). Additionally, in contrast to prior art, lookup times are almost independent of the distribution of query and data points, allowing the use of this method in real-time systems. A variant of the method that allows for approximate nearest neighbor lookups further improves the required time for creating the underlying data structure. For the detection of rigid and deformable objects in 3D and multimodal data, a local variant of the Hough transform is introduced that circumvents the usual problems of voting schemes over high-dimensional parameter spaces. The very robust, fast, and generic baseline method detects rigid objects in 3D point clouds. It works for arbitrary free-form objects and different 3D sensors alike and can cope with large amounts of clutter, noise, occlusion, sparse data and multiple object instances. Three variants of the baseline method are introduced that detect rigid objects in multimodal data, geometric primitives in 3D point clouds, and deformable objects in 3D point clouds. For this, the parameter space, the used feature, the model representation and the number of voting rounds of the original method are modified according to the type of data and model. All methods are evaluated using up-to-date datasets.
منابع مشابه
A novel Local feature descriptor using the Mercator projection for 3D object recognition
Point cloud processing is a rapidly growing research area of computer vision. Introducing of cheap range sensors has made a great interest in the point cloud processing and 3D object recognition. 3D object recognition methods can be divided into two categories: global and local feature-based methods. Global features describe the entire model shape whereas local features encode the neighborhood ...
متن کاملUnsupervised Skeleton Extraction and Motion Capture from Kinect Video via 3D Deformable Matching
This paper presents a novel method to extract skeletons of complex articulated objects from 3D point cloud sequences collected by the Kinect. Our approach is more robust than the traditional video-based and stereobased approaches, as the Kinect directly provides 3D information without any markers, 2D-to-3D-transition assumptions, and feature point extraction. We track all the raw 3D points on t...
متن کاملDisplacement-based grasping of deformable objects
Robotic grasping of deformable objects is inherently different from that of rigid objects, and is an under-researched area. Difficulties arise not only from expensive deformable modeling, but also from the changing object geometry under grasping force. This dissertation studies strategies of grasping deformable objects using two robotic fingers. Discovering the inapplicability of the traditiona...
متن کاملTracking elastic deformable objects with an RGB-D sensor for a pizza chef robot
This paper presents a method for tracking a 3D textureless object which undergoes elastic deformations, using the point cloud data provided by an RGB-D sensor and in real-time. This solution is expected to be useful for enhanced manipulation of humanoid robotic systems, especially in the case of pizza dough to be ideally manipulated by a pizza chef robot. Our tracking framework relies on a prio...
متن کامل106. The 3D object recognition with environmental adaptability based on VFH descriptor and region growing segmentation
3D object recognition is a basic research in the machine vision field. Microsoft KINECT V2 is utilized to collect external environmental information. The point cloud file is obtained after processing the collected information. In order to filter the point cloud and obtain point cloud model of a single object in the environment after region growing segmentation, the point cloud is applied to poi...
متن کامل