PixelLaser: Computing Range from Monocular Texture
نویسندگان
چکیده
The impressive advances in robotic spatial reasoning over the past decade have relied primarily on rich sensory data provided by laser range finders. Relative to cameras, however, lasers are heavy, bulky, power-hungry, and expensive. This work proposes and evaluates an image-segmentation pipeline that produces range scans from ordinary webcameras. Starting with a nearestneighbor classification of image patches, we investigate the tradeoffs in accuracy, resolution, calibration, and speed that come from estimating range-toobstacles using only single images. Experiments atop the low-cost iRobot Create platform demonstrate the accessibility and power of this pixel-based alternative to laser scans. 1 Motivation and Context Robots' spatial reasoning capabilities have matured a great deal over the past decade. Effective localization, mapping, and navigation algorithms have proven themselves in long-term autonomous driving [3,17], tour-guide [2,15], and office-navigation [8] applications. Robots’ most robust and widespread examples of spatial reasoning rely upon the ubiquitous laser range finder (LRF), which uses the time-of-flight of projected laser light to directly compute the range to nearby obstacles within the laser range finder’s field of view. Although effective, LRFs have drawbacks that have prevented them from entering the fast-growing field of low-cost commercially viable autonomous robots. As an alternative, monocular vision offers advantages relative to laser scans across several axes: cameras are less power-hungry, less heavy, less bulky, less range-limited, and, perhaps most importantly, less expensive. Less is more, however, when it comes to computation. Extracting range from pixel intensities requires far more algorithmic and computational effort than extracting range from time-of-flight. Range-from-vision approaches typically use temporal feature correspondence across a monocular image stream to deduce distance from pixels [6]. This body of work is mature, but it is worth noting that these techniques are most successful when significant spatial context is used to support feature matching. Large patches of pixels facilitate accurate and precise correspondence.
منابع مشابه
PixelLaser: Learning Range via Texture
The problem of finding a robot's range-to-obstacles is a fundamental one with an elegant solution: the laser range finder (LRF). This work has developed algorithms for replacing a laser with a camera for indoor applications. Our approach uses machine learning algorithms to segment the groundplane from single images flexibly, quickly, and robustly. We then transform those segmentations into lase...
متن کاملEcologically invalid monocular texture leads to longer perceptual latencies in random-dot stereograms.
Two experiments were conducted to explore Gillam and Borsting's (1988, Perception 17 603-608) report that uncorrelated monocular texture facilitates stereopsis by shortening the latency to see depth in random-dot stereograms. Experiment 1 used stereograms similar, in pattern but not disparity, to Gillam and Borsting's with monocular texture present or absent. A third condition, where monocular ...
متن کاملThree Dimensional Computer Vision for Micro Air Vehicles
Results are described of an ongoing project whose goal is to provide advanced Computer Vision for small low flying autonomous aircraft. The work consists of two parts; range-based vision for object recognition and pose estimation, and monocular vision for navigation and collision avoidance. A wide variety of range imaging methods were considered for the former, and it was found that a promising...
متن کاملContent and context of monocular regions determine perceived depth in random dot, unpaired background and phantom stereograms
Perceived depth was measured for three-types of stereograms with the colour/texture of half-occluded (monocular) regions either similar to or dissimilar to that of binocular regions or background. In a two-panel random dot stereogram the monocular region was filled with texture either similar or different to the far panel or left blank. In unpaired background stereograms the monocular region ei...
متن کاملDichoptically cancelled motion
We sought to determine whether or not motion-from-texture mechanisms have access to monocular input. Adopting a strategy used by Kolb and Braun (1995. Nature, 377, 336-338), we created drifting textures that were invisible to purely binocular processes. Monocular signals readily conveyed motions defined by local orientation and flicker. However, when left- and right-eye signals were displayed s...
متن کامل