Connectionist Pyramid Powered Perceptual Organization: Visual Grouping with Hierarchical Structures of Neural Networks
نویسنده
چکیده
This paper describes a new approach for organizing image data. Perceptual organization in biological vision is the process of pre-attentively grouping and structuring perceptual information into shapes and forms. In computer vision, processes like these are required in order to transform pixels into structural forms that can be used in higher-level interpretation. Pyramidal structures have been used for perceptual organization in computer vision. The tapering of the structure provides a basis for the \bringing together" of spatially separate image elements. Artiicial neu-ral networks are a means of a system learning by example rather than being pre-coded with the necessary rules. (A problem with performing perceptual organization with computer systems is that biological perceptual organization is not that well understood.) This study focussed on the grouping of line segments into longer lines. A multi-layer feed-forward neural network is successfully trained with backpropagation to perform grouping of line segments within a 2 2 neighbourhood at the bottom of a pyramid. Training at higher levels presents diiculties. The successful training at the base of the pyramid with these small neighbourhoods is encouraging. Better grouping behaviour can be anticipated based on the extra information in larger neighbourhoods.
منابع مشابه
A Compositionally Architecture for Perceptual Feature Grouping
We propose a compositionality architecture for perceptual organization which establishes a novel, generic, algorithmic framework for feature binding and condensation of semantic information in images. The underlying algorithmic ideas require a hierarchical structure for various types of objects and their groupings, which are guided by gestalt laws from psychology. A rich set of predefined featu...
متن کاملTopology-preserving perceptual segmentation using the Combinatorial Pyramid
Scene understanding and other high-level visual tasks usually rely on segmenting the captured images for dealing with a more efficient mid-level representation. Although this segmentation stage will consider topological constraints for the set of obtained regions (e.g., their internal connectivity), it is typical that the importance of preserving the topological relationships among regions will...
متن کاملVisual cortical mechanisms of perceptual grouping: interacting layers, networks, columns, and maps
The visual cortex has a laminar organization whose circuits form functional columns in cortical maps. How this laminar architecture supports visual percepts is not well understood. A neural model proposes how the laminar circuits of V1 and V2 generate perceptual groupings that maintain sensitivity to the contrasts and spatial organization of scenic cues. The model can decisively choose which gr...
متن کاملHierarchical Models of the Visual System
Hierarchical models of the visual system are neural networks with a layered topology: In these networks, the receptive fields of units (i.e., the region of the visual space that units respond to) at one level of the hierarchy are constructed by combining inputs from units at a lower level. After a few processing stages, small receptive fields tuned to simple stimuli get combined to form larger ...
متن کاملReinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks
The processing of a visual stimulus can be subdivided into a number of stages. Upon stimulus presentation there is an early phase of feedforward processing where the visual information is propagated from lower to higher visual areas for the extraction of basic and complex stimulus features. This is followed by a later phase where horizontal connections within areas and feedback connections from...
متن کامل