Visual Routines and Visual Search: A Real-Time Implementation and an Automata-Theoretic Analysis
نویسنده
چکیده
I describe a real-time implementation of Ull-man's visual routine processor (VRP) theory of intermediate vision for visual search. The system performs serial self-terminating visual search and computes 2D spatial relations of objects from live color video using low cost hardware. I present a formal model of a VRP with unbounded resources and quantify the amount of external control structure required to solve Horn clauses using the VRP. In discussing the effect of resource limitations I show that contemporary models of biological visual attention are unable to solve surprisingly simple queries. I also describe a novel logic programming system that finds satisfying variable assignments for Horn clause queries using the VRP. The system contains no internal database: all logic variables are directly grounded in the world using VRP queries. Finally, I briefly discuss experiments with natural language interpretation and motor control using the VRP. Experiments on real data are given. 1 1 Introduction Shimon Ullman proposed the visual routines theory of intermediate vision as a way of explaining how the human visual system might solve certain visual tasks (such as computing spatial relations) that seem to require serial processing [Ullman, 1984]. At a gross level, the theory proposes that the visual system contains a set of registers that can contain different types of visual data, a means of focusing visual attention on task-relevant portions of the image, and a set of primitive "instructions," such as coloring and line drawing, that can be combined like instructions in a computer program to compute useful Bryson kindly read drafts of this paper and provided useful comments. properties of an image. These resources are collectively referred to as the visual routine processor or VRP. The visual routines theory have received increasing attention from the AI community in recent years Much of this attention has come from the reactive reasoning and planning community, in part because Agre and Chapman's implementation of the visual routines model provides an alternative interface between reasoning and perception, which is both easier to implement and more biologically plausible than standard database-like interfaces. Despite the level of interest, there has yet to be a VRP implementation that runs on real camera images. To date, VRP systems have run either off of hand-drawn bitmaps [Romanycia, 1987] or have been directly interfaced to the world model of a world simulator, thus bypassing low-level vision entirely [Agre and Chapman, 1987][Chapman, 1990][R eece and Shafer, 1991]. …
منابع مشابه
A Novel Approach to Background Subtraction Using Visual Saliency Map
Generally human vision system searches for salient regions and movements in video scenes to lessen the search space and effort. Using visual saliency map for modelling gives important information for understanding in many applications. In this paper we present a simple method with low computation load using visual saliency map for background subtraction in video stream. The proposed technique i...
متن کاملSimulation of Position Based Visual Control and Performance Tests of 6R Robot
This paper presents simulation and experimental results of position-based visual servoing control process of a 6R robot using 2 fixed cameras. This method has the ability to deal with real time changes in the relative position of the target-object with respect to robot. Also, greater accuracy and independency of servo control structure from the target pose coordinates are the additional advanta...
متن کاملIdentification of the underlying factors affecting information seeking behavior of users interacting with the visual search option in EBSCO: a grounded theory study
Background and Aim: Information seeking is interactive behavior of searcher with information systems and this active interaction occurs in a real environment known as background or context. This study investigated the factors influencing the formation of layers of context and their impact on the interaction of the user with search option dialoge in EBSCO database. Method: Data from 28 semi-stru...
متن کاملVisual routines and visual search : a real - time implementation and anautomata -
I describe a real-time implementation of Ull-man's visual routine processor (VRP) theory of intermediate vision for visual search. The system performs serial self-terminating visual search and computes 2D spatial relations of objects from live color video using low cost hardware. I present a formal model of a VRP with unbounded resources and quantify the amount of external control structure req...
متن کاملIntermediate Vision: Architecture, Implementation, and Use
This article describes an implemented architecture for intermediate vision. By integrating a variety of intermediate visual mechanisms and putting them to use in support of concrete activity, the implementation demonstrates their utility. The sytem, SIVS, models psychophysical discoveries about visual attention and search. It is designed to be efficiently implementable in slow, massively parall...
متن کامل