Functional Object Class Detection Based on Learned Affordance Cues
Abstract
Current approaches to visual object class detection mainly focus on the recognition of basic-level categories, such as cars, motorbikes, mugs and bottles. Although these approaches have demonstrated impressive recognition performance, their restriction to these categories seems inadequate in the context of embodied, cognitive agents. Here, distinguishing objects according to functional aspects based on object affordances is important in order to enable manipulation of, and interaction between, physical objects and the cognitive agent. In this paper, we propose a system for the detection of functional object classes, based on a representation of visually distinct hints on object affordances (affordance cues). It spans the complete range from tutor-driven acquisition of affordance cues, through learning of corresponding object models, to detecting novel instances of functional object classes in real images.
Similar Resources
Affordance Prediction via Learned Object Attributes
We present a novel method for learning and predicting the affordances of an object based on its physical and visual attributes. Affordance prediction is a key task in autonomous robot learning, as it allows a robot to reason about the actions it can perform in order to accomplish its goals. Previous approaches to affordance prediction have either learned direct mappings from visual features to ...
Aspect Transition Graph: an Affordance-Based Model
In this work we introduce the Aspect Transition Graph (ATG), an affordance-based model that is grounded in the robot’s own actions and perceptions. An ATG summarizes how observations of an object or the environment change in the course of interaction. Through the Robonaut 2 simulator, we demonstrate that by exploiting these learned models the robot can recognize objects and manipulate them to ...
A Novel Face Detection Method Based on Over-complete Incoherent Dictionary Learning
In this paper, the face detection problem is considered using concepts from compressive sensing. This technique includes a dictionary learning procedure and a sparse coding method to represent the structural content of input images. In the proposed method, dictionaries are learned in such a way that the trained models have the least degree of coherence with each other. The novelty of the prop...
Change Detection Gamasiab River Margins in Kermanshah by Comparison Pixel Base and Object Orientd Algorithms
Introduction: Land use reflects the interactive characteristics of humans and the environment and describes how human exploitation works for one or more targets on the ground. Land use is usually defined on the basis of human use of the land, with an emphasis on the functional role of land in economic activities. Land use, which is associated with human activity, is undergoing change over time....
Affordance based Part Recognition for Grasping and Manipulation
Affordances (unit utility, functional and topological relationships) and semantic scene understanding are key to building a generic, scalable and cognitive architecture for visual perception. ‘Affordance based object recognition’ or recognition based on affordance features is an important step in this regard. In this paper, we extend the scope of affordance features to define ‘Conceptual Equiva...