Determining Linear Shape Change: Toward Automatic Generation of Object Recognition Programs

Authors

  • Katsushi Ikeuchi
  • Ki Sang Hong
Abstract

A 3D object localization task may be divided into two steps. First, an object appearance is classified into one of the topologically equivalent classes of 3D object appearances, referred to as aspects of the object (aspect classification). Then, the precise attitude and position of the object are determined within that aspect (linear shape change determination, LSCD). We have been designing a compiler that automatically generates an object localization program from a given object and sensor model; the compiler adheres to this two-step strategy of aspect classification and linear shape change determination. For the first step, the compiler generates a program structured as a decision tree. Each branch node of the tree represents a necessary decision, such as comparing the area of a visible face, that classifies an object appearance into a smaller group of aspects. Following this tree, an object appearance is classified into one particular aspect at a leaf node. This paper investigates the design of the compiler component that generates the second step. The compiler extends each leaf node of the tree and connects several nodes to it so that the program performs the LSCD. Among the faces visible in the aspect corresponding to a leaf node, the compiler chooses the largest 3D face as the primal face. The compiler has priority rules that select one particular method, out of several possible ones, for defining a face coordinate system on a primal face. Using these rules, it analyzes the primal face, defines the face coordinate system on it, and registers the defining method at a node connected to the leaf node. The compiler also embodies the transformation from the face coordinate system to the body coordinate system at that node. To increase the accuracy of the attitude and position, the compiler adds two more nodes to each branch of the program. The first node establishes correspondences between model edges and image edges. The second node iteratively solves the transformation equation to determine the object's attitude and position using these correspondences. We have prepared a program library, which is a collection of prototypical objects that perform the tasks mentioned above. In compile mode, the compiler retrieves the necessary strategies defined at each node, finds the proper prototypical objects in the program library, and instantiates and inserts these objects into the node. In run mode, these instance objects run and determine …
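As an illustration only, the structure described in the abstract (a decision tree whose branch nodes test a feature and whose leaf nodes name an aspect, followed by selection of the largest visible 3D face as the primal face) can be sketched in Python. The class names, the feature name `largest_region_area`, the threshold, and the face areas are all hypothetical and do not come from the paper; the sketch covers only aspect classification and primal-face selection, not the coordinate-system definition or the iterative pose refinement.

```python
from dataclasses import dataclass
from typing import Dict, Union


@dataclass
class Leaf:
    """Leaf node: one aspect, with the 3D areas of the model faces visible in it."""
    aspect: str
    visible_faces: Dict[str, float]  # face name -> 3D area (hypothetical values)

    def primal_face(self) -> str:
        # The abstract states that the largest 3D face is chosen as the primal face.
        return max(self.visible_faces, key=self.visible_faces.get)


@dataclass
class Branch:
    """Branch node: one decision, here a threshold test on a measured feature."""
    feature: str
    threshold: float
    below: Union["Branch", Leaf]
    at_or_above: Union["Branch", Leaf]


def classify(node: Union[Branch, Leaf], appearance: Dict[str, float]) -> Leaf:
    """Walk the tree from the root until a leaf (a single aspect) is reached."""
    while isinstance(node, Branch):
        if appearance[node.feature] < node.threshold:
            node = node.below
        else:
            node = node.at_or_above
    return node


# Hypothetical two-aspect tree for demonstration.
tree = Branch(
    feature="largest_region_area",
    threshold=50.0,
    below=Leaf("aspect_A", {"side": 12.0, "end": 4.0}),
    at_or_above=Leaf("aspect_B", {"top": 30.0, "side": 12.0}),
)

leaf = classify(tree, {"largest_region_area": 72.5})
print(leaf.aspect, leaf.primal_face())  # prints "aspect_B top"
```

In the scheme the abstract describes, further nodes attached to each leaf would then define the face coordinate system on the primal face and refine the pose; those steps are left out here because the abstract does not give enough detail to reproduce them.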

Similar resources

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems through natural man-machine interaction. Since speech is the most common means of communication, recognizing human emotions from the speech signal has become a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

Learning to Recognize Objects - Toward Automatic Calibration of Color Vision for Sony Robots

Color detection can be seriously affected by lighting conditions and other variations in the environment. Robot vision systems need to be recalibrated as lighting conditions change; otherwise they fail to recognize objects or classify them incorrectly. This paper describes experiments toward object recognition under different lighting conditions. We propose to train the vision system to re...

Recognition and Reconstruction of Partially Occluded Objects

A new automatic system for the recognition and reconstruction of rescaled and/or rotated partially occluded objects is presented. The objects to be recognized are described by 2D views and each view is occluded by several half-planes. The whole object views and their visible parts (linear cuts) are then stored in a database. To establish if a region R of an input image represents an object poss...

3D Models Recognition in Fourier Domain Using Compression of the Spherical Mesh up to the Models Surface

Representing 3D models in diverse fields has paved the way for storing, indexing, classifying, and retrieving 3D objects. Classification and retrieval of 3D models demand that the models be represented in a way that captures the local and global shape specifications of the object. This requires establishing a 3D descriptor or signature that summarizes the pivotal shape properties of th...

Appearance-Based Vision and the Automatic Generation of Object Recognition Programs

The generation of recognition programs by hand is a time-consuming, labor-intensive task that typically results in a special-purpose program for the recognition of a single object or a small set of objects. Recent work in automatic code generation has demonstrated the feasibility of automatically generating object recognition programs from CAD-based descriptions of objects. Many of the programs w...

Publication date: 1991