Recognizing Open-Vocabulary Relations between Objects in Images

نویسندگان

  • Masayasu Muraoka
  • Sumit Maharjan
  • Masaki Saito
  • Kota Yamaguchi
  • Naoaki Okazaki
  • Takayuki Okatani
  • Kentaro Inui
چکیده

How can we describe the relations between objects in a picture? As recent deep neural networks have exhibited impressive performance in identifying individual entities in a picture, in this study we turn our attention to recognize inter-object relations. To recognize open-domain relations, (a) we propose collecting relational concepts automatically from an image-text corpus. In addition, using collected relational instances, (b) we train a classifier to recognize inter-object relations. A relation recognition experiment conducted in our study suggests that relative information calculated from objects improves relation recognition effectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognizing 3D Objects by Using Models Learned Automatically from 2D Training Images

A scheme for learning and recognizing 3D objects from their 2D views is presented. The scheme proceeds in two stages. In the rst stage, we try to learn a model automatically from 2D training images of di erent objects which belong to the same object class and consequently have similar shape of parts and similar adjacency relations between the parts. In the second stage, the generated model is u...

متن کامل

Object name learning and object perception: a deficit in late talkers.

Two experiments examined the relation between early object name learning and the ability to represent objects by their abstract shapes. In Experiment 1, two-year-old children with productive vocabularies in the bottom 20th percentile--'late talkers'--were compared with (1) same-age children with larger vocabularies, and (2) younger children matched for productive vocabulary, on their ability to...

متن کامل

مدل‌سازی روابط توپولوژیک سه بعدی فازی در محیط GIS

Nowadays, geospatial information systems (GIS) are widely used to solve different spatial problems based on various types of fundamental data: spatial, temporal, attribute and topological relations. Topological relations are the most important part of GIS which distinguish it from the other kinds of information technologies. One of the important mechanisms for representing topological relations...

متن کامل

The Identification of Index Terms in Natural Language Object Descriptions

"The flowering part, it looks like someone is sticking their tongue out" (a subject's description of Arethusa bulbosa, see Figure 1). The mechanisms that people use in natural settings to describe objects to one another can be used to inform the design of image retrieval and museum systems. The image retrieval problem may be recast as an object description problem where the images are of object...

متن کامل

Query-Adaptive R-CNN for Open-Vocabulary Object Detection and Retrieval

We address the problem of open-vocabulary object retrieval and localization, which is to retrieve and localize objects from a very large-scale image database immediately by a textual query (e.g., a word or phrase). We first propose Query-Adaptive R-CNN, a simple yet strong framework for open-vocabulary object detection. Query-Adaptive RCNN is a simple extension of Faster R-CNN from closedvocabu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016