Visual scenes are categorized by function.

Authors

  • Michelle R Greene
  • Christopher Baldassano
  • Andre Esteva
  • Diane M Beck
  • Li Fei-Fei
Abstract

How do we know that a kitchen is a kitchen by looking? Traditional models posit that scene categorization is achieved through recognizing necessary and sufficient features and objects, yet there is little consensus about what these may be. However, scene categories should reflect how we use visual information. Therefore, we test the hypothesis that scene categories reflect functions, or the possibilities for actions within a scene. Our approach is to compare human categorization patterns with predictions made by both functions and alternative models. We collected a large-scale scene category distance matrix (5 million trials) by asking observers to simply decide whether 2 images were from the same or different categories. Using the actions from the American Time Use Survey, we mapped actions onto each scene (1.4 million trials). We found a strong relationship between ranked category distance and functional distance (r = .50, or 66% of the maximum possible correlation). The function model outperformed alternative models of object-based distance (r = .33), visual features from a convolutional neural network (r = .39), lexical distance (r = .27), and models of visual features. Using hierarchical linear regression, we found that functions captured 85.5% of overall explained variance, with nearly half of the explained variance captured only by functions, implying that the predictive power of alternative models was because of their shared variance with the function-based model. These results challenge the dominant school of thought that visual features and objects are sufficient for scene categorization, suggesting instead that a scene's category may be determined by the scene's function.
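The model comparison described in the abstract can be illustrated with a minimal sketch. It assumes that the human data and each candidate model are expressed as symmetric category-by-category distance matrices, and that they are compared by rank correlation over the unique category pairs. The function names and the toy matrices are hypothetical and are not taken from the paper; the authors' actual analysis pipeline may differ.

```python
import numpy as np


def upper_triangle(distances):
    """Return the unique off-diagonal entries of a symmetric distance matrix."""
    distances = np.asarray(distances, dtype=float)
    rows, cols = np.triu_indices(distances.shape[0], k=1)
    return distances[rows, cols]


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    values = np.asarray(values, dtype=float)
    order = np.argsort(values, kind="stable")
    ranks = np.empty(values.size, dtype=float)
    ranks[order] = np.arange(1, values.size + 1)
    # Replace the ranks of tied values with their average.
    _, inverse, counts = np.unique(values, return_inverse=True, return_counts=True)
    rank_sums = np.bincount(inverse, weights=ranks)
    return (rank_sums / counts)[inverse]


def rank_correlation(human_distances, model_distances):
    """Pearson correlation of the ranked pairwise distances (Spearman-style)."""
    human_ranks = average_ranks(upper_triangle(human_distances))
    model_ranks = average_ranks(upper_triangle(model_distances))
    return float(np.corrcoef(human_ranks, model_ranks)[0, 1])


# Toy 3-category example (illustrative values only, not data from the study).
human = np.array([[0, 1, 4],
                  [1, 0, 2],
                  [4, 2, 0]])
model = np.array([[0, 2, 9],
                  [2, 0, 3],
                  [9, 3, 0]])

print(rank_correlation(human, model))
```

Working on ranks rather than raw distances means a model is rewarded for ordering category pairs the same way observers do, regardless of the scale of its distance measure.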

Similar resources

Online multiple people tracking-by-detection in crowded scenes

Multiple people detection and tracking is a challenging task in real-world crowded scenes. In this paper, we have presented an online multiple people tracking-by-detection approach with a single camera. We have detected objects with deformable part models and a visual background extractor. In the tracking phase we have used a combination of support vector machine (SVM) person-specific classifie...

Spatial and Chromatic Filters Derived from an Information-theoretic Analysis of Natural Scenes

Neurons in the early stages of visual processing should represent the statistical properties of natural scenes efficiently. In previous studies, independent component analysis (ICA) was applied to images based on monochromatic or trichromatic cone arrays (Bell & Sejnowski, 1997; Lee et al., 1999). Here, we compare the results of this approach to physiological data based on the experimental cond...

Modulation of early ERPs by accurate categorization of objects in scenes.

The categorization of objects within natural scenes is carried out in a sequence of stages, which may build on the detection of perceptual regularities in the visual appearance of objects or may represent a more semantic level of categorization. Here, we examined the neural correlates of correct categorization of objects in scenes, using natural scenes which were equalized in color and spectral...

Nonhomogeneous resolution of images of natural scenes.

The aim of this research is to model and simulate the loss of visual resolution as a function of retinal eccentricity in the perception of natural scenes. The model of visual resolution is based on a space-variant low-pass filter, having a variable convolution kernel according to retinal eccentricity. The parameters of the model are computed from psychophysical measures of visual acuity as a fu...

The neural basis of perceiving person interactions.

This study examined whether the grouping of people into meaningful social scenes (e.g., two people having a chat) impacts the basic perceptual analysis of each partaking individual. To explore this issue, we measured neural activity using functional magnetic resonance imaging (fMRI) while participants sex-categorized congruent as well as incongruent person dyads (i.e., two people interacting in...

Evaluation of Students' Vision Screening Test by School Nurse

ABSTRACT Health screening of students, such as visual screening tests, is usually carried out by school nurses in many rural schools in Iran. Because students are still growing, their visual acuity may be affected by growth. A study was carried out to evaluate the vision screening test on students in Hamadan, Iran. A sample of 878 pupils was examined by a school nurse using an E-chart. According to the test, chil...

Journal:
  • Journal of experimental psychology. General

Volume 145, Issue 1

Pages: -

Publication date: 2016