Describing Human Aesthetic Perception by Deeply-learned Attributes from Flickr

نویسنده

  • L. Zhang
چکیده

Many aesthetic models in computer vision suffer from two shortcomings: 1) the low descriptiveness and interpretability of those hand-crafted aesthetic criteria (i.e., nonindicative of region-level aesthetics), and 2) the difficulty of engineering aesthetic features adaptively and automatically toward different image sets. To remedy these problems, we develop a deep architecture to learn aesthetically-relevant visual attributes from Flickr1, which are localized by multiple textual attributes in a weakly-supervised setting. More specifically, using a bag-ofwords (BoW) representation of the frequent Flickr image tags, a sparsity-constrained subspace algorithm discovers a compact set of textual attributes (e.g., landscape and sunset) for each image. Then, a weakly-supervised learning algorithm projects the textual attributes at image-level to the highly-responsive image patches at pixel-level. These patches indicate where humans look at appealing regions with respect to each textual attribute, which are employed to learn the visual attributes. Psychological and anatomical studies have shown that humans perceive visual concepts hierarchically. Hence, we normalize these patches and feed them into a five-layer convolutional neural network (CNN) to mimick the hierarchy of human perceiving the visual attributes. We apply the learned deep features on image retargeting, aesthetics ranking, and retrieval. Both subjective and objective experimental results thoroughly demonstrate the competitiveness of our approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Explanation of Harry Broudy’s View with Respect to Aesthetic Education and its Link to Education via Pedagogical Theater

Pedagogical theater with a continuous process and with an emphasis on the simple learning of various concepts and lessons assists to grow and thus enhance individual and group behavior in society. The process of performing the exercises encourages the talent and creativity of the participants in learning and ensures their active participation. This study aims to establish a link between the ide...

متن کامل

User-Generated Collection Level Metadata in an Online Photo- sharing System

Photoset and Group descriptions in Flickr, a large-scale online photo-sharing system, offer insight into the collection description and collection building practices of Flickr users. Photosets, assembled by individual users, appear to evolve from bottom-up, derived from the components of the individual users’ context to evolve from the bottom up, and are based on selected attributes which a par...

متن کامل

Analysis of The Relationship Between Theoretical Aesthetic Ideas And Modern- Postmodern Architectural Styles; (A Comparative Study Of Modern And Postmodern Architecture)ِِِ

Physical attributes have always been a qualitative indicator for evaluating an architectural work. These character influenced by function, technology and changing the process of creation and perception of beauty in modern times; and influenced by content, culture, history, meaning and symbolic linguistic structures in the postmodern era. In accordance with the evolution of aesthetic theories si...

متن کامل

Learning Photography Aesthetics with Deep CNNs

Automatic photo aesthetic assessment is a challenging arti cial intelligence task. Existing computational approaches have focused on modeling a single aesthetic score or class (good or bad photo), however these do not provide any details on why the photograph is good or bad; or which attributes contribute to the quality of the photograph. To obtain both accuracy and human-interpretability, we a...

متن کامل

Learning beautiful (and ugly) attributes

Current approaches to aesthetic image analysis either provide accurate or interpretable results. To get both accuracy and interpretability, we advocate the use of learned visual attributes as mid-level features. For this purpose, we propose to discover and learn the visual appearance of attributes automatically, using the recently introduced AVA database which contains more than 250,000 images ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1605.07699  شماره 

صفحات  -

تاریخ انتشار 2016