Towards a Fusion of Region-based and Saliency-based Models

Authors

Trong-Tôn Pham

Nicolas Eric Maillot

Abstract

This thesis addresses automatic image annotation (AIA) for image indexing and retrieval in an Annotation Based Image Retrieval (ABIR) system. Specifically, we study different models of image representation used in AIA. To our knowledge, no one has tried to combine the following two approaches to image representation: the region-based approach and the saliency-based approach. We expect this combination to yield a model that captures both the global information and the details of objects. The proposed approach is composed of three main stages. The first, the image processing stage, consists of building an image representation and extracting visual features from the image entities. Image representation is driven by the segmentation process and the keypoint detection algorithm. Each region or keypoint is associated with a set of low-level features (color, histogram, texture, spatial location and invariant local features). The second stage consists of learning the relationship between semantics and visual features in images. A machine learning algorithm is used to cluster the visual features into visterms, which form an intermediate representation between high-level semantics and low-level features. The learning phase results in a co-occurrence matrix of words and visterms. The fusion of different models is expressed as the fusion of their corresponding word-by-visterm matrices. The high dimension of the resulting matrix makes the matching process more expensive. This cost can be reduced by using dimensionality reduction methods (e.g. Latent Semantic Analysis). The last stage consists of the automatic annotation propagation scheme for a new image. The new image is quantized in terms of visterm frequencies. The list of propagated words is then ranked by the cosine similarity between the visterms extracted from the image and the visterms associated with each word.
Experiments conducted on Corel image datasets containing 5000 images show good annotation performance and demonstrate the improvement of the fusion models over the Translation model.

Keywords: Automatic Image Annotation, Machine Learning, Image Processing.
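The learning and propagation stages described in the abstract can be illustrated with a small sketch. It is not the thesis implementation: it assumes the visual features have already been clustered into integer visterm ids, it assumes that fusing two models means concatenating the columns of their word-by-visterm matrices (the abstract does not say how the matrices are fused), and it leaves out the Latent Semantic Analysis step. All function names and the toy data are hypothetical.

```python
import math
from collections import Counter


def cooccurrence(training, vocab, n_visterms):
    """Build a word-by-visterm count matrix (rows follow vocab order)."""
    row = {word: i for i, word in enumerate(vocab)}
    matrix = [[0.0] * n_visterms for _ in vocab]
    for words, visterms in training:
        counts = Counter(visterms)
        for word in set(words):
            for visterm, count in counts.items():
                matrix[row[word]][visterm] += count
    return matrix


def fuse(region_matrix, saliency_matrix):
    """Assumed fusion rule: concatenate the two models' columns per word."""
    return [r + s for r, s in zip(region_matrix, saliency_matrix)]


def histogram(visterms, n_visterms):
    """Quantize an image as a vector of visterm frequencies."""
    counts = Counter(visterms)
    return [float(counts.get(v, 0)) for v in range(n_visterms)]


def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def annotate(image_vector, matrix, vocab, top_k=2):
    """Rank words by cosine similarity to the image and keep the top_k."""
    scored = [(cosine(image_vector, r), w) for r, w in zip(matrix, vocab)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [word for _, word in scored[:top_k]]


# Toy training data: (annotation words, visterm ids), one list per model.
vocab = ["grass", "sky", "tiger"]
region_train = [(["tiger", "grass"], [0, 0, 1]),
                (["sky"], [2, 2]),
                (["grass", "sky"], [1, 2])]
saliency_train = [(["tiger", "grass"], [0, 0]),
                  (["sky"], [1]),
                  (["grass", "sky"], [1, 1])]

region_matrix = cooccurrence(region_train, vocab, n_visterms=3)
saliency_matrix = cooccurrence(saliency_train, vocab, n_visterms=2)
fused_matrix = fuse(region_matrix, saliency_matrix)

# A new image is described by both models, then matched against the words.
new_image = histogram([0, 0, 1], 3) + histogram([0], 2)
predicted = annotate(new_image, fused_matrix, vocab)
print(predicted)
```

Concatenation keeps each model's visterms in separate columns, so the region-based and saliency-based evidence both contribute to a single cosine score. A dimensionality reduction step such as LSA would be applied to `fused_matrix` before matching.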


Similar resources

Reduced-Reference Image Quality Assessment based on saliency region extraction

In this paper, a novel saliency-theory-based RR-IQA metric is introduced. As the human visual system is sensitive to the salient region, evaluating the image quality based on the salient region could increase the accuracy of the algorithm. In order to extract the salient regions, we use a blob decomposition (BD) tool as a texture component descriptor. A new method for blob decomposition is propos...


Compressed-Sampling-Based Image Saliency Detection in the Wavelet Domain

When watching natural scenes, an overwhelming amount of information is delivered to the Human Visual System (HVS). The optic nerve is estimated to receive around 10^8 bits of information a second. This large amount of information can't be processed right away by our neural system. The visual attention mechanism enables the HVS to spend neural resources efficiently, only on the selected parts of the...


Just Noticeable Difference Estimation Using Visual Saliency in Images

Due to some physiological and physical limitations in the brain and the eye, the human visual system (HVS) is unable to perceive changes in the visual signal whose range is lower than a certain threshold, the so-called just-noticeable distortion (JND) threshold. Visual attention (VA) provides a mechanism for selection of particular aspects of a visual scene so as to reduce the computational loa...


A Saliency Detection Model via Fusing Extracted Low-level and High-level Features from an Image

Salient regions attract more human attention than other regions in an image. Low-level and high-level features are utilized in salient region detection. Low-level features contain primitive information such as color or texture while high-level features usually consider visual systems. Recently, some salient region detection methods have been proposed based on only low-level features or hig...


Graph-based Visual Saliency Model using Background Color

Visual saliency is a cognitive psychology concept that makes some stimuli of a scene stand out relative to their neighbors and attract our attention. Computing visual saliency is a topic of recent interest. Here, we propose a graph-based method for saliency detection, which contains three stages: pre-processing, initial saliency detection and final saliency detection. The initial saliency map i...


Saliency Detection via Combining Region-Level and Pixel-Level Predictions with CNNs

This paper proposes a novel saliency detection method by combining region-level saliency estimation and pixel-level saliency prediction with CNNs (denoted as CRPSD). For pixel-level saliency prediction, a fully convolutional neural network (called pixel-level CNN) is constructed by modifying the VGGNet architecture to perform multiscale feature learning, based on which an image-to-image predict...




Publication date: 2006
