Learning discriminative spatial representation for image classification
نویسندگان
چکیده
Spatial Pyramid Representation (SPR) [7] introduces spatial layout information to the orderless bag-of-features (BoF) representation. SPR has become the standard and has been shown to perform competitively against more complex methods for incorporating spatial layout. In SPR the image is divided into regular grids. However, the grids are taken as uniform spatial partitions without any theoretical motivation. In this paper, we address this issue and propose to learn the spatial partitioning with the BoF representation. We define a space of grids where each grid is obtained by a series of recursive axis aligned splits of cells. We cast the classification problem in a maximum margin formulation with the optimization being over the weight vector and the spatial grid. In addition to experiments on two challenging public datasets (Scene-15 and Pascal VOC 2007) showing that the learnt grids consistently perform better than the SPR while being much smaller in vector length, we also introduce a new dataset of human attributes and show that the current method is well suited to the recognition of spatially localized human attributes.
منابع مشابه
Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning
Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...
متن کاملImage Classification via Sparse Representation and Subspace Alignment
Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...
متن کاملJoint sparse model-based discriminative K-SVD for hyperspectral image classification
Sparse representation classification (SRC) is being widely investigated on hyperspectral images (HSI). For SRC methods to achieve high classification performance, not only is the development of sparse representation models essential, the designing and learning of quality dictionaries also plays an important role. That is, a redundant dictionary with well-designated atoms is required in order to...
متن کاملیادگیری واژه نامه برای آشکارسازی محل پلاک خودرو
Car license plate detection has been always a challenging task in the context of traffic control and traffic offenses. In this paper, the problem of license plate detection from gray scale images taken in natural conditions is addressed. Our car plate database consists of images with severe imaging conditions such as low quality images, far distanced cameras, and severe weather conditions. The ...
متن کاملDiscriminative and Compact Dictionary Design for Hyperspectral Image Classification using Learning VQ Framework Sparse representation provides an efficient description for high-dimensional Hyperspectral Imagery
Discriminative and Compact Dictionary Design for Hyperspectral Image Classification using Learning VQ Framework Report Title Sparse representation provides an efficient description for high-dimensional Hyperspectral Imagery (HSI) and also encodes discriminative information useful for classification. However, due to the large size of typical HSI images, the naive way to construct a dictionary wi...
متن کاملCompact, Adaptive and Discriminative Spatial Pyramid for Improved Scene and Object Classification
The release of challenging datasets with a vast number of images, requires the development of efficient image representations and algorithms which are able to manipulate these largescale datasets efficiently. Nowadays the Bag-of-Words (BoW) based image representation is the most successful approach in the context of object and scene classification tasks. However, its main drawback is the absenc...
متن کامل