نتایج جستجو برای: spatial pyramid match kernel

تعداد نتایج: 466241  

2013
Jun Zhu Quan Zhou Weijia Zou Rui Zhang Wenjun Zhang

Human action recognition is an increasingly important research topic in the fields of video sensing, analysis and understanding. Caused by unconstrained sensing conditions, there exist large intra-class variations and inter-class ambiguities in realistic videos, which hinder the improvement of recognition performance for recent vision-based action recognition systems. In this paper, we propose ...

Journal: :Image Vision Comput. 2014
Carolina Redondo-Cabrera Roberto Javier López-Sastre Francisco Javier Acevedo-Rodríguez Saturnino Maldonado-Bascón

This paper proposes a novel approach to recognize object and scene categories in depth images. We introduce a Bag of Words (BoW) representation in 3D, the Selective 3D Spatial Pyramid Matching Kernel (3DSPMK). It starts quantizing 3D local descriptors, computed from point clouds, to build a vocabulary of 3D visual words. This codebook is used to build the 3DSPMK, which starts partitioning a wor...

2015
Xiao Li Zhanbo Li Haibo Pang

In the recent digital age, support vector machines (SVMs) that use a spatial pyramid matching (SPM) kernel have been around the globe for image classification. Although this is popular, there exists many problems in its use; for examples, nonlinear SVMs have a high complexity in training and testing. Applying the algorithms to big datasets, which holds many images, greater than a thousand is a ...

2014
Yi Zhang Jian Qin Fanglin Chen Dewen Hu

This working note describes the method of the NUDT team for scene classification and object recognition in the ImageCLEF 2014 Robot Vision Challenge. The method is composed of two steps: 1. spatial pyramid match (SPM) and a Pyramid of HOG (Histograms of Oriented Gradient) are incorporated to represent an indoor place image. 2. a multiclass SVM (Support Vector Machine) is utilized to classify an...

2007
Tom Yeh John Lee Trevor Darrell

Histogram pyramid representations computed from a vocabulary tree of visual words have proven valuable for a range of image indexing and recognition tasks; however, they have only used a single, fixed partition of feature space. We present a new efficient algorithm to incrementally compute set-of-trees (forest) vocabulary representations, and show they improve recognition and indexing performan...

Journal: :EURASIP J. Image and Video Processing 2017
Fatemeh Zamani Mansour Jamzad

Real-world image classification, which aims to determine the semantic class of un-labeled images, is a challenging task. In this paper, we focus on two challenges of image classification and propose a method to address both of them simultaneously. The first challenge is that representing images by heterogeneous features, such as color, shape and texture, helps to provide better classification a...

2007
Kristen Grauman

It is often useful to represent a single example by a set of the local features that comprise it. However, this representation poses a challenge to many conventional learning techniques, since sets may vary in cardinality and the elements are unordered. To compare sets of features, researchers often resort to solving for the least-cost correspondences, but this is computationally expensive and ...

2017
Chaojie Mao Yingming Li Zhongfei Zhang Yaqing Zhang Xi Li

In this work, we present a deep convolutional pyramid person matching network (PPMN) with specially designed Pyramid Matching Module to address the problem of person reidentification. The architecture takes a pair of RGB images as input, and outputs a similiarity value indicating whether the two input images represent the same person or not. Based on deep convolutional neural networks, our appr...

Journal: :CoRR 2016
Andrew Shin Masataka Yamaguchi Katsunori Ohnishi Tatsuya Harada

The workflow of extracting features from images using convolutional neural networks (CNN) and generating captions with recurrent neural networks (RNN) has become a de-facto standard for image captioning task. However, since CNN features are originally designed for classification task, it is mostly concerned with the main conspicuous element of the image, and often fails to correctly convey info...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید