Coarse2Fine: a two-stage training method for fine-grained visual classification
نویسندگان
چکیده
Small inter-class and large intra-class variations are the key challenges in fine-grained visual classification. Objects from different classes share visually similar structures, objects same class can have poses viewpoints. Therefore, proper extraction of discriminative local features (e.g., bird’s beak or car’s headlight) is crucial. Most recent successes on this problem based upon attention models which localize attend parts. In work, we propose a training method for networks, Coarse2Fine, creates differentiable path attended feature maps to input space. Coarse2Fine learns an inverse mapping function informative regions raw image, will guide better features. Besides, initialization weights. Our experiments show that reduces classification error by up 5.1% common datasets.
منابع مشابه
Fine-Grained Visual Classification of Aircraft
This paper introduces FGVC-Aircraft, a new dataset containing 10,000 images of aircraft spanning 100 aircraft models, organised in a three-level hierarchy. At the finer level, differences between models are often subtle but always visually measurable, making visual recognition challenging but possible. A benchmark is obtained by defining corresponding classification tasks and evaluation protoco...
متن کاملA Two-Layer Local Constrained Sparse Coding Method for Fine-Grained Visual Categorization
Fine-grained categories are more difficulty distinguished than generic categories due to the similarity of inter-class and the diversity of intra-class. Therefore, the fine-grained visual categorization (FGVC) is considered as one of challenge problems in computer vision recently. A new feature learning framework, which is based on a two-layer local constrained sparse coding architecture, is pr...
متن کاملImproving Fine-Grained Visual Classification using Pairwise Confusion
Fine-Grained Visual Classification (FGVC) datasets contain small sample sizes, along with significant intra-class variation and interclass similarity. While prior work has addressed intra-class variation using localization and segmentation techniques, inter-class similarity may also affect feature learning and reduce classification performance. In this work, we address this problem using a nove...
متن کاملIs Fine Grained Classification Different?
We performed experiments on two fine-grained classification tasks using a state-of-the-art pipeline (descriptor + dictionary + LLC encoding + max pooling + linear SVM). We found that this standard pipeline out-performed a dictionary-free classification technique (stacked evidence trees) that was specifically designed for fine-grained classification. The success of the method depends on two fact...
متن کاملBilinear CNNs for Fine-grained Visual Recognition
We present a simple and effective architecture for fine-grained visual recognition called Bilinear Convolutional Neural Networks (B-CNNs). These networks represent an image as a pooled outer product of features derived from two CNNs and capture localized feature interactions in a translationally invariant manner. B-CNNs belong to the class of orderless texture representations but unlike prior w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Machine Vision and Applications
سال: 2021
ISSN: ['1432-1769', '0932-8092']
DOI: https://doi.org/10.1007/s00138-021-01180-y