Dilated-Scale-Aware Category-Attention ConvNet for Multi-Class Object Counting

نویسندگان

چکیده

Object counting aims to estimate the number of objects in images. The leading approaches focus on single category task and achieve impressive performance. Note that there are multiple categories real scenes. Multi-class object expands scope application task. multi-target detection can multi-class some scenarios. However, it requires dataset annotated with bounding boxes. Compared point annotations mainstream issues, coordinate box-level more difficult obtain. In this paper, we propose a simple yet efficient network based point-level annotations. Specifically, first change traditional output channel from one multiclass counting. Since all use same feature extractor our proposed framework, their features will interfere mutually shared space. We further design multi-mask structure suppress harmful interaction among objects. Extensive experiments challenging benchmarks illustrate method achieves state-of-the-art

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-scale Location-aware Kernel Representation for Object Detection

Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit simple first-order representation of object proposals for final classification and regression. Recent classification methods demonstrate that the integration of highorder statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model w...

متن کامل

Multi-scale Cortical Keypoint Representation for Attention and Object Detection

Keypoints (junctions) provide important information for focus-of-attention (FoA) and object categorization/recognition. In this paper we analyze the multi-scale keypoint representation, obtained by applying a linear and quasi-continuous scaling to an optimized model of cortical end-stopped cells, in order to study its importance and possibilities for developing a visual, cortical architecture. ...

متن کامل

Visual Attention Models of Object Counting

We develop a sequential learning model using a recurrent neural network architecture and reinforcement learning to recognize and count objects in images. Simple feedforward neural networks perform well on this task when trained using backpropagation; however, convolutional neural networks are computationally expensive and results are less certain when the image input has imperfect resolution ou...

متن کامل

Large-Scale Category Structure Aware Image Categorization

Most previous research on image categorization has focused on medium-scale data sets, while large-scale image categorization with millions of images from thousands of categories remains a challenge. With the emergence of structured large-scale dataset such as the ImageNet, rich information about the conceptual relationships between images, such as a tree hierarchy among various image categories...

متن کامل

A New Class of Multi-Category Learning for Multiple Video Object Extraction

Video object (VO) extraction is of great importance in multimedia processing. In recent years approaches have been proposed to deal with VO extraction as a classification problem. This type of methods calls for state-of-the-art classifiers because the performance is directly related to the accuracy of classification. Promising results have been reported for single object extraction using Suppor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Signal Processing Letters

سال: 2021

ISSN: ['1558-2361', '1070-9908']

DOI: https://doi.org/10.1109/lsp.2021.3096119