Weakly supervised foreground learning for weakly supervised localization and detection

نویسندگان

چکیده

Modern deep learning models require large amounts of accurately annotated data, which is often difficult to satisfy. Hence, weakly supervised tasks, including object localization (WSOL) and detection (WSOD), have recently received attention in the computer vision community. In this paper, we motivate propose foreground (WSFL) task by showing that both WSOL WSOD can be greatly improved if groundtruth masks are available. More importantly, a complete WSFL pipeline with low computational cost, generates pseudo boxes, learns masks, does not need any annotations. With help predicted our model, achieve 74.37% correct accuracy on CUB for WSOL, 55.7% mean average precision VOC07 WSOD, thereby establish new state-of-the-art tasks. Our model also shows excellent transfer ability.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Collaborative Learning for Weakly Supervised Object Detection

Weakly supervised object detection has recently received much attention, since it only requires imagelevel labels instead of the bounding-box labels consumed in strongly supervised learning. Nevertheless, the save in labeling expense is usually at the cost of model accuracy. In this paper, we propose a simple but effective weakly supervised collaborative learning framework to resolve this probl...

متن کامل

Weakly Supervised Learning for Salient Object Detection

Recent advances of supervised salient object detection models demonstrate significant performance on benchmark datasets. Training such models, however, requires expensive pixel-wise annotations of salient objects. Moreover, many existing salient object detection models assume that at least a salient object exists in the input image. Such an impractical assumption leads to less appealing salienc...

متن کامل

Weakly Supervised Action Detection

Detection of human action in videos has many applications such as video surveillance and content based video retrieval. Actions can be considered as spatio-temporal objects corresponding to spatio-temporal volumes in a video. The problem of action detection can thus be solved similarly to object detection in 2D images [3] where typically an object classifier is trained using positive and negati...

متن کامل

Weakly Supervised Learning of Foreground-Background Segmentation Using Masked RBMs

We propose an extension of the Restricted Boltzmann Machine (RBM) that allows the joint shape and appearance of foreground objects in cluttered images to be modeled independently of the background. We present a learning scheme that learns this representation directly from cluttered images with only very weak supervision. The model generates plausible samples and performs foreground-background s...

متن کامل

Weakly-supervised Dictionary Learning

We present a probabilistic modeling and inference framework for discriminative analysis dictionary learning under a weak supervision setting. Dictionary learning approaches have been widely used for tasks such as low-level signal denoising and restoration as well as high-level classification tasks, which can be applied to audio and image analysis. Synthesis dictionary learning aims at jointly l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Pattern Recognition

سال: 2023

ISSN: ['1873-5142', '0031-3203']

DOI: https://doi.org/10.1016/j.patcog.2022.109279