Oracle MCG: A first peek into COCO Detection Challenges
نویسندگان
چکیده
Microsoft COCO [2] is a new annotated database in computer vision consisting of more than 200.000 images. There are currently more than one million annotated objects from 80 categories, with fully segmented masks. With respect to Pascal [1], the previous available dataset with semantic segmentation annotations, COCO has four times the number of categories and two orders of magnitude more images and annotated objects. In this context, the challenges for object detection in COCO have recently been presented. In a nutshell, competing methods should provide a list of detections on each image in the form of bounding boxes or segmentation masks. Each detection should have associated one of the 80 categories and a confidence score. These new challenges on a new dataset are unchartered territory for researchers used to work on Pascal for some years. This work’s main aim is to set a reference point in the challenges to get a first grasp of which results to expect. To do so, we provide Oracle MCG, a hypothetical detector consisting of an oracle picking the best object proposal of a state-of-the-art technique on all annotated objects. This result could be understood as the upper-bound result one could get by restricting themselves to selecting object proposals without any further refinement. In particular, we use the publicly-available pre-computed results given by MCG [3] on the validation set of COCO (also available for the test set). On average, this method produces 5075 proposals per image. We then overlap all proposals on each image against all annotated objects and pick the best one for each of them. Each of these best proposals is associated to the annotated category and assigned a constant score of 1. The results obtained, as given by the COCO official evaluation software, are shown in Table 1 for bounding boxes and in Table 2 for segmentation masks. The highlighted row is the one that will define the ranking of the competition. Below some observations about the results.
منابع مشابه
انتخاب ناحیههای کاندید در سیستمهای تشخیص و شناسایی اشیاء
According to the studies carried out in recent years, determination of the regional proposal is one of the crucial steps in detection and recognition of the objects included in an image. In fact, determination of this region has been like a bottleneck, gaining a significant computational energy. As a result, selection of suitable and fast approaches, under these circumstance, may enhance the pe...
متن کاملOracle Performance for Visual Captioning
The task of associating images and videos with a natural language description has attracted a great amount of attention recently. The state-of-the-art results on some of the standard datasets have been pushed into the regime where it has become more and more difficult to make significant improvements. Instead of proposing new models, this work investigates performances that an oracle can obtain...
متن کاملMaximum Coverage Problem with Group Budget Constraints and Applications
We study a variant of the maximum coverage problem which we label the maximum coverage problem with group budget constraints (MCG). We are given a collection of sets S = {S1, S2, . . . , Sm} where each set Si is a subset of a given ground set X. In the maximum coverage problem the goal is to pick k sets from S to maximize the cardinality of their union. In the MCG problem S is partitioned into ...
متن کاملA MultiPath Network for Object Detection
The recent COCO object detection dataset presents several new challenges for object detection. In particular, it contains objects at a broad range of scales, less prototypical images, and requires more precise localization. To address these challenges, we test three modifications to the standard Fast R-CNN object detector: (1) skip connections that give the detector access to features at multip...
متن کاملCOCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
This paper describes the COCO-Text dataset. In recent years large-scale datasets like SUN and Imagenet drove the advancement of scene understanding and object recognition. The goal of COCO-Text is to advance state-of-the-art in text detection and recognition in natural images. The dataset is based on the MS COCO dataset, which contains images of complex everyday scenes. The images were not coll...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1509.03660 شماره
صفحات -
تاریخ انتشار 2015