Multimodal Object Detection via Probabilistic Ensembling
نویسندگان
چکیده
Object detection with multimodal inputs can improve many safety-critical systems such as autonomous vehicles (AVs). Motivated by AVs that operate in both day and night, we study object RGB thermal cameras, since the latter provides much stronger signatures under poor illumination. We explore strategies for fusing information from different modalities. Our key contribution is a probabilistic ensembling technique, ProbEn, simple non-learned method fuses together detections multi-modalities. derive ProbEn Bayes’ rule first principles assume conditional independence across Through marginalization, elegantly handles missing modalities when detectors do not fire on same object. Importantly, also notably improves even assumption does hold, e.g., outputs other fusion methods (both off-the-shelf trained in-house). validate two benchmarks containing aligned (KAIST) unaligned (FLIR) images, showing outperforms prior work more than 13% relative performance!
منابع مشابه
Producing Scores for Customers via Ensembling SVM
Supervised by Dr. Yilong Yin Email:[email protected] School of Computer Science and Technology, Shandong University Jinan 250100, China Abstract This report shows our solution to PAKDD Competition 2007. Following a brief description of the data mining task, we discuss four difficulties to be dealt with in this task. Then, we show how to do the data pre-processing. To weaken class-imbalance of th...
متن کاملMultimodal Sparse Features for Object Detection
In this paper the sparse coding principle is employed for the representation of multimodal image data, i.e. image intensity and range. We estimate an image basis for frontal face images taken with a Time-ofFlight (TOF) camera to obtain a sparse representation of facial features, such as the nose. These features are then evaluated in an object detection scenario where we estimate the position of...
متن کاملScaling for Multimodal 3D Object Detection
We investigate two methods for scalable 3D object detection. We base our approach on a recently proposed template matching algorithm [5] for detecting 3D objects. First, we demonstrate that it is possible to gain significant increase in runtime performance of the algorithm at almost no cost in accuracy by quickly rejecting most regions of the image with low-resolution templates. Second, we inve...
متن کاملProbabilistic visual learning for object detection
We present an unsupervised technique for visual learning which is based on density estimation in high-dimensional spaces using an eigenspace decomposition. Two types of density estimates are derived for modeling the training data: a multivariate Gaussian (for a unimodal distribution) and a multivariate Mixture-of-Gaussians model (for multimodal distributions). These probability densities are th...
متن کاملObject Detection Using Probabilistic Graph Model
We propose a Probabilistic Graph based Model for interactive image segmentation. A multilayer probabilistic Graph is constructed from an oversegmentation of the image to model the relationships among super pixel regions, edge segments and vertices. We used an iterative procedure to merge several regions based on the probability of the regions. Regions are merged until the user is satisfied with...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-20077-9_9