Towards Versatile Pedestrian Detector with Multisensory-Matching and Multispectral Recalling Memory
Abstract
Recently, automated surveillance cameras can switch between a visible sensor and a thermal sensor for all-day operation. However, existing single-modal pedestrian detectors mainly focus on detecting pedestrians in only one specific modality (i.e., visible or thermal), so they cannot cope with other modal inputs. In addition, recent multispectral pedestrian detectors have shown remarkable performance by adopting multispectral modalities, but they also have limitations in practical applications (e.g., different Field-of-View (FoV) and frame rate). In this paper, we introduce a versatile pedestrian detector that shows robust detection performance in any single modality. We propose a multisensory-matching contrastive loss to reduce the difference between the visual representation of pedestrians in the visible and thermal modalities. Moreover, for robust detection on a single modality, we design a Multispectral Recalling (MSR) Memory. The MSR Memory enhances the visual representation of the single-modal features by recalling that of the multispectral modalities. To guide the MSR Memory to store the multispectral modal contexts, we introduce a multispectral recalling loss. It enables the pedestrian detector to encode more discriminative features with a single input modality. We believe our method is a step forward toward a detector that can be applied to a variety of real-world applications. The comprehensive experimental results verify the effectiveness of the proposed method.
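The abstract does not give the formulas, so the following is only a rough sketch of the two ideas it names, under stated assumptions. The contrastive loss is written as a generic InfoNCE-style objective over paired visible/thermal feature vectors, and the memory read is a generic soft-attention lookup with a residual addition. The function names, the temperature value, and the residual enhancement are all hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-12)


def matching_contrastive_loss(visible, thermal, temperature=0.1):
    """InfoNCE-style loss (an assumed stand-in, not the paper's formula).

    visible[i] and thermal[i] are assumed to describe the same pedestrian
    (positive pair); thermal[j] with j != i act as negatives.
    """
    n = len(visible)
    total = 0.0
    for i in range(n):
        logits = [cosine(visible[i], thermal[j]) / temperature for j in range(n)]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(x - m) for x in logits))
        total += log_denom - logits[i]
    return total / n


def recall_from_memory(query, memory_keys, memory_values):
    """Soft-attention memory read (an assumed design, not the paper's).

    Returns (enhanced, weights): weights are softmax(query . key) over the
    memory slots, and enhanced is the query plus the weighted sum of values.
    Values are assumed to have the same dimension as the query.
    """
    scores = [sum(q * k for q, k in zip(query, key)) for key in memory_keys]
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    dim = len(memory_values[0])
    recalled = [
        sum(w * value[d] for w, value in zip(weights, memory_values))
        for d in range(dim)
    ]
    enhanced = [q + r for q, r in zip(query, recalled)]
    return enhanced, weights
```

In this sketch the loss is small when each visible feature is closest to its own thermal counterpart and large when the pairs are mismatched, which is the sense in which such a loss "reduces the difference" between the two modalities. The memory read lets a single-modality feature pull in stored context; how the actual MSR Memory is trained to store multispectral context is not specified in the abstract.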
Similar resources
Multispectral Deep Neural Networks for Pedestrian Detection
Multispectral pedestrian detection is essential for around-the-clock applications, e.g., surveillance and autonomous driving. We deeply analyze Faster R-CNN for multispectral pedestrian detection task and then model it into a convolutional network (ConvNet) fusion problem. Further, we discover that ConvNet-based pedestrian detectors trained by color or thermal images separately provide compleme...
Multispectral Image Dense Matching
We build a dataset including four kinds of multispectral images with labeled key point correspondence. The dataset includes 7 RGB/NIR image pairs, 4 RGB/Depth image pairs, 3 Flash/no-flash image pairs and 4 different exposure image pairs. In each pair, we uniformly select corner points and label their correspondences. The resolution of the RGB/Depth image pairs is 640 × 480 and all the other im...
Multispectral imaging using a single bucket detector.
Existing multispectral imagers mostly use available array sensors to separately measure 2D data slices in a 3D spatial-spectral data cube. Thus they suffer from low photon efficiency, limited spectrum range and high cost. To address these issues, we propose to conduct multispectral imaging using a single bucket detector, to take full advantage of its high sensitivity, wide spectrum range, low c...
The Fastest Pedestrian Detector in the West
We demonstrate a multiscale pedestrian detector operating in near real time (∼6 fps on 640x480 images) with state-of-the-art detection performance. The computational bottleneck of many modern detectors is the construction of an image pyramid, typically sampled at 8-16 scales per octave, and associated feature computations at each scale. We propose a technique to avoid constructing such a finely...
Multisensory context portends object memory
Multisensory processes facilitate perception of currently-presented stimuli and can likewise enhance later object recognition. Memories for objects originally encountered in a multisensory context can be more robust than those for objects encountered in an exclusively visual or auditory context [1], upturning the assumption that memory performance is best when encoding and recognition contexts ...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2022
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v36i1.20001