Multi‐scale pedestrian detection with global–local attention and multi‐scale receptive field context

نویسندگان

چکیده

As a basic component in the field of computer vision, pedestrian detection plays an essential role several real-world applications such as video surveillance. The promising performance has been achieved relying on deep learning, but large-scale variance and small-scale remain inherently hard before. In order to deal with aforementioned problems, this paper proposes multi-scale method global–local attention receptive context (MRFC). To make network focus pedestrians, we add high-resolution branch original detector. better integrate incongruous semantic feature, module is embedded highlight feature representation pedestrians so implement fusion effectively. adapt achieve scale-variance detection, MRFC applied. Based integrating above structures, proposed achieves competitive results Caltech CityPersons datasets. source code released https://github.com/xiaopan999/yolov5-pedestrian_detection.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

3D-Guided Multiscale Sliding Window for Pedestrian Detection

The most relevant modules of a pedestrian detector are the candidate generation and the candidate classification. The former aims at presenting image windows to the latter so that they are classified as containing a pedestrian or not. Much attention has being paid to the classification module, while candidate generation has mainly relied on (multiscale) sliding window pyramid. However, candidat...

متن کامل

Multiscale keypoint hierarchy for Focus-of-Attention and object detection

Hypercolumns in area V1 contain frequencyand orientation-selective simple and complex cells for line (bar) and edge coding, plus end-stopped cells for keypoint (vertex) detection. A single-scale (single-frequency) mathematical model of single and double end-stopped cells on the basis of Gabor filter responses was developed by Heitger et al. (1992 Vision Research 32 963-981). We developed an imp...

متن کامل

Electrostatic Field-Based Multiscale Corner Detection: A Physics-Motivated Approach

Corners represent special features of interest in images. They are very useful in many vision problems such as optical ow, structure from motion, and motion correspondence. A corner may be deened as the junction point between two or more straight line edges, or a point on the object's boundary curve having a curvature extremum. Our deenition of a corner is a point having an electrostatic eld ex...

متن کامل

Multiscale Discriminant Saliency for Visual Attention

The bottom-up saliency, an early stage of humans’ visual attention, can be considered as a binary classification problem between center and surround classes. Discriminant power of features for the classification is measured as mutual information between features and two classes distribution. The estimated discrepancy of two feature classes very much depends on considered scale levels; then, mul...

متن کامل

Multiscale Modeling of Pedestrian Dynamics: Individuality vs. Collectivity

The dynamics of human crowds are mainly ruled by mutual interactions among pedestrians. The latter develop indeed behavioral strategies based on their perception of the state of the surrounding environment, including especially the presence of neighboring individuals. For instance, when heading for a certain destination pedestrians normally deviate from their preferred paths in order to avoid c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Iet Computer Vision

سال: 2022

ISSN: ['1751-9632', '1751-9640']

DOI: https://doi.org/10.1049/cvi2.12125