Re-Introducing BN Into Transformers for Vision Tasks

نویسندگان

چکیده

In recent years, Transformer-based models have exhibited significant advancements over previous in natural language processing and vision tasks. This powerful methodology has also been extended to the 3D point cloud domain, where it can mitigate inherent difficulties posed by irregular disorderly nature of clouds. However, attention mechanism within Transformer presents challenges for utilizing Batch Normalization (BN), as statistical information cannot be extracted efficiently from data set. Thus, this study proposes a novel residual structure, ResBN, which effectively handle data. Additionally, replace BN transformer 2D image processing, we introduce Patch (PN) technique. ResBN PN are evaluated on datasets respectively through experiments, demonstrating their efficacy enhancing classification performance.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Introducing OpenMP Tasks into the HYDRO Benchmark

The HYDRO mini-application has been successfully used as a research vehicle in previous PRACE projects [6]. In this paper, we evaluate the benefits of the tasking model introduced in recent OpenMP standards [9]. We have developed a new version of HYDRO using the concept of OpenMP tasks and this implementation is compared to already existing and optimized OpenMP versions of HYDRO.

متن کامل

Consistency Checking for Vision Tasks

Video surveillance systems usually have to operate on thousands or ten-thousands of frames. Interactive frame-byframe assessment of results, therefore, is time consuming and expensive. We present a simple and fast approach which allows automated cross-checking of image segmentations obtained by different algorithms or two versions of the same algorithm. Image regions show up in a number of vide...

متن کامل

Metric Predicate Transformers : Towards aNotion of Re nement for

A compositional weakest precondition semantics is given for a parallel language with recursion using a new metric resumption domain. By extending the classical duality of predicate vs. state transformers, the weakest precondition semantics for the parallel language is shown to be isomorphic to the standard metric state transformer semantics. Moreover , a notion of reenement for predicate transf...

متن کامل

(Re)introducing Regular Graph Languages

Distributions over strings and trees can be represented by probabilistic regular languages, which characterise many models in natural language processing. Recently, several datasets have become available which represent natural language phenomena as graphs, so it is natural to ask whether there is an equivalent of probabilistic regular languages for graphs. This paper presents regular graph lan...

متن کامل

DeViouS: A Distributed Environment for Vision Tasks

We present a system for the integration of computer vision tasks in a distributed environment. This system, called DeViouS, is based on the client/server model and runs in a heterogeneous environment of Unix workstations. It takes advantage of the free cycles in modern workstation environments to distribute and speed up the execution of vision tasks. Two primary goals of DeViouS are to provide ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3283612