Hydra Attention: Efficient Attention with Many Heads


Abstract

While transformers have begun to dominate many tasks in vision, applying them to large images is still computationally difficult. A large reason for this is that self-attention scales quadratically with the number of tokens, which, in turn, scales quadratically with the image size. On larger images (e.g., 1080p), over 60% of the total computation in the network is spent solely on creating and applying attention matrices. We take a step toward solving this issue by introducing Hydra Attention, an extremely efficient attention operation for Vision Transformers (ViTs). Paradoxically, this efficiency comes from taking multi-head attention to its extreme: by using as many attention heads as there are features, Hydra Attention is computationally linear in both tokens and features with no hidden constants, making it significantly faster than standard self-attention in an off-the-shelf ViT-B/16 by a factor of the token count. Moreover, Hydra Attention retains high accuracy on ImageNet and, in some cases, actually improves it.
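To make the "as many heads as features" idea concrete, here is a minimal NumPy sketch. The abstract does not give the formula, so the details are assumptions: the softmax is replaced by a cosine-similarity kernel (L2-normalized queries and keys), and the function names `hydra_attention` and `per_head_reference` are hypothetical. Under those assumptions, each head owns a single feature, and the per-head N×N attention matrix collapses into one global sum over tokens.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    # Normalize each token's feature vector to unit length (assumed kernel).
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def hydra_attention(q, k, v):
    """Sketch of attention with one head per feature.

    q, k, v: arrays of shape (tokens, features).
    Cost is O(tokens * features); no tokens x tokens matrix is built.
    """
    q = l2_normalize(q)
    k = l2_normalize(k)
    # Aggregate keys and values into one global feature vector: shape (1, D).
    kv = (k * v).sum(axis=0, keepdims=True)
    # Each token gates the global vector with its own query features.
    return q * kv

def per_head_reference(q, k, v):
    """Same quantity computed the slow way, for comparison only:
    an explicit tokens x tokens attention matrix for every 1-feature head."""
    q = l2_normalize(q)
    k = l2_normalize(k)
    out = np.empty_like(v, dtype=float)
    for d in range(q.shape[1]):
        attn = np.outer(q[:, d], k[:, d])  # (N, N) matrix for head d
        out[:, d] = attn @ v[:, d]
    return out

rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((6, 4)) for _ in range(3))
print(np.allclose(hydra_attention(q, k, v), per_head_reference(q, k, v)))
```

With N tokens and D features, the reference version touches on the order of N²·D values while the fast version touches on the order of N·D, which is consistent with the abstract's claim of a speedup by a factor of the token count.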


Similar resources

Splenic marginal zone lymphoma: hydra with many heads?

In this issue of the Journal, Baseggio et al. 1 report on a series of 24 patients with CD5-positive, t(11;14)-negative splenic marginal zone lymphoma (SMZL) diagnosed by means of cytology and flow cytometry of peripheral blood. All the patients were splenectomized at diagnosis or during follow-up and, consequently, spleen specimens were available for histological examination in all cases. The b...


Smashing the Stack with Hydra: The Many Heads of Advanced Polymorphic Shellcode

Recent work on the analysis of polymorphic shellcode engines suggests that modern obfuscation methods would soon eliminate the usefulness of signature-based network intrusion detection methods [36] and supports growing views that the new generation of shellcode cannot be accurately and efficiently represented by the string signatures which current IDS and AV scanners rely upon. In this paper, w...


Motor Performance in Relation with Sustained Attention in Children with Attention Deficit Hyperactivity Disorder

Objective: The present study compares the relationship between motor performance, sustained attention, and impulse control in children with Attention Deficit Hyperactivity Disorder and normal children. Materials & Methods: In this descriptive-analytic study, 21 boys with ADHD and 21 normal boys aged 7-10 years participated. Motor performance by using Bruininks Oseretsky Test ...


Effectiveness of Music Therapy on Sustained Attention and Selective Attention in Children with Attention Deficit/ Hyperactivity Disorder

The aim of this study was to evaluate the effectiveness of music therapy on sustained attention and selective attention in children with attention deficit hyperactivity disorder. The research method was quasi-experimental with a pre-test-post-test design with a control group. The statistical population included all children with attention deficit-hyperactivity disorder who referred to psycholog...



Journal

Journal title: Lecture Notes in Computer Science

Year: 2023

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-25082-8_3