Sample-wise dynamic precision quantization for neural network acceleration

نویسندگان

چکیده

Quantization is a well-known method for deep neural networks (DNNs) compression and acceleration. In this work, we propose the Sample-Wise Dynamic Precision (SWDP) quantization scheme, which can switch bit-width of weights activations in model according to task difficulty input samples at runtime. Using low-precision easy images brings advantages terms computational energy efficiency. We also an adaptive hardware design efficient implementation our SWDP networks. The experimental results on various datasets demonstrate that achieves average 3.3× speedup 3.0× saving over bit-level dynamically composable architecture BitFusion.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sample-Wise Aiding in GPS/INS Ultra-Tight Integration for High-Dynamic, High-Precision Tracking

By aiding GPS receiver tracking loops with INS estimates of signal dynamics, GPS/INS ultra-tight coupling can improve the navigation performance in challenging environments. Traditionally the INS data are injected into the loops once every loop update interval, which limits the levels of dynamics accommodated. This paper presents a sample-wise aiding method, which interpolates the aiding Dopple...

متن کامل

Point-wise Convolutional Neural Network

Deep learning with 3D data such as reconstructed point clouds and CAD models has received great research interests recently. However, the capability of using point clouds with convolutional neural network has been so far not fully explored. In this paper, we present a convolutional neural network for semantic segmentation and object recognition with 3D point clouds. At the core of our network i...

متن کامل

Adaptive Quantization for Deep Neural Network

In recent years Deep Neural Networks (DNNs) have been rapidly developed in various applications, together with increasingly complex architectures. The performance gain of these DNNs generally comes with high computational costs and large memory consumption, which may not be affordable for mobile platforms. Deep model quantization can be used for reducing the computation and memory costs of DNNs...

متن کامل

Practical Block-wise Neural Network Architecture Generation

Convolutional neural networks have gained a remarkable success in computer vision. However, most usable network architectures are hand-crafted and usually require expertise and elaborate design. In this paper, we provide a block-wise network generation pipeline called BlockQNN which automatically builds high-performance networks using the Q-Learning paradigm with epsilon-greedy exploration stra...

متن کامل

Convolutional Neural Network for Pixel-Wise Skyline Detection

Outdoor augmented reality applications are an emerging class of software systems that demand the fast identification of natural objects, such as plant species or mountain peaks, in low power mobile devices. Convolutional Neural Networks (CNN) have exhibited superior performance in a variety of computer vision tasks, but their training is a labor intensive task and their execution requires non n...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEICE Electronics Express

سال: 2022

ISSN: ['1349-2543', '1349-9467']

DOI: https://doi.org/10.1587/elex.19.20220229