Sample-wise dynamic precision quantization for neural network acceleration
نویسندگان
چکیده
Quantization is a well-known method for deep neural networks (DNNs) compression and acceleration. In this work, we propose the Sample-Wise Dynamic Precision (SWDP) quantization scheme, which can switch bit-width of weights activations in model according to task difficulty input samples at runtime. Using low-precision easy images brings advantages terms computational energy efficiency. We also an adaptive hardware design efficient implementation our SWDP networks. The experimental results on various datasets demonstrate that achieves average 3.3× speedup 3.0× saving over bit-level dynamically composable architecture BitFusion.
منابع مشابه
Sample-Wise Aiding in GPS/INS Ultra-Tight Integration for High-Dynamic, High-Precision Tracking
By aiding GPS receiver tracking loops with INS estimates of signal dynamics, GPS/INS ultra-tight coupling can improve the navigation performance in challenging environments. Traditionally the INS data are injected into the loops once every loop update interval, which limits the levels of dynamics accommodated. This paper presents a sample-wise aiding method, which interpolates the aiding Dopple...
متن کاملPoint-wise Convolutional Neural Network
Deep learning with 3D data such as reconstructed point clouds and CAD models has received great research interests recently. However, the capability of using point clouds with convolutional neural network has been so far not fully explored. In this paper, we present a convolutional neural network for semantic segmentation and object recognition with 3D point clouds. At the core of our network i...
متن کاملAdaptive Quantization for Deep Neural Network
In recent years Deep Neural Networks (DNNs) have been rapidly developed in various applications, together with increasingly complex architectures. The performance gain of these DNNs generally comes with high computational costs and large memory consumption, which may not be affordable for mobile platforms. Deep model quantization can be used for reducing the computation and memory costs of DNNs...
متن کاملPractical Block-wise Neural Network Architecture Generation
Convolutional neural networks have gained a remarkable success in computer vision. However, most usable network architectures are hand-crafted and usually require expertise and elaborate design. In this paper, we provide a block-wise network generation pipeline called BlockQNN which automatically builds high-performance networks using the Q-Learning paradigm with epsilon-greedy exploration stra...
متن کاملConvolutional Neural Network for Pixel-Wise Skyline Detection
Outdoor augmented reality applications are an emerging class of software systems that demand the fast identification of natural objects, such as plant species or mountain peaks, in low power mobile devices. Convolutional Neural Networks (CNN) have exhibited superior performance in a variety of computer vision tasks, but their training is a labor intensive task and their execution requires non n...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEICE Electronics Express
سال: 2022
ISSN: ['1349-2543', '1349-9467']
DOI: https://doi.org/10.1587/elex.19.20220229