Quantized neural network design under weight capacity constraint

نویسندگان

  • Sungho Shin
  • Kyuyeon Hwang
  • Wonyong Sung
چکیده

The complexity of deep neural network algorithms for hardware implementation can be lowered either by scaling the number of units or reducing the word-length of weights. Both approaches, however, can accompany the performance degradation although many types of research are conducted to relieve this problem. Thus, it is an important question which one, between the network size scaling and the weight quantization, is more effective for hardware optimization. For this study, the performances of fully-connected deep neural networks (FCDNNs) and convolutional neural networks (CNNs) are evaluated while changing the network complexity and the word-length of weights. Based on these experiments, we present the effective compression ratio (ECR) to guide the trade-off between the network size and the precision of weights when the hardware resource is limited.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

APPLICATION OF NEURAL NETWORK IN EVALUATION OF SEISMIC CAPACITY FOR STEEL STRUCTURES UNDER CRITICAL SUCCESSIVE EARTHQUAKES

Depending on the tectonic activities, most buildings subject to multiple earthquakes, while a single design earthquake is suggested in most seismic design codes. Perhaps, the lack of easy assessment to second shock information and sometimes use of inappropriate methods in estimating these features cause successive earthquakes mainly were ignored in the analysis procedure. In order to overcome t...

متن کامل

BinaryRelax: A Relaxation Approach For Training Deep Neural Networks With Quantized Weights

We propose BinaryRelax, a simple two-phase algorithm, for training deep neural networks with quantized weights. The set constraint that characterizes the quantization of weights is not imposed until the late stage of training, and a sequence of pseudo quantized weights is maintained. Specifically, we relax the hard constraint into a continuous regularizer via Moreau envelope, which turns out to...

متن کامل

Weight reduction of aluminum disc wheels under fatigue constraints using a sequential neural network approximation method

This paper describes a weight reduction problem of aluminum disc wheels under cornering fatigue constraints. It is a special structural optimization problem because of the existence of the implicit fatigue constraint. A sequential neural network approximation method is presented to solve this type of discrete-variable engineering optimization problems. First a back-propagation neural network is...

متن کامل

Designing Path for Robot Arm Extensions Series with the Aim of Avoiding Obstruction with Recurring Neural Network

In this paper, recurrent neural network is used for path planning in the joint space of the robot with obstacle in the workspace of the robot. To design the neural network, first a performance index has been defined as sum of square of error tracking of final executor. Then, obstacle avoidance scheme is presented based on its space coordinate and its minimum distance between the obstacle and ea...

متن کامل

Optimal Coding Subgraph Selection under Survivability Constraint

Nowadays communication networks have become an essential and inevitable part of human life. Hence, there is an ever-increasing need for expanding bandwidth, decreasing delay and data transfer costs. These needs necessitate the efficient use of network facilities. Network coding is a new paradigm that allows the intermediate nodes in a network to create new packets by combining the packets recei...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1611.06342  شماره 

صفحات  -

تاریخ انتشار 2016