An effective algorithm for hyperparameter optimization of neural networks

نویسندگان

  • Gonzalo I. Diaz
  • Achille Fokoue
  • Giacomo Nannicini
  • Horst Samulowitz
چکیده

A major challenge in designing neural network (NN) systems is to determine the best structure and parameters for the network given the data for the machine learning problem at hand. Examples of parameters are the number of layers and nodes, the learning rates, and the dropout rates. Typically, these parameters are chosen based on heuristic rules and manually fine-tuned, which may be very time-consuming, because evaluating the performance of a single parametrization of the NN may require several hours. This paper addresses the problem of choosing appropriate parameters for the NN by formulating it as a box-constrained mathematical optimization problem, and applying a derivative-free optimization tool that automatically and effectively searches the parameter space. The optimization tool employs a radial basis function model of the objective function (the prediction accuracy of the NN) to accelerate the discovery of configurations yielding high accuracy. Candidate configurations explored by the algorithm are trained to a small number of epochs, and only the most promising candidates receive full training. The performance of the proposed methodology is assessed on benchmark sets and in the context of predicting drug-drug interactions, showing promising results. The optimization tool used in this paper is open-source.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling and Optimization of Roll-bonding Parameters for Bond Strength of Ti/Cu/Ti Clad Composites by Artificial Neural Networks and Genetic Algorithm

This paper deals with modeling and optimization of the roll-bonding process of Ti/Cu/Ti composite for determination of the best roll-bonding parameters leading to the maximum Ti/Cu bond strength by combination of neural network and genetic algorithm. An artificial neural network (ANN) program has been proposed to determine the effect of practical parameters, i.e., rolling temperature, reduction...

متن کامل

Hardness Optimization for Al6061-MWCNT Nanocomposite Prepared by Mechanical Alloying Using Artificial Neural Networks and Genetic Algorithm

Among artificial intelligence approaches, artificial neural networks (ANNs) and genetic algorithm (GA) are widely applied for modification of materials property in engineering science in large scale modeling. In this work artificial neural network (ANN) and genetic algorithm (GA) were applied to find the optimal conditions for achieving the maximum hardness of Al6061 reinforced by multiwall car...

متن کامل

The Optimization of the Effective Parameters of the Die in Parallel Tubular Channel Angular Pressing Process by Using Neural Network and Genetic Algorithm Methods

One of reasons that researchers in recent years have tried to produce ultrafine grained materials is producing lightweight components with high strength and reliability. There are disparate methods for production of ultra-fine grain materials,one of which is severe plastic deformation method. Severe plastic deformation method comprises different processes, one of which is Parallel tubular chann...

متن کامل

On Hyperparameter Optimization in Learning Systems

We study two procedures (reverse-mode and forward-mode) for computing the gradient of the validation error with respect to the hyperparameters of any iterative learning algorithm. These procedures mirror two ways of computing gradients for recurrent neural networks and have different trade-offs in terms of running time and space requirements. The reverse-mode procedure extends previous work by ...

متن کامل

The Optimization of the Effective Parameters of the Die in Parallel Tubular Channel Angular Pressing Process by Using Neural Network and Genetic Algorithm Methods

One of reasons that researchers in recent years have tried to produce ultrafine grained materials is producing lightweight components with high strength and reliability. There are disparate methods for production of ultra-fine grain materials,one of which is severe plastic deformation method. Severe plastic deformation method comprises different processes, one of which is Parallel tubular chann...

متن کامل

Traffic Signal Prediction Using Elman Neural Network and Particle Swarm Optimization

Prediction of traffic is very crucial for its management. Because of human involvement in the generation of this phenomenon, traffic signal is normally accompanied by noise and high levels of non-stationarity. Therefore, traffic signal prediction as one of the important subjects of study has attracted researchers’ interests. In this study, a combinatorial approach is proposed for traffic signal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IBM Journal of Research and Development

دوره 61  شماره 

صفحات  -

تاریخ انتشار 2017