IDIAP Technical report
نویسنده
چکیده
Proper initialization is one of the most important prerequisites for fast convergence of feed-forward neural networks like high order and multilayer perceptrons. This publication aims at determining the optimal value of the initial weight v ariance (or range), which is the principal parameter of random weight initialization methods for both types of neural networks. An overview of random weight initialization methods for multilayer perceptrons is presented. These methods are extensively tested using eight real-world benchmark data sets and a broad range of initial weight v ariances by means of more than 30 000 simulations, in the aim to nd the best weight initialization method for multilayer perceptrons. For high order networks, a large number of experiments (more than 200 000 simulations) was performed, using three weight distributions, three activation functions, several network orders, and the same eight data sets. The results of these experiments are compared to weight initialization techniques for multilayer perceptrons, which leads to the proposal of a suitable weight initialization method for high order perceptrons. The conclusions on the weight initialization methods for both types of networks are justiied by suuciently small conndence intervals of the mean convergence times.
منابع مشابه
IDIAP Resear h Report 02 - 03 Estimation of Conditional Distributions usingGaussian Mixture Models
متن کامل
Relevant consequence and empirical inquiry
A criterion of adequacy is proposed for theories of relevant consequence. According to the criterion, scientists whose deductive reasoning is limited to some proposed subset of the standard consequence relation must not thereby suffer a reduction in scientific competence. A simple theory of relevant consequence is introduced and shown to satisfy the criterion with respect to a formally defined ...
متن کامل