J un 1 99 5 Unrealizable Learning in Binary Feed - Forward Neural Networks DRAFT February 1 , 2008
نویسنده
چکیده
Statistical mechanics is used to study unrealizable generalization in two large feed-forward neural networks with binary weights and output, a perceptron and a tree committee machine. The student is trained by a teacher being larger, i.e. having more units than the student. It is shown that this is the same as using training data corrupted by Gaussian noise. Each machine is considered in the high temperature limit and in the replica symmetric approximation as well as for one step of replica symmetry breaking. For the perceptron a phase transition is found for low noise. However the transition is not to optimal learning. If the noise is increased the transition disappears. In both cases ǫ g will approach optimal performance with a (ln α/α) k decay for large α. For the tree committee machine noise in the input layer is studied, as well as noise in the hidden layer. If there is no noise in the input layer there is, in the case of one step of repl! ica symmetry breaking, a phase tra nsition to optimal learning at some finite α for all levels of noise in the hidden layer. When noise is added to the input layer the generalization behavior is similar to that of the perceptron. For one step of replica symmetry breaking, in the realizable limit, the values of the spinodal points found in this paper disagree with previously reported estimates [1],[2]. Here the value α sp = 2.79 is found for the tree committee machine and α sp = 1.67 for the perceptron.
منابع مشابه
An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network
RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...
متن کاملArtificial Neural Networks (ANN) for the simultaneous spectrophotometric determination of fluoxetine and sertraline in pharmaceutical formulations and biological fluid
Simultaneous spectrophotometric estimation of Fluoxetine and Sertraline in tablets were performed using UV–Vis spectroscopic and Artificial Neural Networks (ANN). Absorption spectra of two components were recorded in 200–300 (nm) wavelengths region with an interval of 1 nm. The calibration models were thoroughly evaluated at several concentration levels using the spectra of synthetic binary mix...
متن کاملArtificial Neural Networks (ANN) for the simultaneous spectrophotometric determination of fluoxetine and sertraline in pharmaceutical formulations and biological fluid
Simultaneous spectrophotometric estimation of Fluoxetine and Sertraline in tablets were performed using UV–Vis spectroscopic and Artificial Neural Networks (ANN). Absorption spectra of two components were recorded in 200–300 (nm) wavelengths region with an interval of 1 nm. The calibration models were thoroughly evaluated at several concentration levels using the spectra of synthetic binary mix...
متن کاملEffect of sound classification by neural networks in the recognition of human hearing
In this paper, we focus on two basic issues: (a) the classification of sound by neural networks based on frequency and sound intensity parameters (b) evaluating the health of different human ears as compared to of those a healthy person. Sound classification by a specific feed forward neural network with two inputs as frequency and sound intensity and two hidden layers is proposed. This process...
متن کاملA novel approach to prediction of the 3-dimensional structures of protein backbones by neural networks.
Three-dimensional structures of protein backbones have been predicted using neural networks. A feed forward neural network was trained on a class of functionally, but not structurally, homologous proteins, using backpropagation learning. The network generated tertiary structure information in the form of binary distance constraints for the C(alpha) atoms in the protein backbone. The binary dist...
متن کامل