Nips*97 the Eeciency and the Robustness of Natural Gradient Descent Learning Rule Sub-category: Dynamics of Learning Algorithms Category: Theory
نویسنده
چکیده
We have discovered a new scheme to represent the Fisher information matrix of a stochastic multi-layer perceptron. Based on this scheme, we have designed an algorithm to compute the inverse of the Fisher information matrix. When the input dimension n is much larger than the number of hidden neurons, the complexity of this algorithm is of order O(n 2) while the complexity of conventional algorithms for the same purpose is of order O(n 3). The inverse of the Fisher information matrix is used in the natural gradient descent algorithm to train single-layer or multi-layer per-ceptrons. It is connrmed by simulation that the natural gradient descent learning rule is not only eecient but also robust.
منابع مشابه
The Efficiency and the Robustness of Natural Gradient Descent Learning Rule
The inverse of the Fisher information matrix is used in the natural gradient descent algorithm to train single-layer and multi-layer perceptrons. We have discovered a new scheme to represent the Fisher information matrix of a stochastic multi-layer perceptron. Based on this scheme, we have designed an algorithm to compute the natural gradient. When the input dimension n is much larger than the ...
متن کاملFast Online Policy Gradient Learning with SMD Gain Vector Adaptation
Reinforcement learning by direct policy gradient estimation is attractive in theory but in practice leads to notoriously ill-behaved optimization problems. We improve its robustness and speed of convergence with stochastic meta-descent, a gain vector adaptation method that employs fast Hessian-vector products. In our experiments the resulting algorithms outperform previously employed online sto...
متن کاملA Hybrid Optimization Algorithm for Learning Deep Models
Deep learning is one of the subsets of machine learning that is widely used in Artificial Intelligence (AI) field such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inferences need to solve an optimized problem. In deep learning, the most important problem that can be solved by optimization is neural n...
متن کاملA Hybrid Optimization Algorithm for Learning Deep Models
Deep learning is one of the subsets of machine learning that is widely used in Artificial Intelligence (AI) field such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inferences need to solve an optimized problem. In deep learning, the most important problem that can be solved by optimization is neural n...
متن کاملPosition Control of a Pulse Width Modulated Pneumatic Systems: an Experimental Comparison
In this study, a new adaptive controller is proposed for position control of pneumatic systems. Difficulties associated with the mathematical model of the system in addition to the instability caused by Pulse Width Modulation (PWM) in the learning-based controllers using gradient descent, motivate the development of a new approach for PWM pneumatics. In this study, two modified Feedback Error L...
متن کامل