Fast Approximation of the Gauss--Newton Hessian Matrix for the Multilayer Perceptron
نویسندگان
چکیده
We introduce a fast algorithm for entrywise evaluation of the Gauss--Newton Hessian (GNH) matrix fully connected feed-forward neural network. The has precomputation step and sampling step. While it generally requires $\mathcal{O}(Nn)$ work to compute an entry (and entire column) in GNH network with $N$ parameters $n$ data points, our reduces cost $\mathcal{O}(n+d/\epsilon^2)$ work, where $d$ is output dimension $\epsilon$ prescribed accuracy (independent $N$). One application constructing hierarchical-matrix ($\mathcal{H}$-matrix) approximation solving linear systems eigenvalue problems. It $\mathcal{O}(N^2)$ memory $\mathcal{O}(N^3)$ store factorize matrix, respectively. $\mathcal{H}$-matrix only $\mathcal{O}(N r_o)$ footprint r_o^2)$ be factorized, $r_o \ll N$ maximum rank off-diagonal blocks matrix. demonstrate performance on classification autoencoder networks. (A corrected version attached.)
منابع مشابه
Exact Calculation of the Hessian Matrix for the Multilayer Perceptron
The elements of the Hessian matrix consist of the second derivatives of the error measure with respect to the weights and thresholds in the network. They are needed in Bayesian estimation of network regularization parameters, for estimation of error bars on the network outputs, for network pruning algorithms, and for fast re-training of the network following a small change in the training data....
متن کاملEfficient Calculation of the Gauss-Newton Approximation of the Hessian Matrix in Neural Networks
The Levenberg-Marquardt (LM) learning algorithm is a popular algorithm for training neural networks; however, for large neural networks, it becomes prohibitively expensive in terms of running time and memory requirements. The most time-critical step of the algorithm is the calculation of the Gauss-Newton matrix, which is formed by multiplying two large Jacobian matrices together. We propose a m...
متن کاملAdaptive Cross Approximation for Compressing the Jacobian Matrix in the Gauss-newton Inversion
Among offshore hydrocarbon exploration technologies, the controlled source electromagnetic (CSEM) method have gained a lot of interest in both academia and industry because of its ability to detect hydrocarbon reservoirs [1]. In order to maximally extract information from the data, a full nonlinear inversion approach is employed [2]. In such an approach, the investigation domain is subdivided i...
متن کاملstudy of cohesive devices in the textbook of english for the students of apsychology by rastegarpour
this study investigates the cohesive devices used in the textbook of english for the students of psychology. the research questions and hypotheses in the present study are based on what frequency and distribution of grammatical and lexical cohesive devices are. then, to answer the questions all grammatical and lexical cohesive devices in reading comprehension passages from 6 units of 21units th...
the use of appropriate madm model for ranking the vendors of mci equipments using fuzzy approach
abstract nowadays, the science of decision making has been paid to more attention due to the complexity of the problems of suppliers selection. as known, one of the efficient tools in economic and human resources development is the extension of communication networks in developing countries. so, the proper selection of suppliers of tc equipments is of concern very much. in this study, a ...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: SIAM Journal on Matrix Analysis and Applications
سال: 2021
ISSN: ['1095-7162', '0895-4798']
DOI: https://doi.org/10.1137/19m129961x