Search results for: stochastic gradient descent learning

Number of results: 840759

Journal: CoRR 2017
Zheng-Chu Guo Lei Shi

In this paper, we study an online learning algorithm without explicit regularization terms. This algorithm is essentially a stochastic gradient descent scheme in a reproducing kernel Hilbert space (RKHS). The polynomially decaying step size in each iteration can play the role of regularization, ensuring the generalization ability of the online learning algorithm. We develop a novel capacity dependen...
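A minimal sketch of the kind of scheme this abstract describes: unregularized stochastic gradient descent in an RKHS, where the only regularization comes from a polynomially decaying step size. The Gaussian kernel and the values of eta0 and theta are illustrative assumptions, not the paper's exact setting.

```python
import numpy as np

def gaussian_kernel(x, z, sigma=1.0):
    return np.exp(-np.sum((x - z) ** 2) / (2 * sigma ** 2))

def online_kernel_sgd(X, y, eta0=0.5, theta=0.5, sigma=1.0):
    """One pass over the data; f is kept as a kernel expansion f(x) = sum_i a_i K(x_i, x)."""
    centers, coeffs = [], []
    for t, (x_t, y_t) in enumerate(zip(X, y), start=1):
        f_xt = sum(a * gaussian_kernel(c, x_t, sigma) for a, c in zip(coeffs, centers))
        eta_t = eta0 * t ** (-theta)          # polynomially decaying step size
        # least-squares gradient in the RKHS: (f(x_t) - y_t) * K(x_t, .)
        centers.append(x_t)
        coeffs.append(-eta_t * (f_xt - y_t))  # note: no explicit regularization term
    return centers, coeffs
```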

Journal: CoRR 2017
Hamid Eghbal-zadeh Matthias Dorfer Gerhard Widmer

Within-Class Covariance Normalization (WCCN) is a powerful post-processing method for normalizing the within-class covariance of a set of data points. WCCN projects the observations into a linear sub-space where the within-class variability is reduced. This property has proven to be beneficial in subsequent recognition tasks. The central idea of this paper is to reformulate the classic WCCN as ...
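For concreteness, a rough sketch of classic WCCN as described above: estimate the within-class covariance and whiten it with a Cholesky factor of its inverse, so that within-class variability is reduced in the projected space. The class-averaging and ridge-regularization details are assumptions; implementations differ on these points.

```python
import numpy as np

def wccn_projection(X, labels, reg=1e-6):
    """Return B such that B @ B.T approximates the inverse within-class covariance."""
    classes = np.unique(labels)
    d = X.shape[1]
    W = np.zeros((d, d))
    for c in classes:
        Xc = X[labels == c]
        Xc = Xc - Xc.mean(axis=0)
        W += Xc.T @ Xc / len(Xc)
    W = W / len(classes) + reg * np.eye(d)    # class-averaged covariance + small ridge
    B = np.linalg.cholesky(np.linalg.inv(W))  # W^{-1} = B @ B.T
    return B

# usage: B = wccn_projection(X, y); X_wccn = X @ B
```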

Journal: CoRR 2015
Zhouhan Lin Matthieu Courbariaux Roland Memisevic Yoshua Bengio

For most deep learning algorithms, training is notoriously time-consuming. Since most of the computation in training neural networks is typically spent on floating point multiplications, we investigate an approach to training that eliminates the need for most of these. Our method consists of two parts: first, we stochastically binarize weights to convert multiplications involved in computing hidd...
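A hedged sketch of the first part, stochastic weight binarization: real-valued weights are kept for the parameter update but sampled to {-1, +1} for the forward pass, so hidden-state computations need no floating point multiplications. The clipping range and probability rule below follow the common BinaryConnect-style formulation and are an assumption about this paper's exact recipe.

```python
import numpy as np

def binarize_stochastic(W, rng=np.random.default_rng()):
    """Sample binary weights from real-valued weights W."""
    Wc = np.clip(W, -1.0, 1.0)
    p = (Wc + 1.0) / 2.0                     # P(w_b = +1), linear in the clipped weight
    return np.where(rng.random(W.shape) < p, 1.0, -1.0)
```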

2008
Qin Iris Wang Dale Schuurmans Dekang Lin

We present a novel semi-supervised training algorithm for learning dependency parsers. By combining a supervised large margin loss with an unsupervised least squares loss, a discriminative, convex, semi-supervised learning algorithm can be obtained that is applicable to large-scale problems. To demonstrate the benefits of this approach, we apply the technique to learning dependency parsers from...

Journal: CoRR 2016
Thang D. Bui Sujith Ravi Vivek Ramavajjala

Label propagation is a powerful and flexible semi-supervised learning technique on graphs. Neural network architectures, on the other hand, have proven track records in many supervised learning tasks. In this work, we propose a training objective for neural networks, Neural Graph Machines, for combining the power of neural networks and label propagation. The new objective allows the neural netw...
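As a sketch of the flavour of such a combined objective (not the paper's exact formulation): a standard supervised loss on labelled examples plus a graph term that pulls the hidden representations of adjacent nodes together. The weight alpha and the squared-distance penalty are illustrative assumptions, and the paper further distinguishes edge types, which is omitted here.

```python
import numpy as np

def graph_augmented_loss(pred, y, H, edges, edge_w, alpha=0.1):
    """pred: (n_labelled, K) softmax outputs; y: integer labels;
    H: hidden representations for all nodes; edges: list of (i, j) index pairs."""
    sup = -np.mean(np.log(pred[np.arange(len(y)), y] + 1e-12))          # cross-entropy
    graph = sum(w * np.sum((H[i] - H[j]) ** 2) for (i, j), w in zip(edges, edge_w))
    return sup + alpha * graph
```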

Journal: CoRR 2017
Mrutyunjaya Panda

Even though there is a plethora of research on microarray gene expression data analysis, it still poses challenges for researchers to effectively and efficiently analyze the large yet complex expression of genes. The feature (gene) selection method is of paramount importance for understanding the differences in biological and non-biological variation between samples. In order to address this p...

Journal: CoRR 2015
Vijay Badrinarayanan Bamdev Mishra Roberto Cipolla

Recent works have highlighted scale invariance or symmetry that is present in the weight space of a typical deep network and the adverse effect that it has on the Euclidean gradient based stochastic gradient descent optimization. In this work, we show that these and other commonly used deep networks, such as those which use a max-pooling and sub-sampling layer, possess more complex forms of sym...

1997
Howard Hua Yang Shun-ichi Amari

The inverse of the Fisher information matrix is used in the natural gradient descent algorithm to train single-layer and multi-layer perceptrons. We have discovered a new scheme to represent the Fisher information matrix of a stochastic multi-layer perceptron. Based on this scheme, we have designed an algorithm to compute the natural gradient. When the input dimension n is much larger than the ...
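A minimal sketch of a natural gradient step: the ordinary gradient is preconditioned by the inverse Fisher information matrix. The empirical-Fisher approximation and damping used below are common stand-ins and an assumption on my part; the paper's contribution is an efficient representation of the exact Fisher for a stochastic multi-layer perceptron, which is not reproduced here.

```python
import numpy as np

def natural_gradient_step(theta, per_example_grads, lr=0.1, damping=1e-4):
    """theta: parameter vector (d,); per_example_grads: (n, d) gradients of the log-likelihood."""
    G = np.asarray(per_example_grads)
    grad = G.mean(axis=0)
    F = G.T @ G / len(G) + damping * np.eye(len(theta))  # empirical Fisher + damping
    return theta - lr * np.linalg.solve(F, grad)          # precondition by F^{-1}
```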

2016
Panos Toulis Edoardo M. Airoldi Joe Blitzstein Leon Bottou Bob Carpenter David Dunson Andrew Gelman Brian Kulis Xiao-Li Meng Natesh Pillai

Stochastic gradient descent procedures have gained popularity for parameter estimation from large data sets. However, their statistical properties are not well understood in theory, and in practice avoiding numerical instability requires careful tuning of key parameters. Here, we introduce implicit stochastic gradient descent procedures, which involve parameter updates that are implicitly def...
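To illustrate the idea, here is a sketch of implicit SGD for least-squares regression, where the implicitly defined update theta_{t+1} = theta_t - gamma_t * grad(loss)(theta_{t+1}) has a closed form: the ordinary SGD residual is shrunk by 1 + gamma_t * ||x_t||^2, which is what lends the procedure its numerical stability. The learning-rate schedule is an illustrative assumption.

```python
import numpy as np

def implicit_sgd_least_squares(X, y, gamma0=1.0):
    theta = np.zeros(X.shape[1])
    for t, (x_t, y_t) in enumerate(zip(X, y), start=1):
        gamma_t = gamma0 / t
        # implicit update: residual shrunk by the factor (1 + gamma_t * ||x_t||^2)
        resid = (x_t @ theta - y_t) / (1.0 + gamma_t * (x_t @ x_t))
        theta = theta - gamma_t * resid * x_t
    return theta
```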

2014
Jason Liang Keith Kelly

We implement stacked denoising autoencoders, a class of neural networks that are capable of learning powerful representations of high dimensional data. We describe stochastic gradient descent for unsupervised training of autoencoders, as well as a novel genetic algorithm based approach that makes use of gradient information. We analyze the performance of both optimization algorithms and also th...
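A compact sketch of what SGD training of a single denoising autoencoder layer looks like: corrupt the input with masking noise, encode, reconstruct, and update on the squared reconstruction error. Tied weights, sigmoid units and all hyperparameters are illustrative assumptions; stacking repeats this procedure layer by layer on the previous encoder's outputs.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_dae_layer(X, n_hidden, lr=0.1, noise=0.3, epochs=10, rng=np.random.default_rng(0)):
    n, d = X.shape
    W = rng.normal(0, 0.01, (d, n_hidden))
    b, c = np.zeros(n_hidden), np.zeros(d)
    for _ in range(epochs):
        for x in X[rng.permutation(n)]:
            x_tilde = x * (rng.random(d) > noise)          # masking corruption
            h = sigmoid(x_tilde @ W + b)                   # encode
            x_hat = sigmoid(h @ W.T + c)                   # decode (tied weights)
            delta_out = (x_hat - x) * x_hat * (1 - x_hat)  # squared-error output delta
            delta_hid = (delta_out @ W) * h * (1 - h)      # backpropagated hidden delta
            W -= lr * (np.outer(x_tilde, delta_hid) + np.outer(delta_out, h))
            b -= lr * delta_hid
            c -= lr * delta_out
    return W, b, c
```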
