Search results for: stochastic gradient descent learning

Number of results: 840,759

Journal: CoRR 2017
Alex Rogozhnikov Tatiana Likhomanenko

In machine learning, ensemble methods have demonstrated high accuracy on a variety of problems in different areas. The best-known algorithms in widespread practical use are random forests and gradient boosting. In this paper we present InfiniteBoost, a novel algorithm that combines the best properties of these two approaches. The algorithm constructs an ensemble of trees for which two pr...
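The two ingredients named here, bagging-style randomization and gradient boosting, can be combined in a few lines. The sketch below is a generic illustration on synthetic data (toy regression task, scikit-learn trees, arbitrarily chosen parameters), not the InfiniteBoost update rule itself:

```python
# Generic sketch: gradient boosting with random-forest-style row subsampling.
# Illustrative only; this is NOT the InfiniteBoost algorithm from the paper.
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
y = X[:, 0] - 2 * X[:, 1] + rng.normal(scale=0.1, size=500)   # toy target

pred, lr, trees = np.zeros_like(y), 0.1, []
for _ in range(100):
    residual = y - pred                                        # negative gradient of squared loss
    idx = rng.choice(len(X), size=len(X) // 2, replace=False)  # row subsampling, as in bagging
    tree = DecisionTreeRegressor(max_depth=3).fit(X[idx], residual[idx])
    pred += lr * tree.predict(X)                               # boosting step
    trees.append(tree)

print("train MSE:", np.mean((y - pred) ** 2))
```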

Journal: Neural networks : the official journal of the International Neural Network Society 2002
Barbara Hammer Thomas Villmann

We propose a new scheme for extending generalized learning vector quantization (GLVQ) with weighting factors for the input dimensions. The factors allow an appropriate scaling of the input dimensions according to their relevance. They are adapted automatically during training according to the specific classification task, whereby training can be interpreted as stochastic gradient descent on an a...
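The adaptation described here can be illustrated with a simplified relevance-LVQ loop: per-sample gradient steps move both the winning prototype and a vector of per-dimension relevance weights. A toy sketch with simplified update rules, not the paper's exact cost function:

```python
# Simplified relevance learning in LVQ: prototypes and per-dimension relevance
# weights are both adapted by per-sample gradient-style steps. Illustrative
# only; update rules are a simplification, not the paper's GLVQ cost.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 4))
X[:100, 0] += 3.0                      # only dimension 0 separates the classes
y = np.array([0] * 100 + [1] * 100)

protos = np.array([[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]])  # one per class
lam = np.ones(4) / 4                   # relevance weights for the input dimensions
eta_w, eta_l = 0.05, 0.005

for _ in range(20):
    for i in rng.permutation(len(X)):
        x, c = X[i], y[i]
        d = [np.sum(lam * (x - w) ** 2) for w in protos]    # weighted distances
        k = int(np.argmin(d))
        sign = 1.0 if k == c else -1.0
        lam -= sign * eta_l * (x - protos[k]) ** 2          # shrink if correct, grow if not
        lam = np.clip(lam, 1e-6, None)
        lam /= lam.sum()
        protos[k] += sign * eta_w * lam * (x - protos[k])   # attract or repel the winner

print("learned relevances:", lam.round(2))   # dimension 0 should dominate
```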

Journal: CoRR 2017
Xingwen Zhang Jeff Clune Kenneth O. Stanley

Because stochastic gradient descent (SGD) has shown promise in optimizing neural networks with millions of parameters, and few if any alternatives are known to exist, it has moved to the heart of leading approaches to reinforcement learning (RL). For that reason, the recent result from OpenAI showing that a particular kind of evolution strategy (ES) can rival the performance of SGD-based deep RL me...
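For readers unfamiliar with the alternative under discussion: the ES variant popularized by OpenAI estimates a search gradient purely from the returns of randomly perturbed parameter vectors, with no backpropagation. A minimal sketch, using a toy quadratic as a stand-in for an RL episode return:

```python
# Minimal natural-evolution-strategies-style update: perturb parameters,
# evaluate fitness, and move along the fitness-weighted perturbations.
# The quadratic "fitness" is a toy stand-in for an RL return.
import numpy as np

rng = np.random.default_rng(0)

def fitness(theta):                  # toy stand-in for an episode return
    return -np.sum((theta - 3.0) ** 2)

theta, sigma, alpha, n = np.zeros(10), 0.1, 0.02, 50
for _ in range(500):
    eps = rng.normal(size=(n, theta.size))                     # population of perturbations
    rewards = np.array([fitness(theta + sigma * e) for e in eps])
    rewards = (rewards - rewards.mean()) / (rewards.std() + 1e-8)  # rank-free normalization
    theta += alpha / (n * sigma) * (eps.T @ rewards)           # ES gradient estimate

print("theta (should approach 3):", theta.round(2))
```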

Hamed Memarian fard

The use of artificial neural networks has increased in many areas of engineering. In particular, this method has been applied to many geotechnical engineering problems and demonstrated some degree of success. A review of the literature reveals that it has been used successfully in modeling soil behavior, site characterization, earth retaining structures, settlement of structures, slope stabilit...

Journal: CoRR 2017
Daning Cheng Shigang Li Yunquan Zhang

Stochastic gradient descent (SGD) is a popular stochastic optimization method in machine learning. Traditional parallel SGD algorithms, e.g., SimuParallel SGD [1], often require all nodes to have the same performance or to consume equal quantities of data. However, these requirements are difficult to satisfy when parallel SGD algorithms run in a heterogeneous computing environment; low-perf...
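One simple way to drop the equal-performance assumption, sketched below purely for illustration (it is not the scheme proposed in the paper), is to let each worker process a batch sized to its speed and weight the aggregated gradient by the number of samples each worker consumed:

```python
# Sketch: synchronous SGD where heterogeneous workers process different batch
# sizes, and gradients are combined weighted by samples consumed. Workers are
# simulated in-process. Illustrative only; not the paper's algorithm.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1200, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true + rng.normal(scale=0.05, size=len(X))

w, lr = np.zeros(3), 0.1
speeds = [64, 32, 8]                     # samples per step for fast/medium/slow nodes
for _ in range(200):
    grads, counts = [], []
    for b in speeds:                     # each "worker" draws a batch of its own size
        idx = rng.choice(len(X), size=b, replace=False)
        grads.append(2 * X[idx].T @ (X[idx] @ w - y[idx]) / b)
        counts.append(b)
    w -= lr * np.average(grads, axis=0, weights=counts)   # weight by data consumed

print("recovered weights:", w.round(3))
```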

Journal: Journal of Machine Learning Research 2015
Maren Mahsereci Philipp Hennig

In deterministic optimization problems, line search routines are a standard tool for ensuring stability and efficiency. In the stochastic setting, no direct equivalent has so far been formulated, because uncertain gradients do not allow for a strict sequence of decisions collapsing the search space. We construct a probabilistic version of the line search paradigm by combining the structure of exis...
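For context, the deterministic routine being generalized is typically a backtracking search enforcing the Armijo sufficient-decrease condition; the paper's probabilistic version, roughly speaking, replaces the exact function and gradient values used below with a surrogate built from noisy observations. A standard textbook sketch:

```python
# Classic deterministic backtracking (Armijo) line search, the baseline the
# probabilistic line search generalizes. Toy quadratic objective.
import numpy as np

def backtracking(f, grad, x, p, alpha=1.0, rho=0.5, c=1e-4):
    """Shrink alpha until the Armijo sufficient-decrease condition holds."""
    fx, slope = f(x), grad(x) @ p                # slope < 0 for a descent direction
    while f(x + alpha * p) > fx + c * alpha * slope:
        alpha *= rho
    return alpha

f = lambda x: np.sum(x ** 2)
grad = lambda x: 2 * x
x = np.array([3.0, -2.0])
p = -grad(x)                                     # steepest-descent direction
print("accepted step size:", backtracking(f, grad, x, p))
```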

Journal: Pattern Recognition Letters 2015
Basura Fernando Tatiana Tommasi Tinne Tuytelaars

Domain adaptation aims at adapting the knowledge acquired on a source domain to a new, different but related target domain. Several approaches have been proposed for classification tasks in the unsupervised scenario, where no labeled target data are available. Most of the attention has been dedicated to the search for a new domain-invariant representation, leaving the definition of the prediction fun...
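One classic example of the representation-searching line of work mentioned here is subspace alignment, which maps the source PCA subspace onto the target one before any predictor is trained. A generic sketch on synthetic data, not the prediction-function approach of this particular paper:

```python
# Subspace alignment sketch: align the source PCA basis with the target basis
# so source and target features become comparable. Generic illustration only.
import numpy as np

rng = np.random.default_rng(0)
Xs = rng.normal(size=(300, 10))                    # source features
Xt = Xs @ rng.normal(size=(10, 10)) * 0.5 + 1.0    # shifted/rotated "target" domain

def top_pcs(X, d):
    """Top-d principal directions as a (features, d) basis."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Vt[:d].T

d = 3
Ps, Pt = top_pcs(Xs, d), top_pcs(Xt, d)
M = Ps.T @ Pt                                      # alignment matrix
Zs = (Xs - Xs.mean(axis=0)) @ Ps @ M               # aligned source representation
Zt = (Xt - Xt.mean(axis=0)) @ Pt                   # target representation
print(Zs.shape, Zt.shape)                          # both (n, d), now comparable
```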

Journal: CoRR 2016
Dipankar Das Sasikanth Avancha Dheevatsa Mudigere Karthikeyan Vaidyanathan Srinivas Sridharan Dhiraj D. Kalamkar Bharat Kaul Pradeep Dubey

We design and implement a distributed multinode synchronous SGD algorithm without altering hyperparameters, compressing data, or changing algorithmic behavior. We perform a detailed analysis of scaling and identify optimal design points for different networks. We demonstrate scaling of CNNs on hundreds of nodes, and present what we believe to be record training throughputs. A 512 minibatch VGG...
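Schematically, synchronous data-parallel SGD computes per-node gradients on disjoint shards, averages them (the allreduce step), and applies the identical update everywhere, which is why unchanged hyperparameters are even possible. A single-process simulation, for illustration only:

```python
# Schematic of synchronous data-parallel SGD: per-shard gradients, averaged
# as an allreduce would, then one identical update. Simulated in-process;
# a real system would use MPI/NCCL-style collectives across nodes.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1024, 4))
w_true = rng.normal(size=4)
y = X @ w_true
shards = np.array_split(np.arange(len(X)), 8)    # 8 simulated "nodes"

w, lr = np.zeros(4), 0.1
for _ in range(100):
    local = [2 * X[s].T @ (X[s] @ w - y[s]) / len(s) for s in shards]
    g = np.mean(local, axis=0)                   # the allreduce: average gradients
    w -= lr * g                                  # identical update on every node

print("max weight error:", np.abs(w - w_true).max())
```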

Journal: CoRR 2015
Shen-Yi Zhao Wu-Jun Li

Stochastic gradient descent (SGD) and its variants have become increasingly popular in machine learning due to their efficiency and effectiveness. To handle large-scale problems, researchers have recently proposed several parallel SGD methods for multicore systems. However, existing parallel SGD methods cannot achieve satisfactory performance in real applications. In this paper, we propose a f...
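The multicore setting targeted here is often illustrated with Hogwild!-style lock-free updates, where threads write to a shared weight vector without synchronization. The sketch below only mimics that access pattern (Python's GIL serializes the bytecode) and is not the algorithm proposed in the paper:

```python
# Hogwild!-style illustration: several threads update shared weights with no
# locks. Access-pattern demo only; not the paper's method, and the GIL means
# this gains no real parallel speedup in pure Python.
import threading
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 3))
w_true = np.array([2.0, -1.0, 0.5])
y = X @ w_true
w = np.zeros(3)                                  # shared parameters, no lock

def worker(seed, steps=2000, lr=0.01):
    r = np.random.default_rng(seed)
    for _ in range(steps):
        i = int(r.integers(len(X)))
        g = 2 * (X[i] @ w - y[i]) * X[i]         # single-sample gradient
        w[:] -= lr * g                           # unsynchronized in-place write

threads = [threading.Thread(target=worker, args=(s,)) for s in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print("recovered weights:", w.round(3))
```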

2013
Tom Schaul Sixin Zhang Yann LeCun

The performance of stochastic gradient descent (SGD) depends critically on how learning rates are tuned and decreased over time. We propose a method to automatically adjust multiple learning rates so as to minimize the expected error at any one time. The method relies on local gradient variations across samples. In our approach, learning rates can increase as well as decrease, making it suitabl...
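The idea of driving learning rates from local gradient variation can be sketched like this: track per-parameter running estimates of the gradient mean and second moment, and scale the step by their ratio, which is near 1 when successive sample gradients agree and near 0 when they are mostly noise. A simplified illustration only; the paper's full rule also uses curvature information:

```python
# Variance-driven per-parameter learning rates, simplified. The rate factor
# gbar^2 / g2bar shrinks automatically as gradients become noise-dominated.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 4))
w_true = np.array([1.0, 2.0, -1.0, 0.5])
y = X @ w_true + rng.normal(scale=0.1, size=len(X))

w = np.zeros(4)
gbar, g2bar, tau = np.zeros(4), np.ones(4), 0.1
for _ in range(3000):
    i = rng.integers(len(X))
    g = 2 * (X[i] @ w - y[i]) * X[i]             # single-sample gradient
    gbar = (1 - tau) * gbar + tau * g            # running mean of the gradient
    g2bar = (1 - tau) * g2bar + tau * g * g      # running second moment
    eta = gbar ** 2 / (g2bar + 1e-12)            # per-parameter rate factor
    w -= 0.1 * eta * g

print("recovered weights:", w.round(2))
```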

[Chart: number of search results per year]