Search results for: gradient descent algorithm
Number of results: 869,527
• Less time than an identical iteration of Algorithm 1 if q^(t−1) ≤ τ_i and x_i = 0 (the update is skipped) and r is not updated. Specifically, StingyCD requires O(1) time, while CD requires O(NNZ(A_i)) time.
• The same amount of time (up to an O(1) term) as a CD iteration if the update is not skipped and r is not updated. In particular, both algorithms require the same number of O(NNZ(A_i)) op...
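The cost model above rests on the fact that a coordinate descent (CD) update touches only one column of A. As a rough illustration (not StingyCD itself, whose skip test q^(t−1) ≤ τ_i is not reproduced here), a minimal cyclic CD for the Lasso that maintains the residual r = b − Aw; the problem sizes and λ are illustrative assumptions:

```python
import numpy as np

def cd_lasso(A, b, lam=0.1, iters=100):
    """Cyclic coordinate descent for the Lasso, maintaining r = b - A @ w.

    Each coordinate update touches only column A[:, i]; that is the
    O(NNZ(A_i)) per-update cost the abstract refers to. A method like
    StingyCD saves exactly this cost whenever it can guarantee that the
    update would leave w_i at 0.
    """
    n, d = A.shape
    w = np.zeros(d)
    r = b.copy()                        # residual b - A @ w
    col_sq = (A ** 2).sum(axis=0)       # precomputed ||A_i||^2
    for _ in range(iters):
        for i in range(d):
            if col_sq[i] == 0.0:
                continue
            rho = A[:, i] @ r + col_sq[i] * w[i]
            # soft-thresholding solves the 1-D subproblem exactly
            w_new = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[i]
            if w_new != w[i]:
                r -= A[:, i] * (w_new - w[i])   # keep residual consistent
                w[i] = w_new
    return w

# Toy check: with A = I, the solution is b soft-thresholded by lam
w = cd_lasso(np.eye(3), np.array([2.0, -0.05, 1.0]))  # → [1.9, 0.0, 0.9]
```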
An adaptive back-propagation algorithm is studied and compared with gradient descent (standard back-propagation) for on-line learning in two-layer neural networks with an arbitrary number of hidden units. Within a statistical mechanics framework, both numerical studies and a rigorous analysis show that the adaptive back-propagation method results in faster training by breaking the symmetry bet...
Stochastic gradient descent (SGD) is a ubiquitous algorithm for a variety of machine learning problems. Researchers and industry have developed several techniques to optimize SGD's runtime performance, including asynchronous execution and reduced precision. Our main result is a martingale-based analysis that enables us to capture the rich noise models that may arise from such techniques. Specif...
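As a minimal sketch of the plain SGD update this abstract starts from (not the asynchronous or reduced-precision variants it analyzes); the loss, data, and learning rate below are illustrative assumptions:

```python
import random

def sgd(grad_fn, w, data, lr=0.1, epochs=50, seed=0):
    """Plain stochastic gradient descent: one sampled example per update."""
    rng = random.Random(seed)
    for _ in range(epochs):
        rng.shuffle(data)               # visit samples in random order
        for x, y in data:
            w = w - lr * grad_fn(w, x, y)
    return w

# 1-D least squares: loss (w*x - y)^2, gradient 2*x*(w*x - y);
# data generated from y = 3x, so SGD should recover w ≈ 3
data = [(x, 3.0 * x) for x in [1.0, 2.0, 3.0, 4.0]]
w = sgd(lambda w, x, y: 2 * x * (w * x - y), 0.0, data)
```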
This paper shows that the multi GDBF algorithm converges much faster than the single GDBF algorithm: it requires fewer iterations for the search point to closely approach the local maximum, with gradient descent bit flipping (GDBF) algorithms exhibiting better decoding performa...
Several recent empirical studies demonstrate that important machine learning tasks, such as training deep neural networks, exhibit a low-rank structure, where most of the variation in the loss function occurs in only a few directions of the input space. In this paper, we leverage this structure to reduce the high computational cost of canonical gradient-based methods such as gradient descent (GD). Our proposed Low-Rank Gradient De...
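A rough sketch of the low-rank idea (not the paper's actual LRGD algorithm): restrict each gradient step to a low-dimensional subspace, here an assumed matrix U with orthonormal columns spanning the directions in which the loss actually varies:

```python
import numpy as np

def low_rank_gd(grad, x0, U, lr=0.1, iters=200):
    """Gradient descent with every step projected onto span(U).

    If the loss varies mostly within span(U), the projected step
    loses little while working in far fewer dimensions.
    """
    x = x0.copy()
    for _ in range(iters):
        g = grad(x)
        x -= lr * U @ (U.T @ g)   # project the gradient onto span(U)
    return x

# Toy loss 0.5*((x0-1)^2 + (x1+2)^2): it varies only in the first two
# coordinates, so U = first two standard basis vectors suffices
U = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])
grad = lambda x: np.array([x[0] - 1.0, x[1] + 2.0, 0.0])
x = low_rank_gd(grad, np.zeros(3), U)  # → approx [1.0, -2.0, 0.0]
```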
Considering an under-supervised 3D space in which a group of mobile devices with limited sensing and communication capabilities is deployed, this paper proposes a decentralized self-deployment algorithm that lets the agents reach a maximally connected coverage topology. The problem is modeled as a maximization problem that is solved in a completely distributed manner. In fact, each agent tries to maximize its sensing volu...
In recent years it has become increasingly clear that the critical issue in gradient methods is the choice of the step length, and that using the gradient as the search direction can lead to very effective algorithms, whose surprisingly good behaviour has been only partially explained, mostly in terms of the spectrum of the Hessian matrix. On the other hand, the convergence of the classical Cauchy stee...
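One concrete step-length rule of the kind this line of work studies is the Barzilai-Borwein (BB) step, which keeps the gradient as the search direction but adapts the step from the last two iterates. A minimal sketch on a toy quadratic (the initial step size and test function are assumptions):

```python
import numpy as np

def gd_bb(grad, x, iters=50):
    """Gradient descent with the Barzilai-Borwein (BB1) step length."""
    g = grad(x)
    alpha = 1e-3                        # assumed initial step size
    for _ in range(iters):
        x_new = x - alpha * g
        g_new = grad(x_new)
        s, y = x_new - x, g_new - g     # iterate and gradient differences
        denom = s @ y
        if denom > 0:                   # keep the step well-defined
            alpha = (s @ s) / denom     # BB1 step: argmin ||s/a - y||
        x, g = x_new, g_new
    return x

# Quadratic f(x) = 0.5 * x^T diag(1, 10) x, minimizer at the origin
x = gd_bb(lambda x: np.array([1.0, 10.0]) * x, np.array([1.0, 1.0]))
```

Note the non-monotone character: with the BB step the objective need not decrease at every iteration, which is part of why its fast behaviour resisted a full explanation.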