نتایج جستجو برای: fuzzy approximators

تعداد نتایج: 90193  

2003
Rémi Coulom

This paper presents a model-based actorcritic algorithm in continuous time and space. Two function approximators are used: one learns the policy (the actor) and the other learns the state-value function (the critic). The critic learns with the TD(λ) algorithm and the actor by gradient ascent on the Hamiltonian. A similar algorithm had been proposed by Doya, but this one is more general. This al...

Journal: :Neural computation 2008
Ilya Sutskever Geoffrey E. Hinton

In this note, we show that exponentially deep belief networks can approximate any distribution over binary vectors to arbitrary accuracy, even when the width of each layer is limited to the dimensionality of the data. We further show that such networks can be greedily learned in an easy yet impractical way.

2011
Jaime Nava Vladik Kreinovich

Many real-life dependencies can be reasonably accurately described by linear functions. If we want a more accurate description, we need to take non-linear terms into account. To take nonlinear terms into account, we can either explicitly add quadratic terms to the regression equation, or, alternatively, we can use a neural network with a non-linear activation function. At first glance, regressi...

Journal: :IEEE transactions on neural networks 2006
Guang-Bin Huang Lei Chen Chee Kheong Siew

According to conventional neural network theories, single-hidden-layer feedforward networks (SLFNs) with additive or radial basis function (RBF) hidden nodes are universal approximators when all the parameters of the networks are allowed adjustable. However, as observed in most neural network implementations, tuning all the parameters of the networks may cause learning complicated and inefficie...

Journal: :CoRR 2015
Norbert Bátfai

The agent program, called Samu, is an experiment to build a disembodied DevRob (Developmental Robotics) chatter bot that can talk in a natural language like humans do. One of the main design feature is that Samu can be interacted with using only a character terminal. This is important not only for practical aspects of Turing test or Loebner prize, but also for the study of basic principles of D...

Journal: :CoRR 2014
Richard Peng

We give the first O(mpolylog(n)) time algorithms for approximating maximum flows in undirected graphs and constructing polylog(n)-quality cut-approximating hierarchical tree decompositions. Our algorithm invokes existing algorithms for these two problems recursively while gradually incorporating size reductions. These size reductions are in turn obtained via ultra-sparsifiers, which are key too...

2017
Zhongwen Xu Joseph Modayil Hado P. van Hasselt André Barreto David Silver Tom Schaul

Neural networks have a smooth initial inductive bias, such that small changes in input do not lead to large changes in output. However, in reinforcement learning domains with sparse rewards, value functions have non-smooth structure with a characteristic asymmetric discontinuity whenever rewards arrive. We propose a mechanism that learns an interpolation between a direct value estimate and a pr...

2004
A. Alessandri

where C~ E R" ,xt R" , and gt E RP are the state, control, and measurement vectors, respectively. The initial state 5 is unknown. We assume that 5 E X and 5 E U , where X and U are compact sets. Now, let us consider a sliding-window observer. This means that, a t a given stage t and for a given temporal window of length N stages, we have to recovery c ~ ~ on the basis of the last N + 1 measurem...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید