نتایج جستجو برای: temporal difference learning

تعداد نتایج: 1222164  

2013
Pablo Escandell-Montero José María Martínez-Martínez José David Martín-Guerrero Emilio Soria-Olivas Juan Gómez-Sanchís

This paper proposes a least-squares temporal difference (LSTD) algorithm based on extreme learning machine that uses a singlehidden layer feedforward network to approximate the value function. While LSTD is typically combined with local function approximators, the proposed approach uses a global approximator that allows better scalability properties. The results of the experiments carried out o...

Journal: :J. Artif. Intell. Res. 1995
Pawel Cichosz

Temporal diierence (TD) methods constitute a class of methods for learning predictions in multi-step prediction problems, parameterized by a recency factor. Currently the most important application of these methods is to temporal credit assignment in reinforcement learning. Well known reinforcement learning algorithms, such as AHC or Q-learning, may be viewed as instances of TD learning. This p...

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه یاسوج - دانشکده ادبیات و علوم انسانی 1392

the effect of learning strategies on the speaking ability of iranian students in the context of language institutes abstract language learning strategies are of the most important factors that help language learners to learn a foreign language and how they can deal with the four language skills specifically speaking skill effectively. acknowledging the great impact of learning strategies...

Journal: :Bulletin of the Polish Academy of Sciences Technical Sciences 2016

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه مازندران 1387

vocabulary as a major component of language learning has been the object of numerous studies each of which has its own contribution to the field. finding the best way of learning the words deeply and extensively is the common objective of most of those studies. however, one effective way for achieving this goal is somehow neglected in the field. using a variety of activities such as games can r...

2010
Julien Hirel Philippe Gaussier Mathias Quoy Jean-Paul Banquet

In this paper we present a model of reinforcement learning (RL) which can be used to solve goal-oriented navigation tasks. Our model supposes that transitions between places are learned in the hippocampus (CA pyramidal cells) and associated with information coming from path-integration. The RL neural network acts as a bias on these transitions to perform action selection. RL originates in the b...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید