temporal difference learning

نتایج جستجو برای: temporal difference learning

تعداد نتایج: 1222164 فیلتر نتایج به سال:

Least-squares temporal difference learning based on extreme learning machine

2013

Pablo Escandell-Montero José María Martínez-Martínez José David Martín-Guerrero Emilio Soria-Olivas Juan Gómez-Sanchís

This paper proposes a least-squares temporal difference (LSTD) algorithm based on extreme learning machine that uses a singlehidden layer feedforward network to approximate the value function. While LSTD is typically combined with local function approximators, the proposed approach uses a global approximator that allows better scalability properties. The results of the experiments carried out o...

متن کامل

Truncating Temporal Differences: On the Efficient Implementation of TD(lambda) for Reinforcement Learning

Journal: :J. Artif. Intell. Res. 1995

Pawel Cichosz

Temporal diierence (TD) methods constitute a class of methods for learning predictions in multi-step prediction problems, parameterized by a recency factor. Currently the most important application of these methods is to temporal credit assignment in reinforcement learning. Well known reinforcement learning algorithms, such as AHC or Q-learning, may be viewed as instances of TD learning. This p...

متن کامل

the effect of learning strategies on the speaking ability of iranian students in the context of language institutes

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه یاسوج - دانشکده ادبیات و علوم انسانی 1392

مژگان صادقی, محمد جعفر جباری, علی کاظمی,

the effect of learning strategies on the speaking ability of iranian students in the context of language institutes abstract language learning strategies are of the most important factors that help language learners to learn a foreign language and how they can deal with the four language skills specifically speaking skill effectively. acknowledging the great impact of learning strategies...

On the worst-case analysis of temporal-difference learning algorithms

Journal: :Machine Learning 1996

متن کامل

Frontal operculum temporal difference signals and social motor response learning

Journal: :Human Brain Mapping 2009

متن کامل

Fuzzy interpretation for temporal-difference learning in anomaly detection problems

Journal: :Bulletin of the Polish Academy of Sciences Technical Sciences 2016

متن کامل

Deterministic limit of temporal difference reinforcement learning for stochastic games

Journal: :Physical Review E 2019

متن کامل

An Imperfect Dopaminergic Error Signal Can Drive Temporal-Difference Learning

Journal: :PLoS Computational Biology 2011

متن کامل

بررسی تاثیر کوتاه مدت وبلند مدت بازیهای زبانی بریادگیری واژگان

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه مازندران 1387

فخرالسادات رضایی کلاگر, رجب علی رعیتی دماوندی, باقر یعقوبی,

vocabulary as a major component of language learning has been the object of numerous studies each of which has its own contribution to the field. finding the best way of learning the words deeply and extensively is the common objective of most of those studies. however, one effective way for achieving this goal is somehow neglected in the field. using a variety of activities such as games can r...

Why and How Hippocampal Transition Cells Can Be Used in Reinforcement Learning

2010

Julien Hirel Philippe Gaussier Mathias Quoy Jean-Paul Banquet

In this paper we present a model of reinforcement learning (RL) which can be used to solve goal-oriented navigation tasks. Our model supposes that transitions between places are learned in the hippocampus (CA pyramidal cells) and associated with information coming from path-integration. The RL neural network acts as a bias on these transitions to perform action selection. RL originates in the b...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید