temporal difference learning

نتایج جستجو برای: temporal difference learning

تعداد نتایج: 1222164 فیلتر نتایج به سال:

TD models of reward predictive responses in dopamine neurons

Journal: :Neural networks : the official journal of the International Neural Network Society 2002

Roland E. Suri

This article focuses on recent modeling studies of dopamine neuron activity and their influence on behavior. Activity of midbrain dopamine neurons is phasically increased by stimuli that increase the animal's reward expectation and is decreased below baseline levels when the reward fails to occur. These characteristics resemble the reward prediction error signal of the temporal difference (TD) ...

متن کامل

An analysis of temporal-difference learning with function approximation

Journal: :IEEE Transactions on Automatic Control 1997

متن کامل

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning

Journal: :Proceedings of the AAAI Conference on Artificial Intelligence 2020

متن کامل

Temporal difference learning does not always lead to STDP

Journal: :Frontiers in Systems Neuroscience 2009

متن کامل

A Reinforcement Learning Model Based on Temporal Difference Algorithm

Journal: :IEEE Access 2019

متن کامل

Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis

Journal: :SIAM journal on mathematics of data science 2021

Related DatabasesWeb of Science You must be logged in with an active subscription to view this.Article DataHistorySubmitted: 14 April 2020Accepted: 01 March 2021Published online: 05 October 2021Keywordstemporal difference learning, Polyak--Ruppert averaging, variance reductionAMS Subject Headings68Q25, 68R10, 68U05Publication DataISSN (online): 2577-0187Publisher: Society for Industrial and App...

متن کامل

Improving Generalization for Temporal Difference Learning: The Successor Representation

Journal: :Neural Computation 1993

متن کامل

Temporal difference models describe higher-order learning in humans

Journal: :Nature 2004

متن کامل

A temporal difference method for multi-objective reinforcement learning

Journal: :Neurocomputing 2017

متن کامل

Correlation minimizing replay memory in temporal-difference reinforcement learning

Journal: :Neurocomputing 2020

متن کامل

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید