Search results for: temporal difference learning

Number of results: 1222164

Journal: Neural Networks: the official journal of the International Neural Network Society, 2002
Roland E. Suri

This article focuses on recent modeling studies of dopamine neuron activity and their influence on behavior. Activity of midbrain dopamine neurons is phasically increased by stimuli that increase the animal's reward expectation and is decreased below baseline levels when the reward fails to occur. These characteristics resemble the reward prediction error signal of the temporal difference (TD) ...

Journal: Proceedings of the AAAI Conference on Artificial Intelligence, 2020

Journal: Frontiers in Systems Neuroscience, 2009

Journal: SIAM Journal on Mathematics of Data Science, 2021

Article data. Submitted: 14 April 2020. Accepted: 01 March 2021. Published online: 05 October 2021. Keywords: temporal difference learning, Polyak-Ruppert averaging, variance reduction. AMS subject headings: 68Q25, 68R10, 68U05. ISSN (online): 2577-0187. Publisher: Society for Industrial and App...
