نتایج جستجو برای: مدل reward beta

تعداد نتایج: 336113  

2014
Leong Teen Wei Rashad Yazdanifard

Each employee’s performance is important in an organization. A way to motivate it is through the application of reinforcement theory which is developed by B. F. Skinner. One of the most commonly used methods is positive reinforcement in which one’s behavior is strengthened or increased based on consequences. This paper aims to review the impact of positive reinforcement on the performances of e...

2007
Elisabeth A. Murray

Recent research provides new insights into amygdala contributions to positive emotion and reward. Studies of neuronal activity in the monkey amygdala and of autonomic responses mediated by the monkey amygdala show that, contrary to a widely held view, the amygdala is just as important for processing positive reward and reinforcement as it is for negative. In addition, neuropsychological studies...

2003
Stuart J. Russell Andrew Zimdars

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and runs its own reinforcement learning process. It supplies to a central arbitrator the Q-values (according to its own reward function) for each possible action. The arbitrator selects an action maximizing the sum of Q-v...

Journal: :Wireless Personal Communications 2014
Xiaoxiong Zhong Yang Qin Li Li

Cognitive radio (CR) has emerged as a promising technology to improve spectrum utilization. Capacity analysis is very useful in investigating the ultimate performance limits for wireless networks. Meanwhile, with increasing potential future applications for the CR systems, it is necessary to explore the limitations on their capacity in a dynamic spectrum access environment. However, due to spec...

2012
Michel Tokic Günther Palm

Stochastic neurons are deployed for efficient adaptation of exploration parameters by gradient-following algorithms. The approach is evaluated in model-free temporal-difference learning using discrete actions. The advantage is in particular memory efficiency, because memorizing exploratory data is only required for starting states. Hence, if a learning problem consist of only one starting state...

Journal: :Neuron 2013
John T. Arsenault Koen Nelissen Bechir Jarraya Wim Vanduffel

Stimulus-reward coupling without attention can induce highly specific perceptual learning effects, suggesting that reward triggers selective plasticity within visual cortex. Additionally, dopamine-releasing events-temporally surrounding stimulus-reward associations-selectively enhance memory. These forms of plasticity may be evoked by selective modulation of stimulus representations during dopa...

Journal: :Journal of Artificial Intelligence Research 2022

Reinforcement learning (RL) methods usually treat reward functions as black boxes. As such, these must extensively interact with the environment in order to discover rewards and optimal policies. In most RL applications, however, users have program function and, hence, there is opportunity make visible -- show function's code agent so it can exploit internal structure learn policies a more samp...

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه علوم کشاورزی و منابع طبیعی گرگان - دانشکده علوم کشاورزی 1391

به منظور کمّی سازی واکنش جوانه زنی و سبزشدن گیاه جو نسبت به دما، رطوبت و عمق کاشت، جوانه زنی این گیاه تحت تأثیر تیمار های دمایی مختلف (5، 10، 15، 20، 25، 30 و 35 درجه سانتی گراد) و پتانسیل های مختلف رطوبت (0، 2-، 4-، 6- و 8- بار) و سبز شدن در عمق های مختلف کاشت (2، 5، 7، 9 و 12 سانتی متر) در آزمایشگاه و گلخانه دانشگاه علوم کشاورزی و منابع طبیعی گرگان در سال 1391 مورد بررسی قرار گرفت. نتایج نشان ...

2014
Makoto Suzuki Hikari Kirimoto Kazuhiro Sugawara Mineo Oyama Sumio Yamada Jun-ichi Yamamoto Atsuhiko Matsunaga Michinari Fukuda Hideaki Onishi

Horizontal intracortical projections for agonist and antagonist muscles exist in the primary motor cortex (M1), and reward may induce a reinforcement of transmission efficiency of intracortical circuits. We investigated reward-induced change in M1 excitability for agonist and antagonist muscles. Participants were 8 healthy volunteers. Probabilistic reward tasks comprised 3 conditions of 30 tria...

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه پیام نور - دانشگاه پیام نور استان تهران - دانشکده ادبیات و علوم انسانی 1393

چکیده : از جمله عواملی که رفتار سازمانی هر فرد را به شدت تحت تأثیر قرار می دهد، هوش هیجانی و هوش معنوی هستند. این پژوهش با هدف بررسی رابطه بین هوش هیجانی و معنوی با تعهد سازمانی و رضایت شغلی کارکنان ادارات ورزش و جوانان استان همدان انجام شده است . جامعه آماری این پژوهش شامل کلیه کارکنان اداره های تربیت بدنی استان همدان (140=n) می باشد . نمونه آماری دراین پژوهش برابر با جامعه آماری است. در این ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید