نتایج جستجو برای: reward packages

تعداد نتایج: 46029  

Journal: :J. Inf. Sci. Eng. 2016
Degan Zhang Yanan Zhu Si Liu Xiaodan Zhang Jinjie Song

DEGAN ZHANG†, YANAN ZHU‡, SI LIU, XIAODAN ZHANG AND JINJIE SONG Key Laboratory of Computer Vision and System Ministry of Education, Tianjin Tianjin Key Lab of Intelligent Computing and Novel Software Technology Tianjin University of Technology Tianjin, 300384 P.R. China School of Electrical and Information Engineering The University of Sydney Sydney, NSW 2006, Australia Institute of Scientific ...

2014
Leong Teen Wei Rashad Yazdanifard

Each employee’s performance is important in an organization. A way to motivate it is through the application of reinforcement theory which is developed by B. F. Skinner. One of the most commonly used methods is positive reinforcement in which one’s behavior is strengthened or increased based on consequences. This paper aims to review the impact of positive reinforcement on the performances of e...

2007
Elisabeth A. Murray

Recent research provides new insights into amygdala contributions to positive emotion and reward. Studies of neuronal activity in the monkey amygdala and of autonomic responses mediated by the monkey amygdala show that, contrary to a widely held view, the amygdala is just as important for processing positive reward and reinforcement as it is for negative. In addition, neuropsychological studies...

2003
Stuart J. Russell Andrew Zimdars

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and runs its own reinforcement learning process. It supplies to a central arbitrator the Q-values (according to its own reward function) for each possible action. The arbitrator selects an action maximizing the sum of Q-v...

Journal: :Wireless Personal Communications 2014
Xiaoxiong Zhong Yang Qin Li Li

Cognitive radio (CR) has emerged as a promising technology to improve spectrum utilization. Capacity analysis is very useful in investigating the ultimate performance limits for wireless networks. Meanwhile, with increasing potential future applications for the CR systems, it is necessary to explore the limitations on their capacity in a dynamic spectrum access environment. However, due to spec...

2012
Michel Tokic Günther Palm

Stochastic neurons are deployed for efficient adaptation of exploration parameters by gradient-following algorithms. The approach is evaluated in model-free temporal-difference learning using discrete actions. The advantage is in particular memory efficiency, because memorizing exploratory data is only required for starting states. Hence, if a learning problem consist of only one starting state...

2015
Carinna Margaret Torgerson Andrei Irimia S.-Y. Matthew Goh John D. Van Horn

Pursuant to its commitment to cultivating a greater understanding of mental illness, the National Institutes of Health (NIH) have created the National Database for Clinical Trials, where data from a wide variety of NIH-funded studies are deposited in the hope that as many qualified researchers as possible can examine these data. As the designated Data Coordinating Center in the Autism Center of...

Journal: :Neuron 2013
John T. Arsenault Koen Nelissen Bechir Jarraya Wim Vanduffel

Stimulus-reward coupling without attention can induce highly specific perceptual learning effects, suggesting that reward triggers selective plasticity within visual cortex. Additionally, dopamine-releasing events-temporally surrounding stimulus-reward associations-selectively enhance memory. These forms of plasticity may be evoked by selective modulation of stimulus representations during dopa...

Journal: :Journal of Artificial Intelligence Research 2022

Reinforcement learning (RL) methods usually treat reward functions as black boxes. As such, these must extensively interact with the environment in order to discover rewards and optimal policies. In most RL applications, however, users have program function and, hence, there is opportunity make visible -- show function's code agent so it can exploit internal structure learn policies a more samp...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید