نتایج جستجو برای: reward
تعداد نتایج: 29303 فیلتر نتایج به سال:
Zhang and Zafar proposed a video compression scheme based on the wavelet representation and multiresolution motion compensation (MRMC). In this letter, an additional masking module will be created to further enhance its efficiency. Specifically, between the modules of wavelet decomposition and MRMC, the masking module will be inserted which will construct binary images based on the difference o...
Reinforcement learning agents interacting with a complex environment like the real world are unlikely to behave optimally all the time. If such an agent is operating in real-time under human supervision, now and then it may be necessary for a human operator to press the big red button to prevent the agent from continuing a harmful sequence of actions—harmful either for the agent or for the envi...
Associative learning studies have shown that the anticipation of reward and punishment shapes the representation of sensory stimuli, which is further modulated by dopamine. Less is known about whether and how reward delivery activates sensory cortices and the role of dopamine at that time point of learning. We used an appetitive instrumental learning task in which participants had to learn that...
BACKGROUND The value of a predicted reward can be estimated based on the conjunction of both the intrinsic reward value and the length of time to obtain it. The question we addressed is how the two aspects, reward size and proximity to reward, influence the responses of neurons in rostral anterior cingulate cortex (rACC), a brain region thought to play an important role in reward processing. ...
Recent findings have demonstrated that reward feedback alone can drive motor learning. However, it is not yet clear whether reward feedback alone can lead to learning when a perturbation is introduced abruptly, or how a reward gradient can modulate learning. In this study, we provide reward feedback that decays continuously with increasing error. We asked whether it is possible to learn an abru...
Reward sensitivity or the tendency to engage in motivated approach behavior in the presence of rewarding stimuli, may be a contributing factor to vulnerability to disinhibitory behaviors. While evidence exists for a reward sensitivity-related increased response in reward brain areas (i.e., nucleus accumbens or midbrain) during the processing of reward cues, it is unknown how this trait modulate...
Simulating mood within a decision making process has been shown to allow cooperation to occur within the Prisoner’s Dilemma. In this paper we propose how to integrate a mood model into the classical reinforcement learning algorithm Sarsa, and show how this addition can allow self-interested agents to be successful within a multi agent environment. The human-inspired moody agent will learn to co...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید