reward

Search results for: reward

Number of results: 29303

Double Q($\sigma$) and Q($\sigma, \lambda$): Unifying Reinforcement Learning Control Algorithms

2017

Markus Dumke

Temporal-difference (TD) learning is an important field in reinforcement learning. Sarsa and Q-Learning are among the most used TD algorithms. The Q(σ) algorithm (Sutton and Barto (2017)) unifies both. This paper extends the Q(σ) algorithm to an online multi-step algorithm Q(σ, λ) using eligibility traces and introduces Double Q(σ) as the extension of Q(σ) to double learning. Experiments sugges...


Alcohol demand, delayed reward discounting, and craving in relation to drinking and alcohol use disorders.

Journal: Journal of Abnormal Psychology, 2010

James MacKillop, Robert Miranda, Peter M Monti, Lara A Ray, James G Murphy, Damaris J Rohsenow, John E McGeary, Robert M Swift, Jennifer W Tidey, Chad J Gwaltney

A behavioral economic approach to alcohol use disorders (AUDs) emphasizes both individual and environmental determinants of alcohol use. The current study examined individual differences in alcohol demand (i.e., motivation for alcohol under escalating conditions of price) and delayed reward discounting (i.e., preference for immediate small rewards compared to delayed larger rewards) in 61 heavy...


Improvement in Game Agent Control Using State-Action Value Scaling

2008

Leo Galway, Darryl Charles, Michaela M. Black

The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned information over a series of consecutive games. Using a variation of the classic arcade game Pac-Man, the Sarsa algorithm has been utilised for the control of the Pac-Man game agent. The results indicate the use of state-action...


Maximum relevancy maximum complementary feature selection for multi-sensor activity recognition

Journal: Expert Syst. Appl., 2015

Saisakul Chernbumroong, Shuang Cang, Hongnian Yu

In the multi-sensor activity recognition domain, the input space is often large and contains irrelevant and overlapping features. It is important to perform feature selection in order to select the smallest number of features which can describe the outputs. This paper proposes a new feature selection algorithm using the maximal relevance and maximal complementary criteria (MRMC) based on neural...



The relationships among aberrant salience, reward motivation, and reward sensitivity

The Relationship Between Impulsivity for Reward and Learning From Reward

Effort-reward imbalance in academic employees: Examining different reward systems.

A model of food reward learning with dynamic reward exposure

Pavlovian reward learning elicits attentional capture by reward-associated stimuli

Reward for Excellence