reward processes

نتایج جستجو برای: reward processes

تعداد نتایج: 554393 فیلتر نتایج به سال:

asymptotic behavior of multivariate reward processes with nonlinear reward functions

Journal: :bulletin of the iranian mathematical society 2011

k. khorshidian a. r. soltani

متن کامل

Asymptotic Behavior of Multivariate Reward Processes with Nonlinear Reward Functions

Journal: Bulletin of the Iranian Mathematical Society 2011

A. R. Soltani K. Khorshidian

متن کامل

COVARIANCE MATRIX OF MULTIVARIATE REWARD PROCESSES WITH NONLINEAR REWARD FUNCTIONS

Journal: Journal of Sciences Islamic Republic of Iran 2003

Multivariate reward processes with reward functions of constant rates, defined on a semi-Markov process, first were studied by Masuda and Sumita, 1991. Reward processes with nonlinear reward functions were introduced in Soltani, 1996. In this work we study a multivariate process , , where are reward processes with nonlinear reward functions respectively. The Laplace transform of the covar...

متن کامل

Neurotensin in reward processes

Journal: :Neuropharmacology 2020

متن کامل

Markov Decision Processes: Discounted Expected Reward or Average Expected Reward?

Journal: :Journal of Mathematical Analysis and Applications 1993

متن کامل

Sparse Reward Processes

Journal: :CoRR 2012

Christos Dimitrakakis

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained during execution of one task has value for the execution of another task. Consequently, the agent is intrinsically motivated to explore its environment beyond the degree necessary to solve the current task it has at han...

متن کامل

Markov Decision Processes with Arbitrary Reward Processes

Journal: :Math. Oper. Res. 2008

Jia Yuan Yu Shie Mannor Nahum Shimkin

We consider a learning problem where the decision maker interacts with a standard Markov decision process, with the exception that the reward functions vary arbitrarily over time. We show that, against every possible realization of the reward process, the agent can perform as well—in hindsight—as every stationary policy. This generalizes the classical no-regret result for repeated games. Specif...

متن کامل

Real-reward testing for probabilistic processes

Journal: :Theoretical Computer Science 2014

متن کامل

Robust Average-Reward Markov Decision Processes

Journal: :Proceedings of the ... AAAI Conference on Artificial Intelligence 2023

In robust Markov decision processes (MDPs), the uncertainty in transition kernel is addressed by finding a policy that optimizes worst-case performance over an set of MDPs. While much literature has focused on discounted MDPs, average-reward MDPs remain largely unexplored. this paper, we focus where goal to find average reward set. We first take approach approximates using prove value function ...

متن کامل

Timing in reward and decision processes

Journal: :Philosophical Transactions of the Royal Society B: Biological Sciences 2014

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید