Asymptotic Behavior of Multivariate Reward Processes with Nonlinear Reward Functions

Authors

K. Khorshidian

A. R. Soltani


Similar resources

Asymptotic Behavior of Multivariate Reward Processes with Nonlinear Reward Functions


Covariance Matrix of Multivariate Reward Processes with Nonlinear Reward Functions

Multivariate reward processes with reward functions of constant rates, defined on a semi-Markov process, were first studied by Masuda and Sumita (1991). Reward processes with nonlinear reward functions were introduced in Soltani (1996). In this work we study a multivariate process whose components are reward processes with nonlinear reward functions. The Laplace transform of the covar...


On Markov Decision Processes with Pseudo-Boolean Reward Functions


Asymptotics for renewal-reward processes with retrospective reward structure

Let $\{(X_i, Y_i) : i = \ldots, -1, 0, 1, \ldots\}$ be a doubly infinite renewal-reward process, where $\{X_i : i = \ldots, -1, 0, 1, \ldots\}$ is an i.i.d. sequence of renewal cycle lengths and $Y_i = g(X_{i-q}, X_{i-q+1}, \ldots, X_i)$ is the lump reward earned at the end of the $i$th renewal cycle, with some function $g : \mathbb{R}^{q+1} \to \mathbb{R}$. Starting with the first renewal cycle (of duration $X_1$) at the time origin, let $C(t)$ denote the e...


Markov Decision Processes with Arbitrary Reward Processes

We consider a learning problem where the decision maker interacts with a standard Markov decision process, with the exception that the reward functions vary arbitrarily over time. We show that, against every possible realization of the reward process, the agent can perform as well—in hindsight—as every stationary policy. This generalizes the classical no-regret result for repeated games. Specif...


Sparse Reward Processes

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained during execution of one task has value for the execution of another task. Consequently, the agent is intrinsically motivated to explore its environment beyond the degree necessary to solve the current task it has at han...



Journal title:

Bulletin of the Iranian Mathematical Society

Publisher: Iranian Mathematical Society (IMS)

ISSN 1017-060X

Volume 28

Issue No. 2 2011

Keywords

semi-Markov processes, reward processes, Laplace transform
