Risk-sensitive and minimax control of discrete-time, finite-state Markov decision processes

نویسندگان

Stefano P. Coraluppi

Steven I. Marcus

چکیده

This paper analyzes a connection between risk-sensitive and minimax criteria for discrete-time, nite-states Markov Decision Processes (MDPs). We synthesize optimal policies with respect to both criteria, both for nite horizon and discounted in nite horizon problem. A generalized decision-making framework is introduced, which includes as special cases a number of approaches that have been considered in the literature. The framework allows for discounted risk-sensitive and minimax formulations leading to stationary optimal policies on the in nite horizon. We illustrate our results with a simple machine replacement problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mixed risk-neutral/minimax control of discrete-time, finite-state Markov decision processes

This paper addresses the control design problem for discrete-time, nite-state Markov Decision Processes (MDPs), when both risk-neutral and minimax objectives are of interest. We introduce the mixed risk-neutral/minimax objective, and utilize results from risk-neutral and minimax control to derive an information state process and dynamic programming equations for the value function. We synthesiz...

متن کامل

Risk - Sensitive , Minimax , and Mixed Risk - Neutral / Minimax Control of Markov Decision Processes

This paper analyzes a connection between risk-sensitive and minimax criteria for discrete-time, nite-state Markov Decision Processes (MDPs). We synthesize optimal policies with respect to both criteria, both for nite horizon and discounted in nite horizon problems. A generalized decision-making framework is introduced, leading to stationary risk-sensitive and minimax optimal policies on the in ...

متن کامل

Mixed Risk-Neutral/Minimax Control of Markov Decision Processes

This paper introduces a formulation of the mixed risk-neutral/minimax control problem for Markov Decision Processes (MDPs). Drawing on results from risk-neutral control and minimax control, we derive an information state process and dynamic programming equations for the value function. Furthermore, we develop a methodology to synthesize an optimal control law on the nite horizon, and a near-opt...

متن کامل

Risk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management

In this paper we extend standard dynamic programming results for the risk sensitive optimal control of discrete time Markov chains to a new class of models. The state space is only ®nite, but now the assumptions about the Markov transition matrix are much less restrictive. Our results are then applied to the ®nancial problem of managing a portfolio of assets which are a ̈ected by Markovian micro...

متن کامل

Risk-Sensitive and Average Optimality in Markov Decision Processes

Abstract. This contribution is devoted to the risk-sensitive optimality criteria in finite state Markov Decision Processes. At first, we rederive necessary and sufficient conditions for average optimality of (classical) risk-neutral unichain models. This approach is then extended to the risk-sensitive case, i.e., when expectation of the stream of one-stage costs (or rewards) generated by a Mark...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Automatica

دوره 35 شماره

صفحات -

تاریخ انتشار 1999

Risk-sensitive and minimax control of discrete-time, finite-state Markov decision processes

نویسندگان

چکیده

منابع مشابه

Mixed risk-neutral/minimax control of discrete-time, finite-state Markov decision processes

Risk - Sensitive , Minimax , and Mixed Risk - Neutral / Minimax Control of Markov Decision Processes

Mixed Risk-Neutral/Minimax Control of Markov Decision Processes

Risk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management

Risk-Sensitive and Average Optimality in Markov Decision Processes

عنوان ژورنال:

اشتراک گذاری