Risk-sensitive and minimax control of discrete-time, finite-state Markov decision processes
نویسندگان
چکیده
This paper analyzes a connection between risk-sensitive and minimax criteria for discrete-time, nite-states Markov Decision Processes (MDPs). We synthesize optimal policies with respect to both criteria, both for nite horizon and discounted in nite horizon problem. A generalized decision-making framework is introduced, which includes as special cases a number of approaches that have been considered in the literature. The framework allows for discounted risk-sensitive and minimax formulations leading to stationary optimal policies on the in nite horizon. We illustrate our results with a simple machine replacement problem.
منابع مشابه
Mixed risk-neutral/minimax control of discrete-time, finite-state Markov decision processes
This paper addresses the control design problem for discrete-time, nite-state Markov Decision Processes (MDPs), when both risk-neutral and minimax objectives are of interest. We introduce the mixed risk-neutral/minimax objective, and utilize results from risk-neutral and minimax control to derive an information state process and dynamic programming equations for the value function. We synthesiz...
متن کاملRisk - Sensitive , Minimax , and Mixed Risk - Neutral / Minimax Control of Markov Decision Processes
This paper analyzes a connection between risk-sensitive and minimax criteria for discrete-time, nite-state Markov Decision Processes (MDPs). We synthesize optimal policies with respect to both criteria, both for nite horizon and discounted in nite horizon problems. A generalized decision-making framework is introduced, leading to stationary risk-sensitive and minimax optimal policies on the in ...
متن کاملMixed Risk-Neutral/Minimax Control of Markov Decision Processes
This paper introduces a formulation of the mixed risk-neutral/minimax control problem for Markov Decision Processes (MDPs). Drawing on results from risk-neutral control and minimax control, we derive an information state process and dynamic programming equations for the value function. Furthermore, we develop a methodology to synthesize an optimal control law on the nite horizon, and a near-opt...
متن کاملRisk sensitive control of finite state Markov chains in discrete time, with applications to portfolio management
In this paper we extend standard dynamic programming results for the risk sensitive optimal control of discrete time Markov chains to a new class of models. The state space is only ®nite, but now the assumptions about the Markov transition matrix are much less restrictive. Our results are then applied to the ®nancial problem of managing a portfolio of assets which are a ̈ected by Markovian micro...
متن کاملRisk-Sensitive and Average Optimality in Markov Decision Processes
Abstract. This contribution is devoted to the risk-sensitive optimality criteria in finite state Markov Decision Processes. At first, we rederive necessary and sufficient conditions for average optimality of (classical) risk-neutral unichain models. This approach is then extended to the risk-sensitive case, i.e., when expectation of the stream of one-stage costs (or rewards) generated by a Mark...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Automatica
دوره 35 شماره
صفحات -
تاریخ انتشار 1999