From Markov Chains to Stochastic Games
Abstract
Markov chains and Markov decision processes (MDPs) are special cases of stochastic games. A Markov chain describes the state dynamics of a stochastic game in which each player has a single action in each state. Similarly, the state dynamics of a stochastic game form a Markov chain whenever the players' strategies are stationary. Markov decision processes are stochastic games with a single player. In addition, the decision problem faced by a player in a stochastic game when all other players choose a fixed profile of stationary strategies is equivalent to an MDP. The present chapter states classical results on Markov chains and Markov decision processes. The proofs use methods that introduce the reader to proofs of more general analogous results on stochastic games.
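The two reductions described in the abstract can be illustrated with a small sketch. It assumes a two-player game with finitely many states and actions, stored as a nested list `q[s][a1][a2][t]` giving the probability of moving from state `s` to state `t` under the action pair `(a1, a2)`. The function names `induced_chain` and `induced_mdp`, the data layout, and the toy numbers are all hypothetical, chosen only for illustration; they do not come from the chapter.

```python
from itertools import product


def induced_chain(q, x1, x2):
    """Transition matrix of the Markov chain obtained when both players
    use the stationary strategies x1 and x2 (x[s][a] = prob. of a in s)."""
    n = len(q)
    P = [[0.0] * n for _ in range(n)]
    for s in range(n):
        for a1, a2 in product(range(len(x1[s])), range(len(x2[s]))):
            weight = x1[s][a1] * x2[s][a2]  # prob. of the action pair in s
            for t in range(n):
                P[s][t] += weight * q[s][a1][a2][t]
    return P


def induced_mdp(q, x2):
    """Transition law M[s][a1][t] of the MDP faced by player 1 when
    player 2 is fixed to the stationary strategy x2."""
    n = len(q)
    return [
        [
            [sum(x2[s][a2] * q[s][a1][a2][t] for a2 in range(len(x2[s])))
             for t in range(n)]
            for a1 in range(len(q[s]))
        ]
        for s in range(n)
    ]


# Toy game (assumed numbers): 2 states, 2 actions per player in each state.
q = [
    [[[1.0, 0.0], [0.5, 0.5]],
     [[0.0, 1.0], [0.25, 0.75]]],
    [[[0.0, 1.0], [0.0, 1.0]],
     [[1.0, 0.0], [0.5, 0.5]]],
]
x1 = [[0.5, 0.5], [1.0, 0.0]]  # stationary strategy of player 1
x2 = [[0.5, 0.5], [0.0, 1.0]]  # stationary strategy of player 2

P = induced_chain(q, x1, x2)  # Markov chain on the states
M = induced_mdp(q, x2)        # player 1's MDP against x2
```

Averaging the rows of `M[s]` with the weights `x1[s]` recovers `P[s]`, which is the sense in which a stationary strategy in the induced MDP again yields a Markov chain on the states.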
Similar resources
Stochastic Dynamic Programming with Markov Chains for Optimal Sustainable Control of the Forest Sector with Continuous Cover Forestry
We present a stochastic dynamic programming approach with Markov chains for optimal control of the forest sector. The forest is managed via continuous cover forestry and the complete system is sustainable. Forest industry production, logistic solutions and harvest levels are optimized based on the sequentially revealed states of the markets. Adaptive full system optimization is necessary for co...
Perturbations of Markov Chains with Applications to Stochastic Games
In this lecture we will review several topics that are extensively used in the study of n-player stochastic games. These tools were used in the proof of several results on non-zero-sum stochastic games. Most of the results that are presented here appeared in Vieille (1997a,b), and some appeared in Solan (1998, 1999). The first main issue is Markov chains where the transition rule is a Puiseux p...
Stochastic bounds for a single server queue with general retrial times
We propose to use a mathematical method based on stochastic comparisons of Markov chains in order to derive bounds on performance indices. The main goal of this paper is to investigate various monotonicity properties of a single server retrial queue with first-come-first-served (FCFS) orbit and general retrial times using stochastic ordering techniques.
Obligation Blackwell Games and p-Automata
We recently introduced p-automata, automata that read discrete-time Markov chains, and showed that they provide an automata-theoretic framework for reasoning about pCTL model checking and abstraction of discrete-time Markov chains. We used turn-based stochastic parity games to define acceptance of Markov chains by a special subclass of p-automata. Definition of acceptance required a reduction to a se...
Nonzero-Sum Stochastic Games
This paper extends the basic work that has been done on zero-sum stochastic games to those that are nonzero-sum. Appropriately defined equilibrium points are shown to exist for both the case where the players seek to maximize the total value of their discounted period rewards and the case where they wish to maximize their average reward per period. For the latter case, conditions required on the...