Markov Games with Frequent Actions and Incomplete Information - The Limit Case
نویسندگان
چکیده
We study a two-player, zero-sum, stochastic game with incomplete information on one side in which the players are allowed to play more and more frequently. The informed player observes the realization of a Markov chain on which the payoffs depend, while the non-informed player only observes his opponent’s actions. We show the existence of a limit value as the time span between two consecutive stages vanishes; this value is characterized through an auxiliary optimization problem and as the solution of an Hamilton-Jacobi equation. Key-words: Markov games, incomplete information, zero-sum games, Hamilton-Jacobi equations, repeated games. A.M.S. classification : 91A05, 91A15, 60J10
منابع مشابه
Markov games with frequent actions and incomplete information
We study a two-player, zero-sum, stochastic game with incomplete information on one side in which the players are allowed to play more and more frequently. The informed player observes the realization of a Markov chain on which the payoffs depend, while the non-informed player only observes his opponent’s actions. We show the existence of a limit value as the time span between two consecutive s...
متن کاملUtilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs
Multi agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for Multi agent Reinforcement Learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDP is proposed. In the proposed algorithm, MMDP ...
متن کاملExistence of optimal strategies in Markov games with incomplete information
The existence of a value and optimal strategies is proved for the class of twoperson repeated games where the state follows a Markov chain independently of players’ actions and at the beginning of each stage only player one is informed about the state. The results apply to the case of standard signaling where players’ stage actions are observable, as well as to the model with general signals pr...
متن کاملThe Value of Markov Chain Games with Incomplete Information on Both Sides
We consider zero-sum repeated games with incomplete information on both sides, where the states privately observed by each player follow independent Markov chains. It generalizes the model, introduced by Aumann and Maschler in the sixties and solved by Mertens and Zamir in the seventies, where the private states of the players were fixed. It also includes the model introduced in Renault [20], o...
متن کاملIdentification and Estimation of Incomplete Information Games with Multiple Equilibria∗
The presence of multiple equilibria in games is a big challenge for identification and estimation. Without information of the equilibrium selection, it is impossible to perform counterfactual analysis. Allowing for possibly multiple equilibria, this paper provides nonparametric identification of finite games with incomplete information. Upon observing players’ actions from cross-sectional games...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Math. Oper. Res.
دوره 41 شماره
صفحات -
تاریخ انتشار 2016