The Stackelberg equilibrium for one-sided zero-sum partially observable stochastic games
نویسندگان
چکیده
Formulating cyber-security problems with attackers and defenders as a partially observable stochastic game has become trend recently. Among them, the one-sided two-player zero-sum (OTZ-POSG) emerged popular model because it allows players to compete for multiple stages based on partial knowledge of system. All existing work OTZ-POSG focused simultaneous move scenario assumed that one player’s actions are private in execution process. However, this assumption may questionable since action be detected by opponent through deploying detection strategies. Hence, paper, we propose turn-based public investigate existence properties Stackelberg equilibrium game. We first prove one-stage case show can converted into linear-fractional programming problem therefore solved linear programming. For stages, main challenge is information leakage issue run-time reveals certain achieve more rewards future. To deal issue, adopt concept ε-Stackelberg achieved finite-horizon OTZ-POSGs. space partition approach solve iteratively value function leader piece-wise follower constant stages. Finally, examples given illustrate functions constant.
منابع مشابه
Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games
We study two player pursuit-evasion games with concurrent moves, infinite horizon, and discounted rewards. The players have partial observability, however, the evader is given an advantage of knowing the current position of the units of the pursuer. We show that (1) value functions of this game depend only on the position of the pursuing units and the belief the pursuer has about the position o...
متن کاملDynamic Programming for Partially Observable Stochastic Games
We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable Markov decision processes (POMDPs) and iterative elimination of dominated strategies in normal form games. We prove that it iteratively eliminates very weakly dominated strategies without first forming the normal form r...
متن کاملDefinable Zero-Sum Stochastic Games
Definable zero-sum stochastic games involve a finite number of states and action sets, reward and transition functions that are definable in an o-minimal structure. Prominent examples of such games are finite, semi-algebraic or globally subanalytic stochastic games. We prove that the Shapley operator of any definable stochastic game with separable transition and reward functions is definable in...
متن کاملPlanning for Weakly-Coupled Partially Observable Stochastic Games
Partially observable stochastic games (POSGs) provide a powerful framework for modeling multi-agent interactions. While elegant and expressive, the framework has been shown to be computationally intractable [Bernstein et al., 2002]. An exact dynamic programming algorithm for POSGs has been developed recently, but due to high computational demands, it has only been demonstrated to work on extrem...
متن کاملDynamic Programming Approximations for Partially Observable Stochastic Games
Partially observable stochastic games (POSGs) provide a rich mathematical framework for planning under uncertainty by a group of agents. However, this modeling advantage comes with a price, namely a high computational cost. Solving POSGs optimally quickly becomes intractable after a few decision cycles. Our main contribution is to provide bounded approximation techniques, which enable us to sca...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Automatica
سال: 2022
ISSN: ['1873-2836', '0005-1098']
DOI: https://doi.org/10.1016/j.automatica.2022.110231