Solving zero-sum one-sided partially observable stochastic games

Authors

Abstract

Many security and other real-world situations are dynamic in nature and can be modelled as strictly competitive (or zero-sum) dynamic games. In these domains, agents perform actions to affect the environment and receive observations (possibly imperfect) about the situation and the effects of the opponent's actions. Moreover, there is no limitation on the total number of actions an agent can perform; that is, there is no fixed horizon. These settings can be modelled as partially observable stochastic games (POSGs). However, solving general POSGs is computationally intractable, so we focus on a broad subclass of POSGs called one-sided POSGs. In these games, only one agent has imperfect information while their opponent has full knowledge of the current situation. We provide a full picture for solving one-sided POSGs: we (1) give a theoretical analysis of one-sided POSGs and their value functions, (2) show that a variant of a value-iteration algorithm converges in this setting, (3) adapt the heuristic search value-iteration algorithm for solving one-sided POSGs, (4) describe how to use approximate value functions to derive strategies in the game, and (5) demonstrate that our algorithm can solve one-sided POSGs of non-trivial sizes and analyze the scalability of our algorithm in three different domains: pursuit-evasion, patrolling, ...

Similar resources

Dynamic Programming for One-sided Partially Observable Pursuit-evasion Games

We study two-player pursuit-evasion games with concurrent moves, infinite horizon, and discounted rewards. The players have partial observability; however, the evader is given the advantage of knowing the current position of the pursuer's units. We show that (1) value functions of this game depend only on the position of the pursuing units and the belief the pursuer has about the position o...
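The belief mentioned in this abstract can be illustrated with a generic Bayesian filter. The following is only a minimal sketch under stated assumptions, not the construction from the paper: the function `update_belief`, the toy model `toy_transition`, the two cells `"L"` and `"R"`, the action names, and the 0.8 detection probability are all hypothetical and chosen for illustration.

```python
from collections import defaultdict


def update_belief(belief, a1, pi2, obs, transition):
    """Bayes update of the uninformed player's belief (illustrative sketch).

    belief:     dict state -> probability
    a1:         action of the uninformed player (the pursuer)
    pi2:        dict state -> dict action -> probability, the assumed
                stage strategy of the informed player (the evader)
    obs:        observation received by the uninformed player
    transition: function (s, a1, a2) -> dict {(obs, s_next): probability}
    Returns (new_belief, probability_of_obs).
    """
    unnormalized = defaultdict(float)
    for s, p_s in belief.items():
        for a2, p_a2 in pi2[s].items():
            for (o, s_next), p_t in transition(s, a1, a2).items():
                if o == obs:
                    unnormalized[s_next] += p_s * p_a2 * p_t
    total = sum(unnormalized.values())
    if total == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return {s: p / total for s, p in unnormalized.items()}, total


def toy_transition(s, a1, a2):
    """Hypothetical toy model: the evader is in cell L or R, the pursuer
    watches cell L and detects an evader there with probability 0.8."""
    other = {"L": "R", "R": "L"}
    s_next = s if a2 == "stay" else other[s]
    if s_next == "L":
        return {("found", "L"): 0.8, ("empty", "L"): 0.2}
    return {("empty", "R"): 1.0}


prior = {"L": 0.5, "R": 0.5}
evader_strategy = {"L": {"stay": 1.0}, "R": {"stay": 0.5, "move": 0.5}}
posterior, p_obs = update_belief(prior, "look_L", evader_strategy, "empty", toy_transition)
```

In this toy setting, seeing nothing shifts probability mass toward cell R, but does not rule out cell L because detection is imperfect.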

Definable Zero-Sum Stochastic Games

Definable zero-sum stochastic games involve a finite number of states and action sets, reward and transition functions that are definable in an o-minimal structure. Prominent examples of such games are finite, semi-algebraic or globally subanalytic stochastic games. We prove that the Shapley operator of any definable stochastic game with separable transition and reward functions is definable in...
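For readers unfamiliar with the Shapley operator named in this abstract, the sketch below applies it to a discounted zero-sum stochastic game. It is an illustration under simplifying assumptions that are not taken from the paper: every state has exactly two actions per player (so the stage game can be solved in closed form), rewards are discounted by a factor `gamma`, and the names `matrix_game_value_2x2`, `shapley_operator`, and the one-state example are hypothetical.

```python
def matrix_game_value_2x2(m):
    """Value of a 2x2 zero-sum matrix game in which the row player maximizes."""
    (a, b), (c, d) = m
    maximin = max(min(a, b), min(c, d))
    minimax = min(max(a, c), max(b, d))
    if maximin == minimax:
        # A pure saddle point exists.
        return maximin
    # Otherwise both players mix; closed form for the 2x2 case.
    return (a * d - b * c) / (a + d - b - c)


def shapley_operator(values, rewards, transitions, gamma):
    """One application of the discounted Shapley operator.

    values:      dict state -> current value estimate
    rewards:     dict state -> 2x2 list of stage rewards
    transitions: dict state -> 2x2 list of dicts {next_state: probability}
    """
    new_values = {}
    for s in values:
        stage = [
            [
                rewards[s][i][j]
                + gamma * sum(p * values[t] for t, p in transitions[s][i][j].items())
                for j in range(2)
            ]
            for i in range(2)
        ]
        new_values[s] = matrix_game_value_2x2(stage)
    return new_values


# Hypothetical one-state game that always returns to itself.
rewards = {"s0": [[2.0, 0.0], [0.0, 1.0]]}
transitions = {"s0": [[{"s0": 1.0}, {"s0": 1.0}], [{"s0": 1.0}, {"s0": 1.0}]]}
values = {"s0": 0.0}
for _ in range(60):
    values = shapley_operator(values, rewards, transitions, gamma=0.5)
```

Because the stage game here has value 2/3 and every action leads back to the same state, repeated application converges to the fixed point v = 2/3 + 0.5 v, that is v = 4/3.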

Dynamic Programming for Partially Observable Stochastic Games

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable Markov decision processes (POMDPs) and iterative elimination of dominated strategies in normal form games. We prove that it iteratively eliminates very weakly dominated strategies without first forming the normal form r...
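The elimination step named in this abstract can be illustrated on a plain two-player normal-form game. This is a simplified sketch, not the paper's algorithm: it checks dominance by pure strategies only and works on an explicit payoff matrix, whereas the abstract describes eliminating strategies without first forming the normal form. The function names and the prisoner's dilemma payoffs are hypothetical.

```python
def very_weakly_dominated(payoff, own, other):
    """Return a strategy in `own` that is very weakly dominated by another
    pure strategy in `own`, or None if there is none.

    payoff(i, j) is this player's payoff for own strategy i against
    opponent strategy j. Very weak dominance needs no strict inequality.
    """
    for a in own:
        for b in own:
            if a != b and all(payoff(b, j) >= payoff(a, j) for j in other):
                return a
    return None


def iterated_elimination(u_row, u_col):
    """Remove very weakly dominated pure strategies one at a time.

    u_row[i][j] and u_col[i][j] are the row and column player's payoffs.
    Returns the surviving row and column indices.
    """
    rows = list(range(len(u_row)))
    cols = list(range(len(u_row[0])))
    while True:
        r = very_weakly_dominated(lambda i, j: u_row[i][j], rows, cols)
        if r is not None:
            rows.remove(r)
            continue
        c = very_weakly_dominated(lambda j, i: u_col[i][j], cols, rows)
        if c is not None:
            cols.remove(c)
            continue
        return rows, cols


# Prisoner's dilemma: index 0 = cooperate, index 1 = defect.
u_row = [[-1, -3], [0, -2]]
u_col = [[-1, 0], [-3, -2]]
surviving = iterated_elimination(u_row, u_col)
```

Removing one strategy per pass matters here: under very weak dominance two identical strategies dominate each other, and removing both at once would empty the strategy set.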

Planning for Weakly-Coupled Partially Observable Stochastic Games

Partially observable stochastic games (POSGs) provide a powerful framework for modeling multi-agent interactions. While elegant and expressive, the framework has been shown to be computationally intractable [Bernstein et al., 2002]. An exact dynamic programming algorithm for POSGs has been developed recently, but due to high computational demands, it has only been demonstrated to work on extrem...

Dynamic Programming Approximations for Partially Observable Stochastic Games

Partially observable stochastic games (POSGs) provide a rich mathematical framework for planning under uncertainty by a group of agents. However, this modeling advantage comes with a price, namely a high computational cost. Solving POSGs optimally quickly becomes intractable after a few decision cycles. Our main contribution is to provide bounded approximation techniques, which enable us to sca...

Journal

Journal title: Artificial Intelligence

Year: 2023

ISSN: 2633-1403

DOI: https://doi.org/10.1016/j.artint.2022.103838