Zero - Sum Games with Vector - Valued Payoffs
نویسنده
چکیده
In this lecture we formulate and prove the celebrated approachability theorem of Blackwell, which extends von Neumann's minimax theorem to zero-sum games with vector-valued payoffs [1]. (The proof here is based on the presentation in [2]; a similar presentation was given by Foster and Vohra [3].) This theorem is powerful in its own right, but also has significant implications for regret minimization; as we will see in the next lecture, the algorithmic insight behind Blackwell's theorem can be used to easily develop both external and internal regret minimizing algorithms. We first define two-player zero-sum games with vector-valued payoffs. Each player i has an action space A i (assumed to be finite). In a vector-valued game, the payoff to player 1 when the action pair (a 1 , a 2) is played is Π(a 1 , a 2) ∈ R K , for some finite K; that is, the payoff to player 1 is a vector. Similarly, the payoff to player 2 is −Π(a 1 , a 2). We use similar notation as earlier lectures: i.e., we let Π(s 1 , s 2) denote the expected payoff to player 1 when each player i uses mixed action s i ∈ ∆(A i). We will typically view s i as a vector in R A i , with s i (a i) equal to the probability that player i places on a i. The game is played repeatedly by the players. We use s t i to denote the mixed action chosen by player i at time t, and we let a t i denote the actual action played by player i at time t. We let h T = (a 0 ,. .. , a T −1) denote the history of the actual play up to time T. We assume that the payoffs all lie in the unit ball (with respect to the standard Euclidean norm): Π(a 1 , a 2) ≤ 1 for all a 1 , a 2. Since action spaces are finite, this just amounts to a rescaling of payoffs for analytical simplicity. 2 Approachability We first develop approachability in the scalar payoff setting. We then generalize to halfspaces in the vector-valued payoff setting, and finally state Blackwell's theorem for approachability of general convex sets. We first develop the notion of approachability in the one-dimensional (i.e., scalar payoff) setting, where K = 1. In this case players 1 …
منابع مشابه
A TRANSITION FROM TWO-PERSON ZERO-SUM GAMES TO COOPERATIVE GAMES WITH FUZZY PAYOFFS
In this paper, we deal with games with fuzzy payoffs. We proved that players who are playing a zero-sum game with fuzzy payoffs against Nature are able to increase their joint payoff, and hence their individual payoffs by cooperating. It is shown that, a cooperative game with the fuzzy characteristic function can be constructed via the optimal game values of the zero-sum games with fuzzy payoff...
متن کاملAn interval-valued programming approach to matrix games with payoffs of triangular intuitionistic fuzzy numbers
The purpose of this paper is to develop a methodology for solving a new type of matrix games in which payoffs are expressed with triangular intuitionistic fuzzy numbers (TIFNs). In this methodology, the concept of solutions for matrix games with payoffs of TIFNs is introduced. A pair of auxiliary intuitionistic fuzzy programming models for players are established to determine optimal strategies...
متن کاملInterval Fuzzy Linear Programming Models to Solve Interval-valued Fuzzy Zero-Sum Games
In Game theory, there are situations in which it is very difficult to characterize the private information of each player. In this case, the payoffs can be given by approximate values, represented by fuzzy numbers. Whenever there is uncertainty in the modeling of those fuzzy numbers, interval fuzzy numbers may be used. This paper introduces two approaches for the solution of interval-valued fuz...
متن کاملAn interval matrix game and its extensions to fuzzy and stochastic games
In this paper, we consider an interval matrix game with interval valued payoffs, which is the generation of the traditional matrix game. The “saddle-points”of this interval matrix game are defined and characterized as equilibrium points of corresponding non-zero sum parametric games. Numerical examples are given to illustrate our idea. These results are extended to the fuzzy matrix games. Also,...
متن کاملRepeated Games with Incomplete Information on One Side: The Case of Different Discount Factors
Two players engage in a repeated game with incomplete information on one side, where the underlying stage-games are zero-sum. In the case where players evaluate their stage-payoffs by using different discount factors, the payoffs of the infinitely repeated game are typically non zero-sum. However, if players grow infinitely patient, then the equilibrium payoffs will sometimes approach the zero-...
متن کامل