A simple and numerically stable primal-dual algorithm for computing Nash-equilibria in sequential games with incomplete information
نویسنده
چکیده
We present a simple primal-dual algorithm for computing approximate Nash equilibria in two-person zero-sum sequential games with incomplete information and perfect recall (like Texas Hold’em poker). Our algorithm only performs basic iterations (i.e matvec multiplications, clipping, etc., and no calls to external first-order oracles, no matrix inversions, etc.) and is applicable to a broad class of two-person zero-sum games including simultaneous games and sequential games with incomplete information and perfect recall. The applicability to the latter kind of games is thanks to the sequence-form representation [10] which allows us to encode any such game as a matrix game with convex polytopial profiles. We prove that the number of iterations needed to produce a Nash equilibrium with a given precision is inversely proportional to the precision. We also present experimental results on simulated and real games (Kuhn poker). keywords: algorithmic-game theory; sequential game; incomplete information; perfect recall; approximate Nash equilibrium; primal-dual algorithm; convex-optimization
منابع مشابه
Computing Approximate Equilibria in Graphical Games on Arbitrary Graphs
We present PureProp: a new constraint satisfaction algorithm for computing pure-strategy approximate Nash equilibria in complete information games. While this seems quite limited in applicability, we show how PureProp unifies existing algorithms for 1) solving a class of complete information graphical games with arbitrary graph structure for approximate Nash equilibria (Kearns et al., 2001; Ort...
متن کاملRepeated games with incomplete information and discounting
We analyze discounted repeated games with incomplete information, such that the players’ payoffs depend only on their own type (known-own payoff case). We describe an algorithm for finding all equilibrium payoffs in games for which there exists an open set of belief-free equilibria of Hörner and Lovo (2009). This includes generic games with one-sided incomplete information and a large and impor...
متن کاملProjected-gradient algorithms for generalized equilibrium seeking in Aggregative Games are preconditioned Forward-Backward methods
We show that projected-gradient methods for the distributed computation of generalized Nash equilibria in aggregative games are preconditioned forward-backward splitting methods applied to the KKT operator of the game. Specifically, we adopt the preconditioned forward-backward design, recently conceived by Yi and Pavel in the manuscript “A distributed primal-dual algorithm for computation of ge...
متن کاملComputing pure Bayesian-Nash equilibria in games with finite actions and continuous types
We extend the well-known fictitious play (FP) algorithm to compute pure-strategy Bayesian-Nash equilibria in private-value games of incomplete information with finite actions and con-tinuous types (G-FACTs). We prove that, if the frequency distribution of actions (fictitiousplay beliefs) converges, then there exists a pure-strategy equilibrium strategy that is con-sistent with i...
متن کاملDiscounted Repeated Games with General Incomplete Information
We analyze discounted repeated games with incomplete information, and such that the payoffs of the players depend only on their own type (known-own payoff case). We describe an algorithm to find all equilibrium payoffs in games for which there exists an open set of belief-free equilibria of Horner and Lovo (2009). This includes generic games with one-sided incomplete information and a large and...
متن کامل