A new consequence of Simpson’s paradox: Stable co-operation in one-shot Prisoner’s Dilemma from populations of individualistic learning agents

Authors

Nick Chater

Ivo Vlaev

Maurice Grinberg

Abstract

Normative theories of individual choice in economics typically assume that interacting agents should each act individualistically, i.e., they should maximize their own utility function. Specifically, game theory proposes that interaction should be governed by Nash equilibria. Computationally limited agents (whether artificial, animal or human) may not, however, have the capacity to carry out the sophisticated reasoning needed to converge directly on Nash equilibria. Nonetheless, it is often assumed that Nash equilibria will be reached in any case if agents embody simple learning algorithms such as reinforcement learning. If so, learners should converge on Nash equilibria after sufficient practice in playing a game, and hence, for example, individualistic agents should end up playing D (defect) in one-shot Prisoner’s Dilemmas (PD). In an experiment and in a multi-agent simulation, we show, however, that this is not always the case: under certain circumstances, reinforcement learners can converge on co-operative behaviour in PD. That is, even though each agent would receive a higher pay-off from switching to D, agents obtain more reinforcement, on average, from playing C (co-operate), and hence C is more strongly reinforced. This effect arises from a well-known statistical paradox, Simpson’s paradox. We speculate that this effect may be relevant to some aspects of real-world human co-operative behaviour.
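The claim that C can earn more reinforcement on average even though D pays more in every individual game can be illustrated with a small arithmetic sketch. Everything in it is an assumption made for illustration, not taken from the paper: the payoff values `T, R, P, S`, the two hypothetical "contexts", and the premise that an agent's own tendency to play C is correlated with how often its opponents play C.

```python
# Hypothetical PD payoffs with T > R > P > S (illustrative values only).
T, R, P, S = 5.0, 3.0, 1.0, 0.0


def expected_payoff(move, q):
    """Expected payoff of `move` when the opponent plays C with probability q."""
    if move == "C":
        return q * R + (1 - q) * S
    return q * T + (1 - q) * P


# Assumed contexts: (how often the context occurs,
#                    probability the opponent plays C,
#                    probability the agent itself plays C)
contexts = [(0.5, 0.9, 0.9), (0.5, 0.1, 0.1)]


def pooled_average(move):
    """Average payoff over all occasions on which `move` was actually played."""
    num = den = 0.0
    for weight, q, p_c in contexts:
        p_move = p_c if move == "C" else 1 - p_c
        num += weight * p_move * expected_payoff(move, q)
        den += weight * p_move
    return num / den


# Within every context, D pays strictly more than C ...
for _, q, _ in contexts:
    assert expected_payoff("D", q) > expected_payoff("C", q)

# ... yet pooled over contexts, C has the higher average payoff.
print(pooled_average("C"), pooled_average("D"))
```

Under these assumed numbers the pooled averages come out at roughly 2.46 for C and 1.72 for D. A learner that reinforces moves by their pooled average payoff would therefore strengthen C, although switching to D would pay more in either context taken separately.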

Similar resources

Simpson’s Paradox Can Emerge from the N-Player Prisoner’s Dilemma: Implications for the Evolution of Altruistic Behavior

Simulations of the n-player Prisoner’s Dilemma in multiple populations reveal that Simpson’s paradox can emerge in such game-theoretic situations. The relative proportion of cooperators can decrease in each separate sub-population, while the proportion of cooperators in the total population can nonetheless increase, at least transiently. Factors that determine the longevity of this effect are u...
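The pattern described here, a falling share of cooperators in every sub-population alongside a rising share overall, can be checked with a few lines of arithmetic. The group sizes and cooperator counts below are made-up numbers chosen only to show the effect; they assume that the cooperator-rich group grows faster than the other one.

```python
from fractions import Fraction

# Hypothetical (cooperators, group size) before and after one round of growth.
before = {"group_1": (90, 100), "group_2": (10, 100)}
after = {"group_1": (170, 200), "group_2": (9, 100)}


def share(cooperators, size):
    """Exact proportion of cooperators in one group."""
    return Fraction(cooperators, size)


def pooled_share(groups):
    """Exact proportion of cooperators across all groups combined."""
    total_c = sum(c for c, _ in groups.values())
    total_n = sum(n for _, n in groups.values())
    return Fraction(total_c, total_n)


for name in before:
    print(name, share(*before[name]), "->", share(*after[name]))
print("total", pooled_share(before), "->", pooled_share(after))
```

In this sketch the share drops from 9/10 to 17/20 in the first group and from 1/10 to 9/100 in the second, while the pooled share rises from 1/2 to 179/300, because the cooperator-rich group now makes up a larger part of the total population.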


Group Biases in the Prisoner's Dilemma Game: Looking for Simpson's Paradox Effects on Cooperation

The role of Simpson’s paradox effects on cooperation in one-shot intra- and inter-group Prisoner’s Dilemma games is explored. Three experimental conditions are considered in a between-subject design – a group that plays games only within the group (intra-group condition), a group that plays games only with members of the other group (inter-group condition), and a group which plays a combination of...


N-Player Prisoner’s Dilemma in Multiple Groups: A Model of Multilevel Selection

Simulations of the n-player Prisoner’s Dilemma (PD) in populations consisting of multiple groups reveal that Simpson’s paradox (1951) can emerge in such game-theoretic situations. In Simpson’s paradox, as manifest here, the relative proportion of cooperators can decrease in each separate group, while the proportion of cooperators in the total population can nonetheless increase, at least transie...


A non-cooperative Pareto-efficient solution to a single-shot Prisoner's Dilemma

The Prisoner’s Dilemma is a simple model that captures the essential contradiction between individual rationality and global rationality. Although the one-shot Prisoner’s Dilemma is usually viewed as simple, in this paper we categorize it into five different types. For the type-4 Prisoner’s Dilemma game, we propose a self-enforcing algorithmic model to help non-cooperative agents obtain P...


Modeling Cooperation between Nodes in Wireless Networks by APD Game

Cooperation is the foundation of many protocols in wireless networks. Without cooperation, the performance of a network significantly decreases. Hence, all nodes in traditional networks are required to cooperate with each other. In this paper, instead of traditional networks, a network of rational and autonomous nodes is considered, which means that each node itself can decide whe...


Publication date: 2006
