Repeated Sequential Prisoner's Dilemma: The Stackleberg Variant

被引：0

作者：

Qu, Xing-Long ^{[1
]}

Cao, Zhi-Gang ^{[1
]}

Mu, Yi-Fen ^{[1
]}

Yang, Xiao-Guang ^{[1
]}

机构：

[1] Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China

来源：

ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH | 2015年 / 32卷 / 01期

关键词：

Prisoner's Dilemma; sequential repeated game; folk theorem; Markov decision process; FINITE AUTOMATA; REPEATED GAMES; FOLK THEOREM; EVOLUTION; COOPERATION; EXTORTION; INFORMATION; STRATEGY;

D O I：

10.1142/S0217595915400096

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

We study the Stackleberg variant of the repeated Sequential Prisoner's Dilemma (SPD). The game goes in two stages, and the two players, the leader and the follower, are asymmetric in both stages. In the first stage of the game, the leader chooses a strategy (for the repeated SPD of the second stage), which is immediately known to the follower. In the second stage, they play repeated SPD: In each round the follower moves after observing the leader's action. Assuming complete rationality, we find some extraordinary properties of this model. (i) The (subgame perfect) equilibrium payoff profile is unique, which lies on the corner of the region predicted by classical folk theorems: It is best for the leader and at the same time worst for the follower, (ii) the leader has simple optimal strategies that are one-step memory and stationary. These features are in great contrast with classical results, where either uniqueness cannot be guaranteed and equilibrium strategies are often quite complicated, or bounded rationality is required. Although full cooperation, i.e., the outcome is always (cooperate, cooperate), is not attainable in our model, at least a half of the optimal social welfare can be guaranteed. We also do a non-equilibrium analysis which makes the usual equilibrium analysis more convincing.

引用

页数：23

共 50 条

[21] Limit Cycles Sparked by Mutation in the Repeated Prisoner's Dilemma
Toupo, Danielle F. P.
Rand, David G.
Strogatz, Steven H.
INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2014, 24 (12):
[22] Voluntarily separable repeated Prisoner's Dilemma with reference letters
Fujiwara-Greve, Takako
Okuno-Fujiwara, Masahiro
Suzuki, Nobue
GAMES AND ECONOMIC BEHAVIOR, 2012, 74 (02) : 504 - 516
[23] Evolutionary stability in the finitely repeated prisoner's dilemma game
Cressman, R
JOURNAL OF ECONOMIC THEORY, 1996, 68 (01) : 234 - 248
[24] Repeated Prisoner's Dilemma: Stackelberg solution with finite memory
Kleimenov, AF
Semenishchev, AA
CONTROL APPLICATIONS OF OPTIMIZATION 2000, VOLS 1 AND 2, 2000, : 567 - 572
[25] Constructing strategies in the indefinitely repeated prisoner's dilemma game
Romero, Julian
Rosokha, Yaroslav
EUROPEAN ECONOMIC REVIEW, 2018, 104 : 185 - 219
[26] Dynamical Systems Associated with the -Core in the Repeated Prisoner's Dilemma
Plaskacz, Slawomir
Zwierzchowska, Joanna
DYNAMIC GAMES AND APPLICATIONS, 2019, 9 (01) : 217 - 235
[27] Perceptron versus automaton in the finitely repeated prisoner’s dilemma
Sylvain Béal
Theory and Decision, 2010, 69 : 183 - 204
[28] Evolutionarily stable strategy distributions for the repeated prisoner's dilemma
Mowbray, M
JOURNAL OF THEORETICAL BIOLOGY, 1997, 187 (02) : 223 - 229
[29] Motives behind cooperation in finitely repeated prisoner's dilemma
Chakraborty, Anujit
GAMES AND ECONOMIC BEHAVIOR, 2023, 141 : 105 - 132
[30] The Independent Localisations of Interaction and Learning in the Repeated Prisoner's Dilemma
Robert Hoffmann
Theory and Decision, 1999, 47 : 57 - 72

← 1 2 3 4 5 →