Repeated Sequential Prisoner's Dilemma: The Stackleberg Variant

被引:0
|
作者
Qu, Xing-Long [1 ]
Cao, Zhi-Gang [1 ]
Mu, Yi-Fen [1 ]
Yang, Xiao-Guang [1 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
关键词
Prisoner's Dilemma; sequential repeated game; folk theorem; Markov decision process; FINITE AUTOMATA; REPEATED GAMES; FOLK THEOREM; EVOLUTION; COOPERATION; EXTORTION; INFORMATION; STRATEGY;
D O I
10.1142/S0217595915400096
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We study the Stackleberg variant of the repeated Sequential Prisoner's Dilemma (SPD). The game goes in two stages, and the two players, the leader and the follower, are asymmetric in both stages. In the first stage of the game, the leader chooses a strategy (for the repeated SPD of the second stage), which is immediately known to the follower. In the second stage, they play repeated SPD: In each round the follower moves after observing the leader's action. Assuming complete rationality, we find some extraordinary properties of this model. (i) The (subgame perfect) equilibrium payoff profile is unique, which lies on the corner of the region predicted by classical folk theorems: It is best for the leader and at the same time worst for the follower, (ii) the leader has simple optimal strategies that are one-step memory and stationary. These features are in great contrast with classical results, where either uniqueness cannot be guaranteed and equilibrium strategies are often quite complicated, or bounded rationality is required. Although full cooperation, i.e., the outcome is always (cooperate, cooperate), is not attainable in our model, at least a half of the optimal social welfare can be guaranteed. We also do a non-equilibrium analysis which makes the usual equilibrium analysis more convincing.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Repeated Sequential Prisoner's Dilemma
    Qu Xinglong
    Cao Zhigang
    Mu Ylfen
    Yang Xiaoguang
    [J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 8301 - 8304
  • [2] Observable instability for the repeated prisoner's dilemma
    Mowbray, M
    [J]. APPROXIMATION, OPTIMIZATION AND MATHEMATICAL ECONOMICS, 2001, : 223 - 234
  • [3] Voluntarily Separable Repeated Prisoner's Dilemma
    Fujiwara-Greve, Takako
    Okuno-Fujiwara, Masahiro
    [J]. REVIEW OF ECONOMIC STUDIES, 2009, 76 (03): : 993 - 1021
  • [4] The coevolution of automata in the repeated prisoner's dilemma
    Miller, JH
    [J]. JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 1996, 29 (01) : 87 - 112
  • [5] Optimal partnership in a repeated prisoner's dilemma
    Möller, M
    [J]. ECONOMICS LETTERS, 2005, 88 (01) : 13 - 19
  • [6] COOPERATION IN THE FINITELY REPEATED PRISONER'S DILEMMA
    Embrey, Matthew
    Frechette, Guillaume R.
    Yuksel, Sevgi
    [J]. QUARTERLY JOURNAL OF ECONOMICS, 2018, 133 (01): : 509 - 551
  • [7] The sequential prisoner's dilemma: Evidence on reciprocation
    Clark, K
    Sefton, M
    [J]. ECONOMIC JOURNAL, 2001, 111 (468): : 51 - 68
  • [8] Probability of reciprocation in repeated prisoner's dilemma games
    Baker, F
    Rachlin, H
    [J]. JOURNAL OF BEHAVIORAL DECISION MAKING, 2001, 14 (01) : 51 - 67
  • [9] Private Monitoring and Communication in the Repeated Prisoner's Dilemma
    Awaya, Yu
    [J]. GAMES, 2021, 12 (04):
  • [10] Teaching the repeated prisoner's dilemma with a computerized tournament
    Lange, Carsten
    Baylor, Amy L.
    [J]. JOURNAL OF ECONOMIC EDUCATION, 2007, 38 (04): : 407 - 418