Cooperative behavior of agents that model the other and the self in noisy iterated prisoners' dilemma simulation

被引:0
|
作者
Makino, T [1 ]
Aihara, K [1 ]
机构
[1] Univ Tokyo, Inst Ind Sci, Meguro Ku, Tokyo 1538505, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We developed self-learning for simulation study of mutual understanding between peer agents. We designed them to use various types of co-player models and a reinforcement-learning algorithm to learn to play a noisy iterated Prisoners' Dilemma game so that the pay-off for the agent itself is maximized. We measured the mutual-modeling ability of each type of agent in terms of cooperative behavior when playing with another equivalent agent. We observed that agents with a complex co-player model, which includes a model of the agent itself, showed higher cooperation than agents with a simpler co-player model only. Moreover, in low-noise environments, Level-M agent, which develops equivalent models of the self and the other, showed higher cooperation than other types of agents. These results suggest the importance of "self-observation" in the design of communicative agents.
引用
收藏
页码:52 / 57
页数:6
相关论文
共 46 条