Emergent Communication: Generalization and Overfitting in Lewis Games

被引:0
|
作者
Rita, Mathieu [1 ]
Tallec, Corentin [2 ]
Michel, Paul [2 ]
Grill, Jean-Bastien [2 ]
Pietquin, Olivier [3 ]
Dupoux, Emmanuel [4 ,5 ]
Strub, Florian [2 ]
机构
[1] INRIA, Paris, France
[2] DeepMind, London, England
[3] Google Res, Brain Team, Mountain View, CA USA
[4] INRIA, CNRS, EHESS, ENS PSL, Paris, France
[5] Meta AI Res, New York, NY USA
基金
欧洲研究理事会;
关键词
LANGUAGE EVOLUTION; COMPRESSION; DYNAMICS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lewis signaling games are a class of simple communication games for simulating the emergence of language. In these games, two agents must agree on a communication protocol in order to solve a cooperative task. Previous work has shown that agents trained to play this game with reinforcement learning tend to develop languages that display undesirable properties from a linguistic point of view (lack of generalization, lack of compositionality, etc). In this paper, we aim to provide better understanding of this phenomenon by analytically studying the learning problem in Lewis games. As a core contribution, we demonstrate that the standard objective in Lewis games can be decomposed in two components: a co-adaptation loss and an information loss. This decomposition enables us to surface two potential sources of overfitting, which we show may undermine the emergence of a structured communication protocol. In particular, when we control for overfitting on the co-adaptation loss, we recover desired properties in the emergent languages: they are more compositional and generalize better.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] A GENERALIZATION OF THE BACKWARD INDUCTION METHOD IN ALIGNED GAMES
    Chakrabarti, Subir
    Topolyan, Iryna
    JOURNAL OF DYNAMICS AND GAMES, 2024,
  • [42] MULTISTAGE GAMES WITH COMMUNICATION
    MYERSON, RB
    ECONOMETRICA, 1986, 54 (02) : 323 - 358
  • [43] A generalization of the CIS value for cooperative cost games
    Hou, Dongshuang
    Han, Weibin
    Xu, Genjiu
    Feng, Yifan
    4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH, 2024, 22 (01): : 17 - 30
  • [44] Convex linear generalization of random coalition games
    Zhang, SK
    Zhang, YD
    Yan, HZ
    Wang, QY
    CHINESE SCIENCE BULLETIN, 1998, 43 (09): : 713 - 716
  • [45] Communication in ultimatum games
    Rankin, FW
    ECONOMICS LETTERS, 2003, 81 (02) : 267 - 271
  • [46] Convex linear generalization of random coalition games
    ZHANG Shengkai+1
    2. North-East University
    ChineseScienceBulletin, 1998, (09) : 713 - 716
  • [47] Generalization of conception for saddle point in antagonistic games
    Smol'yakov, ER
    DOKLADY AKADEMII NAUK, 1999, 366 (01) : 21 - 23
  • [48] Regularization, early-stopping and dreaming: A Hopfield-like setup to address generalization and overfitting
    Agliari, E.
    Alemanno, F.
    Aquaro, M.
    Fachechi, A.
    NEURAL NETWORKS, 2024, 177
  • [49] ON THE CONVEXITY OF COMMUNICATION GAMES
    VANDENNOUWELAND, A
    BORM, P
    INTERNATIONAL JOURNAL OF GAME THEORY, 1991, 19 (04) : 421 - 430
  • [50] A GENERALIZATION OF AN ODDNESS-THEOREM FOR BIMATRIX GAMES
    MEISTER, H
    OR SPEKTRUM, 1984, 6 (04) : 217 - 222