Emergent Communication: Generalization and Overfitting in Lewis Games

Citations: 0

Authors:

Rita, Mathieu [1]

Tallec, Corentin [2]

Michel, Paul [2]

Grill, Jean-Bastien [2]

Pietquin, Olivier [3]

Dupoux, Emmanuel [4, 5]

Strub, Florian [2]

Affiliations:

[1] INRIA, Paris, France

[2] DeepMind, London, England

[3] Google Research, Brain Team, Mountain View, CA, USA

[4] INRIA, CNRS, EHESS, ENS PSL, Paris, France

[5] Meta AI Research, New York, NY, USA

Source:

Advances in Neural Information Processing Systems 35 (NeurIPS 2022) | 2022

Funding:

European Research Council;

Keywords:

LANGUAGE EVOLUTION; COMPRESSION; DYNAMICS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Lewis signaling games are a class of simple communication games for simulating the emergence of language. In these games, two agents must agree on a communication protocol in order to solve a cooperative task. Previous work has shown that agents trained to play this game with reinforcement learning tend to develop languages that display undesirable properties from a linguistic point of view (lack of generalization, lack of compositionality, etc.). In this paper, we aim to provide a better understanding of this phenomenon by analytically studying the learning problem in Lewis games. As a core contribution, we demonstrate that the standard objective in Lewis games can be decomposed into two components: a co-adaptation loss and an information loss. This decomposition enables us to surface two potential sources of overfitting, which we show may undermine the emergence of a structured communication protocol. In particular, when we control for overfitting on the co-adaptation loss, we recover desired properties in the emergent languages: they are more compositional and generalize better.


Pages: 16
