Satisficing Paths and Independent Multiagent Reinforcement Learning in Stochastic Games

Cited by: 1
Authors
Yongacoglu, Bora [1]
Arslan, Gurdal [2]
Yuksel, Serdar [1]
Affiliations
[1] Queens Univ, Dept Math & Stat, Kingston, ON, Canada
[2] Univ Hawaii Manoa, Dept Elect Engn, Honolulu, HI 96822 USA
Source
SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE | 2023 / Vol. 5 / Issue 03
Keywords
multiagent reinforcement learning; independent learners; learning in games; stochastic games; decentralized systems; FICTITIOUS PLAY; UNCOUPLED DYNAMICS; CONVERGENCE; SYSTEMS; TEAMS; GO;
DOI
10.1137/22M1515112
Chinese Library Classification (CLC) number
O29 [Applied Mathematics];
Subject classification code
070104;
Abstract
In multiagent reinforcement learning, independent learners are those that do not observe the actions of other agents in the system. Due to the decentralization of information, it is challenging to design independent learners that drive play to equilibrium. This paper investigates the feasibility of using satisficing dynamics to guide independent learners to approximate equilibrium in stochastic games. For $\epsilon \geq 0$, an $\epsilon$-satisficing policy update rule is any rule that instructs the agent to not change its policy when it is $\epsilon$-best-responding to the policies of the remaining players; $\epsilon$-satisficing paths are defined to be sequences of joint policies obtained when each agent uses some $\epsilon$-satisficing policy update rule to select its next policy. We establish structural results on the existence of $\epsilon$-satisficing paths into $\epsilon$-equilibrium in both symmetric N-player games and general stochastic games with two players. We then present an independent learning algorithm for N-player symmetric games and give high probability guarantees of convergence to $\epsilon$-equilibrium under self-play. This guarantee is made using symmetry alone, leveraging the previously unexploited structure of $\epsilon$-satisficing paths.
Pages: 745-773
Page count: 29
Related papers
50 in total
  • [1] A stochastic exploration strategy for satisficing reinforcement learning
    Katayama, S
    Kobayashi, S
    INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 296 - 303
  • [2] PyTAG: Tabletop Games for Multiagent Reinforcement Learning
    Balla, Martin
    Long, George E. M.
    Goodman, James
    Gaina, Raluca D.
    Perez-Liebana, Diego
    IEEE TRANSACTIONS ON GAMES, 2024, 16 (04) : 993 - 1002
  • [3] Multiagent Graphical Games With Inverse Reinforcement Learning
    Donge, Vrushabh S.
    Lian, Bosen
    Lewis, Frank L.
    Davoudi, Ali
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (02) : 841 - 852
  • [4] Independent Deep Deterministic Policy Gradient Reinforcement Learning in Cooperative Multiagent Pursuit Games
    Zhou, Shiyang
    Ren, Weiya
    Ren, Xiaoguang
    Wang, Yanzhen
    Yi, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 625 - 637
  • [5] Stigmergic Independent Reinforcement Learning for Multiagent Collaboration
    Xu, Xing
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 4285 - 4299
  • [6] Peer Incentive Reinforcement Learning for Cooperative Multiagent Games
    Zhang, Tianle
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    IEEE TRANSACTIONS ON GAMES, 2023, 15 (04) : 623 - 636
  • [7] Multiagent reinforcement learning-model in evolutionary games
    Liu, Wei-Bing
    Wang, Xian-Jia
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2009, 29 (03) : 28 - 33
  • [8] Online Reinforcement Learning in Stochastic Games
    Wei, Chen-Yu
    Hong, Yi-Te
    Lu, Chi-Jen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [9] On Passivity, Reinforcement Learning, and Higher Order Learning in Multiagent Finite Games
    Gao, Bolin
    Pavel, Lacra
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (01) : 121 - 136
  • [10] Two-Player Multiagent Graphical Games with Reinforcement Learning
    Lian, Bosen
    Wu, Jiacheng
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS 2024, 2024,