Rationality of reward sharing in multi-agent reinforcement learning

Authors
Kazuteru Miyazaki
Shigenobu Kobayashi
Affiliations
[1] National Institution for Academic Degrees
[2] Tokyo Institute of Technology
Source
New Generation Computing | 2001 / Vol. 19, No. 2
Keywords
Reinforcement Learning; Multi-agent System; Profit Sharing; Rationality Theorem; Direct and Indirect Rewards
DOI
Not available
Abstract
In multi-agent reinforcement learning systems, it is important to share a reward among all agents. We focus on the Rationality Theorem of Profit Sharing 5) and analyze how to share a reward among all profit sharing agents. When an agent gets a direct reward R (R > 0), an indirect reward μR (μ ≥ 0) is given to the other agents. We have derived the necessary and sufficient condition to preserve rationality as follows:

$$\mu < \frac{M - 1}{M^{W} \left(1 - \left(\tfrac{1}{M}\right)^{W_o}\right)(n - 1)L}$$

where M and L are the maximum numbers of all conflicting rules and of conflicting rational rules, respectively, in the same sensory input; W and W_o are the maximum episode lengths of a direct-reward agent and an indirect-reward agent, respectively; and n is the number of agents. This theorem is derived by avoiding the least desirable situation, in which the expected reward per action is zero. Therefore, by using this theorem, we can benefit from several efficient aspects of reward sharing. Through numerical examples, we confirm the effectiveness of this theorem.
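
As a rough illustration of the condition stated in the abstract, the following Python sketch evaluates the right-hand side of the inequality and checks whether a given μ stays below it. The function names, the input checks (M ≥ 2, n ≥ 2, and so on), and the example values are assumptions made for this sketch; they are not taken from the paper.

```python
def mu_upper_bound(M: int, L: int, W: int, W_o: int, n: int) -> float:
    """Right-hand side of the abstract's inequality (hypothetical helper).

    M   : maximum number of all conflicting rules in one sensory input
    L   : maximum number of conflicting rational rules in one sensory input
    W   : maximum episode length of a direct-reward agent
    W_o : maximum episode length of an indirect-reward agent
    n   : number of agents
    """
    # Assumption of this sketch: M >= 2 and n >= 2 keep the denominator
    # non-zero; the paper may state its own preconditions.
    if M < 2 or L < 1 or W < 1 or W_o < 1 or n < 2:
        raise ValueError("expected M >= 2, L >= 1, W >= 1, W_o >= 1, n >= 2")
    denominator = M**W * (1 - (1 / M) ** W_o) * (n - 1) * L
    return (M - 1) / denominator


def preserves_rationality(mu: float, M: int, L: int, W: int, W_o: int, n: int) -> bool:
    """True when 0 <= mu and mu is strictly below the bound."""
    return 0 <= mu < mu_upper_bound(M, L, W, W_o, n)


def indirect_reward(R: float, mu: float) -> float:
    """Indirect reward mu * R given to the other agents for a direct reward R > 0."""
    return mu * R


# Illustrative values only (not from the paper's numerical examples).
bound = mu_upper_bound(M=2, L=1, W=3, W_o=3, n=3)
print(bound)
print(preserves_rationality(0.05, M=2, L=1, W=3, W_o=3, n=3))
print(preserves_rationality(0.10, M=2, L=1, W=3, W_o=3, n=3))
```

With these illustrative values the denominator is 8 × 0.875 × 2 × 1 = 14, so the bound is 1/14: μ = 0.05 satisfies the condition and μ = 0.10 does not. Because M^W appears in the denominator, the admissible μ shrinks quickly as the direct-reward agent's episode length grows.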
Pages: 157-172
Number of pages: 15