A Design of Reward Function Based on Knowledge in Multi-agent Learning

被引:0
|
作者
Fan, Bo [1 ]
Pu, Jiexin [1 ]
机构
[1] Henan Univ Sci & Technol, Elect Informat Engn Coll, Luoyang 471003, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The design of reward function is the key to build reinforcement learning system. With the analysis and research of the reinforcement learning and Markov games, an improved reward function is presented, which includes both the goal information based on task and learner's action information based on its domain knowledge. According with this reinforcement function, reinforcement learning integrates the external environment reward and the internal behavior reward so that learner can perform better. The results of the experiments illuminates the reward function involving domain knowledge is better than the traditional reward function in application.
引用
收藏
页码:596 / 603
页数:8
相关论文
共 50 条
  • [1] Reward shaping for knowledge-based multi-objective multi-agent reinforcement learning
    Mannion, Patrick
    Devlin, Sam
    Duggan, Jim
    Howley, Enda
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2018, 33
  • [2] Reward design for multi-agent reinforcement learning with a penalty based on the payment mechanism
    Matsunami, Natsuki
    Okuhara, Shun
    Ito, Takayuki
    [J]. Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (05)
  • [3] Reward design for driver repositioning using multi-agent reinforcement learning
    Shou, Zhenyu
    Di, Xuan
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 119
  • [4] Multi-Agent Reinforcement Learning with Reward Delays
    Zhang, Yuyang
    Zhang, Runyu
    Gu, Yuantao
    Li, Na
    [J]. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
  • [5] Direct reward and indirect reward in multi-agent reinforcement learning
    Ohta, M
    [J]. ROBOCUP 2002: ROBOT SOCCER WORLD CUP VI, 2003, 2752 : 359 - 366
  • [6] Direct reward and indirect reward in multi-agent reinforcement learning
    [J]. Ohta, M. (ohta@carc.aist.go.jp), (Springer Verlag):
  • [7] Role differentiation process by division of reward function in multi-agent reinforcement learning
    Taniguchi, Tadahiro
    Tabuchi, Kazuma
    Sawaragi, Tetsuo
    [J]. 2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 358 - +
  • [8] Plan-based reward shaping for multi-agent reinforcement learning
    Devlin, Sam
    Kudenko, Daniel
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2016, 31 (01): : 44 - 58
  • [9] Function approximation based multi-agent reinforcement learning
    Abul, O
    Polat, F
    Alhajj, R
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, : 36 - 39
  • [10] Releasing source locating based on Multi-Agent Reinforcement Learning with reward function designed by maximum entropy
    Wang, Zhi-Pu
    Zeng, Guang-Rong
    Deng, Lie-Wei
    Cao, Wang
    Guo, Yao
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4688 - 4693