A Design of Reward Function Based on Knowledge in Multi-agent Learning

被引:0
|
作者
Fan, Bo [1 ]
Pu, Jiexin [1 ]
机构
[1] Henan Univ Sci & Technol, Elect Informat Engn Coll, Luoyang 471003, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The design of reward function is the key to build reinforcement learning system. With the analysis and research of the reinforcement learning and Markov games, an improved reward function is presented, which includes both the goal information based on task and learner's action information based on its domain knowledge. According with this reinforcement function, reinforcement learning integrates the external environment reward and the internal behavior reward so that learner can perform better. The results of the experiments illuminates the reward function involving domain knowledge is better than the traditional reward function in application.
引用
收藏
页码:596 / 603
页数:8
相关论文
共 50 条
  • [21] Emotion-Based Heterogeneous Multi-agent Reinforcement Learning with Sparse Reward
    Fang B.
    Ma Y.
    Wang Z.
    Wang H.
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (03): : 223 - 231
  • [22] Probabilistic Reward-Based Reinforcement Learning for Multi-Agent Pursuit and Evasion
    Zhang, Bo-Kun
    Hu, Bin
    Chen, Long
    Zhang, Ding-Xue
    Cheng, Xin-Ming
    Guan, Zhi-Hong
    [J]. PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 3352 - 3357
  • [23] Temporal Inconsistency-Based Intrinsic Reward for Multi-Agent Reinforcement Learning
    Sun, Shaoqi
    Xu, Kele
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [24] Reward-based epigenetic learning algorithm for a decentralised multi-agent system
    Mukhlish, Faqihza
    Page, John
    Bain, Michael
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT UNMANNED SYSTEMS, 2020, 8 (03) : 201 - 224
  • [25] All learning is local: Multi-agent learning in global reward games
    Chang, YH
    Ho, A
    Kaelbling, LP
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 807 - 814
  • [26] LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning
    Du, Yali
    Han, Lei
    Fang, Meng
    Dai, Tianhong
    Liu, Ji
    Tao, Dacheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [27] Multi-Agent Common Knowledge Reinforcement Learning
    de Witt, Christian A. Schroeder
    Foerster, Jakob N.
    Farquhar, Gregory
    Torr, Philip H. S.
    Boehmer, Wendelin
    Whiteson, Shimon
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [28] Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward
    Shao, Kun
    Zhu, Yuanheng
    Tang, Zhentao
    Zhao, Dongbin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [29] Reward Specifications in Collaborative Multi-agent Learning: A Comparative Study
    Hasan, Maram
    Niyogi, Rajdeep
    [J]. 39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 1007 - 1013
  • [30] Learning Cooperative Multi-Agent Policies With Partial Reward Decoupling
    Freed, Benjamin
    Kapoor, Aditya
    Abraham, Ian
    Schneider, Jeff
    Choset, Howie
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02): : 890 - 897