PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning

被引:0
|
作者
Wu, Jizhou [1 ]
Hao, Jianye [1 ]
Yang, Tianpei [2 ,3 ]
Hao, Xiaotian [1 ]
Zheng, Yan [1 ]
Wang, Weixun [4 ]
Taylor, Matthew E. [2 ,3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Alberta, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst, Edmonton, AB, Canada
[4] Netease Fuxi AI Lab, Beijing, Peoples R China
基金
加拿大自然科学与工程研究理事会; 国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite many breakthroughs in recent years, it is still hard for MultiAgent Reinforcement Learning (MARL) algorithms to directly solve complex tasks in MultiAgent Systems (MASs) from scratch. In this work, we study how to use Automatic Curriculum Learning (ACL) to reduce the number of environmental interactions required to learn a good policy. In order to solve a difficult task, ACL methods automatically select a sequence of tasks (i.e., curricula). The idea is to obtain maximum learning progress towards the final task by continuously learning on tasks that match the current capabilities of the learners. The key question is how to measure the learning progress of the learner for better curriculum selection. We propose a novel ACL framework, PrOgRessive mulTiagent Automatic curricuLum (PORTAL), for MASs. PORTAL selects curricula according to two criteria: 1) How difficult is a task, relative to the learners' current abilities? 2) How similar is a task, relative to the final task? By learning a shared feature space between tasks, PORTAL can characterize different tasks based on the distribution of features and select those that are similar to the final task. Also, the shared feature space can effectively facilitate policy transfer between curricula. Experimental results show that PORTAL can train agents to master extremely hard cooperative tasks, which can not be achieved with previous state-of-the-art MARL algorithms.
引用
收藏
页码:15934 / 15942
页数:9
相关论文
共 50 条
  • [41] Measurement of Underlying Cooperation in Multiagent Reinforcement Learning
    Arai, Sachiyo
    Ishigaki, Yoshihisa
    Hirata, Hironori
    [J]. INTELLIGENT AGENTS AND MULTI-AGENT SYSTEMS, PROCEEDINGS, 2008, 5357 : 34 - 41
  • [42] Multiagent Reinforcement Learning for Swarm Confrontation Environments
    Zhang, Guanyu
    Li, Yuan
    Xu, Xinhai
    Dai, Huadong
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PT III, 2019, 11742 : 533 - 543
  • [43] Collaborative multiagent reinforcement learning by payoff propagation
    Kok, Jelle R.
    Vlassis, Nikos
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2006, 7 : 1789 - 1828
  • [44] Acceleration methods for centralized multiagent reinforcement learning
    Akahane, Takuma
    Iima, Hitoshi
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2020, 140 (02) : 242 - 248
  • [45] Stigmergic Independent Reinforcement Learning for Multiagent Collaboration
    Xu, Xing
    Li, Rongpeng
    Zhao, Zhifeng
    Zhang, Honggang
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 4285 - 4299
  • [46] Heuristically-Accelerated Multiagent Reinforcement Learning
    Bianchi, Reinaldo A. C.
    Martins, Murilo F.
    Ribeiro, Carlos H. C.
    Costa, Anna H. R.
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (02) : 252 - 265
  • [47] An Advising Framework for Multiagent Reinforcement Learning Systems
    da Silva, Felipe Leno
    Glatt, Ruben
    Reali Costa, Anna Helena
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4913 - 4914
  • [48] Autonomously Reusing Knowledge in Multiagent Reinforcement Learning
    Da Silva, Felipe Leno
    Taylor, Matthew E.
    Reali Costa, Anna Helena
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5487 - 5493
  • [49] The dynamics of reinforcement learning in cooperative multiagent systems
    Claus, C
    Boutilier, C
    [J]. FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, 1998, : 746 - 752
  • [50] Employing OLAP mining for multiagent reinforcement learning
    Alhajj, R
    Kaya, M
    [J]. DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 759 - 768