PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning

被引:0
|
作者
Wu, Jizhou [1 ]
Hao, Jianye [1 ]
Yang, Tianpei [2 ,3 ]
Hao, Xiaotian [1 ]
Zheng, Yan [1 ]
Wang, Weixun [4 ]
Taylor, Matthew E. [2 ,3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
[2] Univ Alberta, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst, Edmonton, AB, Canada
[4] Netease Fuxi AI Lab, Beijing, Peoples R China
基金
加拿大自然科学与工程研究理事会; 国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite many breakthroughs in recent years, it is still hard for MultiAgent Reinforcement Learning (MARL) algorithms to directly solve complex tasks in MultiAgent Systems (MASs) from scratch. In this work, we study how to use Automatic Curriculum Learning (ACL) to reduce the number of environmental interactions required to learn a good policy. In order to solve a difficult task, ACL methods automatically select a sequence of tasks (i.e., curricula). The idea is to obtain maximum learning progress towards the final task by continuously learning on tasks that match the current capabilities of the learners. The key question is how to measure the learning progress of the learner for better curriculum selection. We propose a novel ACL framework, PrOgRessive mulTiagent Automatic curricuLum (PORTAL), for MASs. PORTAL selects curricula according to two criteria: 1) How difficult is a task, relative to the learners' current abilities? 2) How similar is a task, relative to the final task? By learning a shared feature space between tasks, PORTAL can characterize different tasks based on the distribution of features and select those that are similar to the final task. Also, the shared feature space can effectively facilitate policy transfer between curricula. Experimental results show that PORTAL can train agents to master extremely hard cooperative tasks, which can not be achieved with previous state-of-the-art MARL algorithms.
引用
收藏
页码:15934 / 15942
页数:9
相关论文
共 50 条
  • [1] Automatic Decomposition of Reward Machines for Decentralized Multiagent Reinforcement Learning
    Smith, Sophia
    Neary, Cyrus
    Topcu, Ufuk
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 5423 - 5430
  • [2] Automatic View Generation with Deep Learning and Reinforcement Learning
    Yuan, Haitao
    Li, Guoliang
    Feng, Ling
    Sun, Ji
    Han, Yue
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1501 - 1512
  • [3] Automatic Goal Generation for Reinforcement Learning Agents
    Florensa, Carlos
    Held, David
    Geng, Xinyang
    Abbeel, Pieter
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [4] A reinforcement learning approach to automatic generation control
    Ahamed, TPI
    Rao, PSN
    Sastry, PS
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2002, 63 (01) : 9 - 26
  • [5] Automatic Poetry Generation with Mutual Reinforcement Learning
    Yi, Xiaoyuan
    Sun, Maosong
    Li, Ruoyu
    Li, Wenhao
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3143 - 3153
  • [6] Deep Reinforcement Learning for Automatic Thumbnail Generation
    Li, Zhuopeng
    Zhang, Xiaoyan
    [J]. MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 41 - 53
  • [7] Adaptive Multiagent Model Based on Reinforcement Learning for Distributed Generation Systems
    Divenyi, Daniel
    Dan, Andras
    [J]. 2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 303 - 307
  • [8] Asymmetric multiagent reinforcement learning
    Könönen, V
    [J]. IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 336 - 342
  • [9] Automatic Curriculum Graph Generation for Reinforcement Learning Agents
    Svetlik, Maxwell
    Leonetti, Matteo
    Sinapov, Jivko
    Shah, Rishi
    Walker, Nick
    Stone, Peter
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2590 - 2596
  • [10] Semi-Automatic Generation of Intelligent Curricula to Facilitate Learning Analytics
    Fiallos, Angel
    Ochoa, Xavier
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE (LAK'19), 2019, : 46 - 50