Hierarchical Task Decomposition through Symbiosis in Reinforcement Learning

被引:16
|
作者
Doucette, John A. [1 ]
Lichodzijewski, Peter [1 ]
Heywood, Malcolm I. [1 ]
机构
[1] Univ Waterloo, Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
关键词
Symbiosis; Reinforcement learning; Meta actions; Task decomposition; Genetic Programming; NEURAL NETWORKS;
D O I
10.1145/2330163.2330178
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Adopting a symbiotic model of evolution separates context for deploying an action from the action itself. Such a separation provides a mechanism for task decomposition in temporal sequence learning. Moreover, previously learnt policies are taken to be synonymous with meta actions (actions that are themselves policies). Should solutions to the task not be forthcoming in an initial round of evolution, then solutions from the earlier round represent the 'meta' actions for a new round of evolution. This provides the basis for evolving policy trees. A benchmarking study is performed using the Acrobot handstand task. Solutions to date from reinforcement learning have not been able to approach the performance of those established 14 years ago using an A* search and a priori knowledge regarding the Acrobot energy equations. The proposed symbiotic approach is able to match and, for the first time, better these results. Moreover, unlike previous work, solutions are tested under a broad range of Acrobot initial conditions, with hierarchical solutions providing significantly better generalization performance.
引用
收藏
页码:97 / 104
页数:8
相关论文
共 50 条
  • [41] Budgeted Hierarchical Reinforcement Learning
    Leon, Aurelia
    Denoyer, Ludovic
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [42] Learning disentangled skills for hierarchical reinforcement learning through trajectory autoencoder with weak labels
    Song, Wonil
    Jeon, Sangryul
    Choi, Hyesong
    Sohn, Kwanghoon
    Min, Dongbo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
  • [43] Study of a Multi-Robot Collaborative Task through Reinforcement Learning
    Pereda, Juan
    Martin-Ortiz, Manuel
    de Lope, Javier
    de la Paz, Felix
    [J]. FOUNDATIONS ON NATURAL AND ARTIFICIAL COMPUTATION: 4TH INTERNATIONAL WORK-CONFERENCE ON THE INTERPLAY BETWEEN NATURAL AND ARTIFICIAL COMPUTATION, IWINAC 2011, PART I, 2011, 6686 : 185 - 191
  • [44] Robotic Arm Control and Task Training Through Deep Reinforcement Learning
    Franceschetti, Andrea
    Tosello, Elisa
    Castaman, Nicola
    Ghidoni, Stefano
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS 16, IAS-16, 2022, 412 : 532 - 550
  • [46] Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning
    Icarte, Rodrigo Toro
    Klassen, Toryn Q.
    Valenzano, Richard
    McHraith, Sheila A.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [47] Reinforcement Learning with Task Decomposition and Task-Specific Reward System for Automation of High-Level Tasks
    Kwon, Gunam
    Kim, Byeongjun
    Kwon, Nam Kyu
    [J]. BIOMIMETICS, 2024, 9 (04)
  • [48] Multi-Task Decomposition Architecture based Deep Reinforcement Learning for Obstacle Avoidance
    Zhang, Wengang
    He, Cong
    Wang, Teng
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2735 - 2740
  • [49] Deep learning tomographic reconstruction through hierarchical decomposition of domain transforms
    Fu, Lin
    De Man, Bruno
    [J]. VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2022, 5 (01)
  • [50] Deep learning tomographic reconstruction through hierarchical decomposition of domain transforms
    Lin Fu
    Bruno De Man
    [J]. Visual Computing for Industry, Biomedicine, and Art, 5