Hierarchical Task Decomposition through Symbiosis in Reinforcement Learning

被引：16

作者：

Doucette, John A. ^{[1
]}

Lichodzijewski, Peter ^{[1
]}

Heywood, Malcolm I. ^{[1
]}

机构：

[1] Univ Waterloo, Sch Comp Sci, Waterloo, ON N2L 3G1, Canada

来源：

PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE | 2012年

关键词：

Symbiosis; Reinforcement learning; Meta actions; Task decomposition; Genetic Programming; NEURAL NETWORKS;

D O I：

10.1145/2330163.2330178

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Adopting a symbiotic model of evolution separates context for deploying an action from the action itself. Such a separation provides a mechanism for task decomposition in temporal sequence learning. Moreover, previously learnt policies are taken to be synonymous with meta actions (actions that are themselves policies). Should solutions to the task not be forthcoming in an initial round of evolution, then solutions from the earlier round represent the 'meta' actions for a new round of evolution. This provides the basis for evolving policy trees. A benchmarking study is performed using the Acrobot handstand task. Solutions to date from reinforcement learning have not been able to approach the performance of those established 14 years ago using an A* search and a priori knowledge regarding the Acrobot energy equations. The proposed symbiotic approach is able to match and, for the first time, better these results. Moreover, unlike previous work, solutions are tested under a broad range of Acrobot initial conditions, with hierarchical solutions providing significantly better generalization performance.

引用

页码：97 / 104

页数：8

共 50 条

[41] Budgeted Hierarchical Reinforcement Learning
Leon, Aurelia
Denoyer, Ludovic
[J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
[42] Learning disentangled skills for hierarchical reinforcement learning through trajectory autoencoder with weak labels
Song, Wonil
Jeon, Sangryul
Choi, Hyesong
Sohn, Kwanghoon
Min, Dongbo
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
[43] Study of a Multi-Robot Collaborative Task through Reinforcement Learning
Pereda, Juan
Martin-Ortiz, Manuel
de Lope, Javier
de la Paz, Felix
[J]. FOUNDATIONS ON NATURAL AND ARTIFICIAL COMPUTATION: 4TH INTERNATIONAL WORK-CONFERENCE ON THE INTERPLAY BETWEEN NATURAL AND ARTIFICIAL COMPUTATION, IWINAC 2011, PART I, 2011, 6686 : 185 - 191
[44] Robotic Arm Control and Task Training Through Deep Reinforcement Learning
Franceschetti, Andrea
Tosello, Elisa
Castaman, Nicola
Ghidoni, Stefano
[J]. INTELLIGENT AUTONOMOUS SYSTEMS 16, IAS-16, 2022, 412 : 532 - 550
[45] Deep-Reinforcement-Learning-Based Object Transportation Using Task Space Decomposition
Eoh, Gyuho
[J]. SENSORS, 2023, 23 (10)
[46] Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning
Icarte, Rodrigo Toro
Klassen, Toryn Q.
Valenzano, Richard
McHraith, Sheila A.
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[47] Reinforcement Learning with Task Decomposition and Task-Specific Reward System for Automation of High-Level Tasks
Kwon, Gunam
Kim, Byeongjun
Kwon, Nam Kyu
[J]. BIOMIMETICS, 2024, 9 (04)
[48] Multi-Task Decomposition Architecture based Deep Reinforcement Learning for Obstacle Avoidance
Zhang, Wengang
He, Cong
Wang, Teng
[J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2735 - 2740
[49] Deep learning tomographic reconstruction through hierarchical decomposition of domain transforms
Fu, Lin
De Man, Bruno
[J]. VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2022, 5 (01)
[50] Deep learning tomographic reconstruction through hierarchical decomposition of domain transforms
Lin Fu
Bruno De Man
[J]. Visual Computing for Industry, Biomedicine, and Art, 5

← 1 2 3 4 5 →