Decomposing user-defined tasks in a reinforcement learning setup using TextWorld

被引:0
|
作者
Petsanis, Thanos [1 ]
Keroglou, Christoforos [1 ]
Kapoutsis, Athanasios Ch. [2 ]
Kosmatopoulos, Elias B. [1 ]
Sirakoulis, Georgios Ch. [1 ]
机构
[1] Democritus Univ Thrace DUTH, Sch Engn, Dept Elect & Comp Engn, Xanthi, Greece
[2] Informat Technol Inst, Ctr Res & Technol, Thessaloniki, Greece
来源
关键词
formal methods in robotics and automation; reinforcement learning; hierarchical reinforcement learning; task and motion planning; autonomous agents; ENVIRONMENTS;
D O I
10.3389/frobt.2023.1280578
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The current paper proposes a hierarchical reinforcement learning (HRL) method to decompose a complex task into simpler sub-tasks and leverage those to improve the training of an autonomous agent in a simulated environment. For practical reasons (i.e., illustrating purposes, easy implementation, user-friendly interface, and useful functionalities), we employ two Python frameworks called TextWorld and MiniGrid. MiniGrid functions as a 2D simulated representation of the real environment, while TextWorld functions as a high-level abstraction of this simulated environment. Training on this abstraction disentangles manipulation from navigation actions and allows us to design a dense reward function instead of a sparse reward function for the lower-level environment, which, as we show, improves the performance of training. Formal methods are utilized throughout the paper to establish that our algorithm is not prevented from deriving solutions.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Streaming time series summarization using user-defined amnesic functions
    Palpanas, Themis
    Vlachos, Michail
    Keogh, Eamonn
    Gunopulos, Dimitrios
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (07) : 992 - 1006
  • [22] DeepLabCut: markerless pose estimation of user-defined body parts with deep learning
    Mathis, Alexander
    Mamidanna, Pranav
    Cury, Kevin M.
    Abe, Taiga
    Murthy, Venkatesh N.
    Mathis, Mackenzie Weygandt
    Bethge, Matthias
    [J]. NATURE NEUROSCIENCE, 2018, 21 (09) : 1281 - +
  • [23] A Pipelined Division for Fixed Operation Using User-defined Floating Point
    Yang, Pengfei
    Zha, Daolu
    Jin, Xi
    [J]. 2018 20TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2018, : 634 - 637
  • [24] Improving Image Processing Performance Using Database User-Defined Functions
    Vagac, Michal
    Melichercik, Miroslav
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2015, 9119 : 789 - 799
  • [25] Designing and Implementing Software Systems using User-defined Design Patterns
    Ozkaya, Mert
    Kose, Mehmet Alp
    [J]. PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES (ICSOFT), 2021, : 497 - 504
  • [26] Spatial Data Sequence Selection Based on a User-Defined Condition Using GPGPU
    En-Nejjary, Driss
    Pinet, Francois
    Kang, Myoung-Ah
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (12)
  • [27] Transliterating Latin to Amharic scripts using user-defined rules and character mappings
    Zeleke Abebaw
    Andreas Rauber
    Solomon Atnafu
    [J]. International Journal on Digital Libraries, 2023, 24 : 63 - 75
  • [28] PARALLELIZING USER-DEFINED FUNCTIONS IN THE ETL WORKFLOW USING ORCHESTRATION STYLE SHEETS
    Ali, Syed Muhammad Fawad
    Mey, Johannes
    Thiele, Maik
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2019, 29 (01) : 69 - 79
  • [29] Extraction of user-defined data blocks using the regularity of dynamic Web Pages
    Choi, Cheolhee
    Kang, Jinbeom
    Choi, Joongmin
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2007, 4681 : 123 - +
  • [30] Declarative Parameterizations of User-Defined Functions for Large-Scale Machine Learning and Optimization
    Gao, Zekai J.
    Pansare, Niketan
    Jermaine, Christopher
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (11) : 2079 - 2092