Constructing and Evaluating Options in Reinforcement Learning

被引:0
|
作者
Farahani, Marzieh Davoodabadi [1 ]
Mozayani, Nasser [1 ]
机构
[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran
关键词
Hierarchical Reinforcement Learning; Temporal Abstraction; Option; Community Detection; Macro-Action Evaluation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a new subgoal based method for automatic construction of useful options. In our proposed method, subgoals are considered as border states of communities of the transition graph created after some initial agent interactions with the environment. We present a new community detection algorithm to provide an appropriate partitioning of the transition graph. Macro-actions are constructed for taking the agent from one community to other communities. In addition, we attempt to capture intuitions about features of useful macro-actions. There is a lack of a generic evaluation mechanism for evaluating each macro-action in previous research. We will propose a method for evaluating each macro-action separately. Inappropriate macro-actions are identified with this method and discarded from agent choices. Experimental results show a significant improvement in results after pruning macro-actions.
引用
收藏
页码:183 / 186
页数:4
相关论文
共 50 条
  • [31] Constructing a hierarchical ontology for reinforcement learning multi-agent system
    Yu, XL
    Wang, L
    Cui, DH
    [J]. ISTM/2003: 5TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, CONFERENCE PROCEEDINGS, 2003, : 1249 - 1252
  • [32] Applying real options with reinforcement learning to assess commercial CCU deployment
    Lee, Jeehwan S.
    Chun, Woopill
    Roh, Kosan
    Heo, Seongmin
    Lee, Jay H.
    [J]. JOURNAL OF CO2 UTILIZATION, 2023, 77
  • [33] A Framework to Discover and Reuse Object-Oriented Options in Reinforcement Learning
    Bonini, Rodrigo Cesar
    Da Silva, Felipe Leno
    Glatt, Ruben
    Spina, Edison
    Reali Costa, Anna Helena
    [J]. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 109 - 114
  • [34] Connect-based subgoal discovery for options in hierarchical reinforcement learning
    Chen, Fei
    Gao, Yang
    Chen, Shifu
    Ma, Zhenduo
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2007, : 698 - +
  • [35] Multi-agent hierarchical reinforcement learning by integrating options into MAXQ
    Shen, Jing
    Gu, Guochang
    Liu, Haibo
    [J]. FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 676 - +
  • [36] Traffic Light Control Using Hierarchical Reinforcement Learning and Options Framework
    Borges, Dimitrius F.
    Leite, Joao Paulo R. R.
    Moreira, Edmilson M.
    Carpinteiro, Otavio A. S.
    [J]. IEEE ACCESS, 2021, 9 : 99155 - 99165
  • [37] Options in Multi-task Reinforcement Learning - Transfer via Reflection
    Denis, Nicholas
    Fraser, Maia
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 225 - 237
  • [38] Evaluating the Coordination of Agents in Multi-agent Reinforcement Learning
    Barton, Sean L.
    Zaroukian, Erin
    Asher, Derrik E.
    Waytowich, Nicholas R.
    [J]. INTELLIGENT HUMAN SYSTEMS INTEGRATION 2019, 2019, 903 : 765 - 770
  • [39] Evaluating the Effectiveness of Deep Reinforcement Learning Algorithms in a Walking Environment
    Neervannan, Arjun
    [J]. BALTIC JOURNAL OF MODERN COMPUTING, 2018, 6 (04): : 335 - 348
  • [40] Evaluating Domain Randomization in Deep Reinforcement Learning Locomotion Tasks
    Ajani, Oladayo S.
    Hur, Sung-ho
    Mallipeddi, Rammohan
    [J]. MATHEMATICS, 2023, 11 (23)