Constructing and Evaluating Options in Reinforcement Learning

被引：0

作者：

Farahani, Marzieh Davoodabadi ^{[1
]}

Mozayani, Nasser ^{[1
]}

机构：

[1] Iran Univ Sci & Technol, Dept Comp Engn, Tehran, Iran

来源：

2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST) | 2018年

关键词：

Hierarchical Reinforcement Learning; Temporal Abstraction; Option; Community Detection; Macro-Action Evaluation;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we propose a new subgoal based method for automatic construction of useful options. In our proposed method, subgoals are considered as border states of communities of the transition graph created after some initial agent interactions with the environment. We present a new community detection algorithm to provide an appropriate partitioning of the transition graph. Macro-actions are constructed for taking the agent from one community to other communities. In addition, we attempt to capture intuitions about features of useful macro-actions. There is a lack of a generic evaluation mechanism for evaluating each macro-action in previous research. We will propose a method for evaluating each macro-action separately. Inappropriate macro-actions are identified with this method and discarded from agent choices. Experimental results show a significant improvement in results after pruning macro-actions.

引用

页码：183 / 186

页数：4

共 50 条

[31] Constructing a hierarchical ontology for reinforcement learning multi-agent system
Yu, XL
Wang, L
Cui, DH
[J]. ISTM/2003: 5TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, CONFERENCE PROCEEDINGS, 2003, : 1249 - 1252
[32] Applying real options with reinforcement learning to assess commercial CCU deployment
Lee, Jeehwan S.
Chun, Woopill
Roh, Kosan
Heo, Seongmin
Lee, Jay H.
[J]. JOURNAL OF CO2 UTILIZATION, 2023, 77
[33] A Framework to Discover and Reuse Object-Oriented Options in Reinforcement Learning
Bonini, Rodrigo Cesar
Da Silva, Felipe Leno
Glatt, Ruben
Spina, Edison
Reali Costa, Anna Helena
[J]. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 109 - 114
[34] Connect-based subgoal discovery for options in hierarchical reinforcement learning
Chen, Fei
Gao, Yang
Chen, Shifu
Ma, Zhenduo
[J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2007, : 698 - +
[35] Multi-agent hierarchical reinforcement learning by integrating options into MAXQ
Shen, Jing
Gu, Guochang
Liu, Haibo
[J]. FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1, 2006, : 676 - +
[36] Traffic Light Control Using Hierarchical Reinforcement Learning and Options Framework
Borges, Dimitrius F.
Leite, Joao Paulo R. R.
Moreira, Edmilson M.
Carpinteiro, Otavio A. S.
[J]. IEEE ACCESS, 2021, 9 : 99155 - 99165
[37] Options in Multi-task Reinforcement Learning - Transfer via Reflection
Denis, Nicholas
Fraser, Maia
[J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 225 - 237
[38] Evaluating the Coordination of Agents in Multi-agent Reinforcement Learning
Barton, Sean L.
Zaroukian, Erin
Asher, Derrik E.
Waytowich, Nicholas R.
[J]. INTELLIGENT HUMAN SYSTEMS INTEGRATION 2019, 2019, 903 : 765 - 770
[39] Evaluating the Effectiveness of Deep Reinforcement Learning Algorithms in a Walking Environment
Neervannan, Arjun
[J]. BALTIC JOURNAL OF MODERN COMPUTING, 2018, 6 (04): : 335 - 348
[40] Evaluating Domain Randomization in Deep Reinforcement Learning Locomotion Tasks
Ajani, Oladayo S.
Hur, Sung-ho
Mallipeddi, Rammohan
[J]. MATHEMATICS, 2023, 11 (23)

← 1 2 3 4 5 →