Reinforcement learning in non-Markovian environments using automatic discovery of subgoals

被引：0

作者：

Dung, Le Tien

Komeda, Takashi

Takagi, Motoki

机构：

来源：

PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8 | 2007年

关键词：

selected keywords relevant to the subject;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning time is always a critical issue in Reinforcement Learning, especially when Recurrent Neural Networks (RNNs) are used to predict Q values. By creating useful subgoals, we can speed up learning performance. In this paper, we propose a method to accelerate learning in non-Markovian environments using automatic discovery of subgoals. Once subgoals are created, sub-policies use RNNs to attain them `1hen learned RNNs are integrated into the main RNN as experts. Finally, the agent continues to learn using its new policy. Experiment results of the E maze problem and the virtual office problem show the potential of this approach.

引用

页码：2592 / 2596

页数：5

共 50 条

[41] Quantum acceleration by an ancillary system in non-Markovian environments
Fan, Jinna
Wu, Shaoxiong
Yu, Chang-shui
QUANTUM INFORMATION PROCESSING, 2021, 20 (01)
[42] Geometric measure of quantum discord in non-Markovian environments
Altintas, Ferdi
OPTICS COMMUNICATIONS, 2010, 283 (24) : 5264 - 5268
[43] Entanglement Dynamics of Three Qubits in the Non-Markovian Environments
Shan Chuan-Jia
Liu Ji-Bing
Chen Tao
Liu Tang-Kun
Huang Yan-Xia
Li Hong
CHINESE PHYSICS LETTERS, 2010, 27 (10)
[44] Discord and entanglement in non-Markovian environments at finite temperatures
邹红梅
方卯发
Chinese Physics B, 2016, 25 (09) : 211 - 217
[45] Zeno limit in frequency estimation with non-Markovian environments
Macieszczak, Katarzyna
PHYSICAL REVIEW A, 2015, 92 (01):
[46] Decoherence suppression of tripartite entanglement in non-Markovian environments by using weak measurements
Ding, Zhi-Yong
He, Juan
Ye, Liu
ANNALS OF PHYSICS, 2017, 377 : 96 - 107
[47] Machine Learning Non-Markovian Quantum Dynamics
Luchnikov, I. A.
Vintskevich, S. V.
Grigoriev, D. A.
Filippov, S. N.
PHYSICAL REVIEW LETTERS, 2020, 124 (14)
[48] Learning non-Markovian physics from data
Gonzalez, David
Chinesta, Francisco
Cueto, Elias
JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 428 (428)
[49] Online Learning of non-Markovian Reward Models
Rens, Gavin
Raskin, Jean-Francois
Reynouard, Raphael
Marra, Giuseppe
ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 74 - 86
[50] Learning Non-Markovian Constraints for Handwriting Recognition
Kakisako, Ryosuke
Uchida, Seiichi
Volkmar, Frinken
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 446 - 450

← 1 2 3 4 5 →