Reinforcement learning in non-Markovian environments using automatic discovery of subgoals

被引:0
|
作者
Dung, Le Tien
Komeda, Takashi
Takagi, Motoki
机构
来源
PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8 | 2007年
关键词
selected keywords relevant to the subject;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning time is always a critical issue in Reinforcement Learning, especially when Recurrent Neural Networks (RNNs) are used to predict Q values. By creating useful subgoals, we can speed up learning performance. In this paper, we propose a method to accelerate learning in non-Markovian environments using automatic discovery of subgoals. Once subgoals are created, sub-policies use RNNs to attain them `1hen learned RNNs are integrated into the main RNN as experts. Finally, the agent continues to learn using its new policy. Experiment results of the E maze problem and the virtual office problem show the potential of this approach.
引用
收藏
页码:2592 / 2596
页数:5
相关论文
共 50 条
  • [41] Quantum acceleration by an ancillary system in non-Markovian environments
    Fan, Jinna
    Wu, Shaoxiong
    Yu, Chang-shui
    QUANTUM INFORMATION PROCESSING, 2021, 20 (01)
  • [42] Geometric measure of quantum discord in non-Markovian environments
    Altintas, Ferdi
    OPTICS COMMUNICATIONS, 2010, 283 (24) : 5264 - 5268
  • [43] Entanglement Dynamics of Three Qubits in the Non-Markovian Environments
    Shan Chuan-Jia
    Liu Ji-Bing
    Chen Tao
    Liu Tang-Kun
    Huang Yan-Xia
    Li Hong
    CHINESE PHYSICS LETTERS, 2010, 27 (10)
  • [44] Discord and entanglement in non-Markovian environments at finite temperatures
    邹红梅
    方卯发
    Chinese Physics B, 2016, 25 (09) : 211 - 217
  • [45] Zeno limit in frequency estimation with non-Markovian environments
    Macieszczak, Katarzyna
    PHYSICAL REVIEW A, 2015, 92 (01):
  • [46] Decoherence suppression of tripartite entanglement in non-Markovian environments by using weak measurements
    Ding, Zhi-Yong
    He, Juan
    Ye, Liu
    ANNALS OF PHYSICS, 2017, 377 : 96 - 107
  • [47] Machine Learning Non-Markovian Quantum Dynamics
    Luchnikov, I. A.
    Vintskevich, S. V.
    Grigoriev, D. A.
    Filippov, S. N.
    PHYSICAL REVIEW LETTERS, 2020, 124 (14)
  • [48] Learning non-Markovian physics from data
    Gonzalez, David
    Chinesta, Francisco
    Cueto, Elias
    JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 428 (428)
  • [49] Online Learning of non-Markovian Reward Models
    Rens, Gavin
    Raskin, Jean-Francois
    Reynouard, Raphael
    Marra, Giuseppe
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 74 - 86
  • [50] Learning Non-Markovian Constraints for Handwriting Recognition
    Kakisako, Ryosuke
    Uchida, Seiichi
    Volkmar, Frinken
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 446 - 450