TRANSFER LEARNING BASED ON FORBIDDEN RULE SET IN ACTOR-CRITIC METHOD

被引:0
|
作者
Takano, Toshiaki [1 ]
Takase, Haruhiko [1 ]
Kawanaka, Hiroharu [1 ]
Kita, Hidehiko [1 ]
Hayashi, Terumine [1 ]
Tsuruoka, Shinji [1 ]
机构
[1] Mie Univ, Tsu, Mie 514, Japan
关键词
Reinforcement learning; Actor-critic method; Transfer learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we aim to accelerate learning processes in actor-critic method. We proposed the effective transfer learning method, which reduces training cycles by using information acquired from source tasks. The proposed method consists of two ideas, the method to select a policy to transfer, and the transfer method considering the characteristic of each actor-critic parameter set. The selection method aims to reduce redundant trial and error that are used in the selection phase and the training phase. We introduce the forbidden rule set, which are detected easily in the training phase, and concordance rate that measures an effectiveness of a source policy. The transfer method aims to merge a selected source policy to the target policy without negative transfers. it transfers only reliable action preferences and state values that implies preferred actions. We show the effectiveness of the proposed method by simple experiments. Agents found effective policies from, the database, and finished their training with less or same episodes than the original actor-critic method.
引用
收藏
页码:2907 / 2917
页数:11
相关论文
共 50 条
  • [1] Merging with Extraction Method for Transfer Learning in Actor-Critic
    Takano, Toshiaki
    Takase, Haruhiko
    Kawanaka, Hiroharu
    Tsuruoka, Shinji
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (07) : 814 - 821
  • [2] Swarm Reinforcement Learning Method Based on an Actor-Critic Method
    Iima, Hitoshi
    Kuroe, Yasuaki
    [J]. SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 279 - 288
  • [3] Actor-Critic based Improper Reinforcement Learning
    Zaki, Mohammadi
    Mohan, Avinash
    Gopalan, Aditya
    Mannor, Shie
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [4] Deployment Algorithm of Service Function Chain Based on Transfer Actor-Critic Learning
    Tang Lun
    He Xiaoyu
    Wang Xiao
    Chen Qianbin
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (11) : 2671 - 2679
  • [5] Actor-Critic reinforcement learning based on prior knowledge
    [J]. Yang, Zhenyu, 1600, Transport and Telecommunication Institute, Lomonosova street 1, Riga, LV-1019, Latvia (18):
  • [6] Actor-Critic Learning Based on Adaptive Importance Sampling
    Cheng Yuhu
    Feng Huanting
    Wang Xuesong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (04) : 583 - 588
  • [7] Actor-critic learning based on fuzzy inference system
    Jouffe, L
    [J]. INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 339 - 344
  • [8] Granular computing in actor-critic learning
    Peters, James F.
    [J]. 2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 59 - 64
  • [9] An inertia wheel pendulum control method based on actor-critic learning algorithm
    Liu Huanlong
    Wang Zhengjie
    Jiang Bin
    Peng Hongyu
    [J]. 2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1281 - 1285
  • [10] An Actor-Critic Method for Simulation-Based Optimization
    Li, Kuo
    Jia, Qing-Shan
    Yan, Jiaqi
    [J]. IFAC PAPERSONLINE, 2022, 55 (11): : 7 - 12