Reinforcement Learning with Reward Shaping and Hybrid Exploration in Sparse Reward Scenes

被引:0
|
作者
Yang, Yulong [1 ]
Cao, Weihua
Guo, Linwei
Gan, Chao
Wu, Min
机构
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
reinforcement learning; sparse reward; reward shaping; hybrid exploration;
D O I
10.1109/ICPS58381.2023.10128012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High precision modeling in industrial systems is difficult and costly. Model-free intelligent control methods, represented by reinforcement learning, have been applied in industrial systems broadly. The hard evaluated of production states and the low value density of processing data causes sparse rewards, which lead to an insufficient performance of reinforcement learning. To overcome the difficulty of reinforcement learning in sparse reward scenes, a reinforcement learning method with reward shaping and hybrid exploration is proposed. By perfecting the rewards distribution in the state space of environment, the reward shaping can make the state-value estimation of reinforcement learning more accurate. By improving the rewards distribution in time dimension, the hybrid exploration can make the iteration of reinforcement learning more efficient and more stable. Finally, the effectiveness of the proposed method is verified by simulations.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem
    Hua, Yun
    Wang, Xiangfeng
    Jin, Bo
    Li, Wenhao
    Yan, Junchi
    He, Xiaofeng
    Zha, Hongyuan
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 637 - 645
  • [42] Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
    Guo, Yijie
    Wu, Qiucheng
    Lee, Honglak
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6792 - 6800
  • [43] Self-Supervised Online Reward Shaping in Sparse-Reward Environments
    Memarian, Farzan
    Goo, Wonjoon
    Lioutikov, Rudolf
    Niekum, Scott
    Topcu, Ufuk
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 2369 - 2375
  • [44] Policy-based deep reinforcement learning for sparse reward environment
    Kim, MyeongSeop
    Kim, Jung-Su
    [J]. Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (03): : 506 - 514
  • [45] Sparse reward for reinforcement learning-based continuous integration testing
    Yang, Yang
    Li, Zheng
    Shang, Ying
    Li, Qianyu
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2023, 35 (06)
  • [46] Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
    Sohn, Sungryull
    Lee, Sungtae
    Choi, Jongwook
    van Seijen, Harm
    Fatemi, Mehdi
    Lee, Honglak
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [47] Reinforcement Learning in Sparse-Reward Environments With Hindsight Policy Gradients
    Rauber, Paulo
    Ummadisingu, Avinash
    Mutz, Filipe
    Schmidhuber, Juergen
    [J]. NEURAL COMPUTATION, 2021, 33 (06) : 1498 - 1553
  • [48] Reinforcement learning in sparse-reward environments with hindsight policy gradients
    Queen Mary University of London, London
    E1 4FZ, United Kingdom
    不详
    100-0004, Japan
    不详
    29056-264, Brazil
    不详
    6962, Switzerland
    不详
    6900, Switzerland
    不详
    6928, Switzerland
    不详
    6900, Switzerland
    [J]. Neural Comp., 6 (1498-1553):
  • [49] Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments
    Rengarajan, Desik
    Chaudhary, Sapana
    Kim, Jaewon
    Kalathil, Dileep
    Shakkottai, Srinivas
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [50] Reward Reports for Reinforcement Learning
    Gilbert, Thomas Krendl
    Lambert, Nathan
    Dean, Sarah
    Zick, Tom
    Snoswell, Aaron
    Mehta, Soham
    [J]. PROCEEDINGS OF THE 2023 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, AIES 2023, 2023, : 84 - 130