An Improved Reinforcement Learning Method Based on Unsupervised Learning

被引:0
|
作者
Chang, Xin [1 ]
Li, Yanbin [1 ]
Zhang, Guanjie [1 ]
Liu, Donghui [2 ]
Fu, Changjun [1 ]
机构
[1] 54th Res Inst China Elect Technol Grp Corp CETC54, Shijiazhuang 050081, Peoples R China
[2] Shijiazhuang Tiedao Univ, Sch Management, Shijiazhuang 050043, Peoples R China
基金
中国博士后科学基金;
关键词
Reinforcement learning; unsupervised learning; supervised learning; deep learning; dimensionality reduction; NETWORKS; LEVEL;
D O I
10.1109/ACCESS.2024.3351696
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The approach of directly combining clustering method and reinforcement learning (RL) will lead to encounter the issue where states may have different state transition processes under the same action, resulting in poor policy performance. To address this challenge with multi-dimensional continuous observation data, an improved reinforcement learning method based on unsupervised learning is proposed with a novel framework. Instead of dimensionality reduction methods, unsupervised clustering is employed to indirectly capture the underlying structure of the data. First, the proposed framework incorporates multi-dimensional information, including the current observation data, the next observation data and reward information, during the clustering process, leading to a more accurate and comprehensive low-dimensional discrete representation of the observation data while retaining preserving transition of Markov decision process. Second, by compressing the observation data into a well-defined state space, the resulting cluster labels serve as the low-dimensional discrete label-states for reinforcement learning to generate more effective and robust policies. Comparative analysis with state-of-the-art RL methods demonstrates that the improved RL methods base on framework achieves higher rewards, indicating its superior performance. Furthermore, the framework exhibits computational efficiency, as evidenced by its reasonable time complexity. This structural innovation allows for better exploration and exploitation of the transition, leading to improved policy performance in engineering applications.
引用
收藏
页码:12295 / 12307
页数:13
相关论文
共 50 条
  • [1] Reinforcement learning reward functions for unsupervised learning
    Fyfe, Colin
    Lai, Pei Ling
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 397 - +
  • [2] Wasserstein Unsupervised Reinforcement Learning
    He, Shuncheng
    Jiang, Yuhang
    Zhang, Hongchang
    Shao, Jianzhun
    Ji, Xiangyang
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6884 - 6892
  • [3] Reinforcement Learning Based on Active Learning Method
    Sagha, Hesam
    Shouraki, Saeed Bagheri
    Khasteh, Hosein
    Kiaei, Ali Akbar
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 598 - +
  • [4] Improved Robot Path Planning Method Based on Deep Reinforcement Learning
    Han, Huiyan
    Wang, Jiaqi
    Kuang, Liqun
    Han, Xie
    Xue, Hongxin
    [J]. SENSORS, 2023, 23 (12)
  • [5] Unsupervised Learning for Robust Fitting: A Reinforcement Learning Approach
    Truong, Giang
    Le, Huu
    Suter, David
    Zhang, Erchuan
    Gilani, Syed Zulqarnain
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10343 - 10352
  • [6] A brainlike learning system with supervised, unsupervised, and reinforcement learning
    Sasakawa, Takafumi
    Hu, Jinglu
    Hirasawa, Kotaro
    [J]. ELECTRICAL ENGINEERING IN JAPAN, 2008, 162 (01) : 32 - 39
  • [7] Unsupervised Reinforcement Learning in Multiple Environments
    Mutti, Mirco
    Mancassola, Mattia
    Restelli, Marcello
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7850 - 7858
  • [8] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
    Yoon, Ui Nyoung
    Hong, Myung Duk
    Jo, Geun-Sik
    [J]. SENSORS, 2023, 23 (07)
  • [9] A hierarchical learning system incorporating with supervised, unsupervised and reinforcement learning
    Hu, Jinglu
    Sasakawa, Takafumi
    Hirasawa, Kotaro
    Zheng, Huiru
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 403 - +
  • [10] Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning
    Feng, Fei
    Wang, Ruosong
    Yin, Wotao
    Du, Simon S.
    Yang, Lin F.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33