DATA-EFFICIENT DEEP REINFORCEMENT LEARNING WITH CONVOLUTION-BASED STATE ENCODER NETWORKS

Authors
Fang, Qiang [1]
Xu, Xin [1]
Lan, Yixin [1]
Zhang, Yichuan [1]
Zeng, Yujun [1]
Tang, Tao [1]
Affiliations
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep reinforcement learning; actor-critic learning; learning control; online learning; auto-encoder; REPRESENTATION;
DOI
10.2316/J.2021.206-0763
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Owing to its ability to handle high-dimensional end-to-end learning control problems, deep reinforcement learning (DRL) has attracted considerable research interest in recent years. However, existing DRL approaches still face the challenge of data efficiency, and the online learning control performance of DRL algorithms still needs to be improved. In this paper, we propose an online DRL approach with convolutional encoder networks. The proposed approach uses a cascaded learning control architecture that performs system state extraction and dimension reduction in the first stage and executes online reinforcement learning in the second stage. A convolutional network encodes features from the raw image data so that the algorithm can operate on the encoded low-dimensional features, which can significantly improve learning efficiency. Experimental results on two benchmark learning control tasks show that the proposed approach outperforms previous end-to-end DRL approaches, demonstrating its effectiveness and efficiency.
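The cascaded architecture described in the abstract can be illustrated with a minimal sketch. The abstract gives no network sizes or update rules, so everything below is an assumption made for illustration: a single 3x3 convolution layer with stride 2, ReLU and global average pooling stands in for the convolutional encoder, and a linear one-step actor-critic stands in for the online learner. The names `conv2d_relu`, `encode` and `LinearActorCritic` are hypothetical and do not come from the paper.

```python
import math
import random

def conv2d_relu(image, kernel, stride=2):
    """Valid 2-D cross-correlation with one square kernel, followed by ReLU."""
    k = len(kernel)
    rows = range(0, len(image) - k + 1, stride)
    cols = range(0, len(image[0]) - k + 1, stride)
    return [[max(0.0, sum(image[r + i][c + j] * kernel[i][j]
                          for i in range(k) for j in range(k)))
             for c in cols] for r in rows]

def encode(image, kernels):
    """Stage 1 (assumed form): one pooled feature per kernel from a raw image."""
    features = []
    for kernel in kernels:
        cells = [v for row in conv2d_relu(image, kernel) for v in row]
        features.append(sum(cells) / len(cells))  # global average pooling
    return features

def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

class LinearActorCritic:
    """Stage 2 (assumed form): one-step actor-critic on the encoded features."""

    def __init__(self, n_features, n_actions,
                 lr_actor=0.01, lr_critic=0.05, gamma=0.99):
        self.theta = [[0.0] * n_features for _ in range(n_actions)]  # actor
        self.w = [0.0] * n_features                                  # critic
        self.lr_actor, self.lr_critic, self.gamma = lr_actor, lr_critic, gamma

    def policy(self, z):
        return softmax([sum(t * f for t, f in zip(row, z)) for row in self.theta])

    def value(self, z):
        return sum(w * f for w, f in zip(self.w, z))

    def act(self, z, rng):
        return rng.choices(range(len(self.theta)), weights=self.policy(z))[0]

    def update(self, z, action, reward, z_next, done):
        """Apply one online TD(0) update and return the TD error."""
        target = reward + (0.0 if done else self.gamma * self.value(z_next))
        td_error = target - self.value(z)
        probs = self.policy(z)
        for i, f in enumerate(z):
            self.w[i] += self.lr_critic * td_error * f
        for a, row in enumerate(self.theta):
            # Gradient of log softmax policy with respect to the logit of a.
            grad_log = (1.0 if a == action else 0.0) - probs[a]
            for i, f in enumerate(z):
                row[i] += self.lr_actor * td_error * grad_log * f
        return td_error

# Usage on a synthetic 16x16 "image"; the kernels are random, not trained.
rng = random.Random(0)
kernels = [[[rng.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
           for _ in range(4)]
image = [[rng.random() for _ in range(16)] for _ in range(16)]
z = encode(image, kernels)          # 256 pixels reduced to 4 features
agent = LinearActorCritic(n_features=len(z), n_actions=2)
action = agent.act(z, rng)
agent.update(z, action, reward=1.0, z_next=z, done=False)
```

In this sketch the learner only ever sees the 4-element feature vector, which is the point of the cascade: each online update touches a handful of parameters instead of a full image-sized network. How the encoder is actually trained in the paper (the keywords mention an auto-encoder) is not stated in the abstract and is not modeled here.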
Pages: 10