DATA-EFFICIENT DEEP REINFORCEMENT LEARNING WITH CONVOLUTION-BASED STATE ENCODER NETWORKS

Cited by: 0

Authors:

Fang, Qiang [1]

Xu, Xin [1]

Lan, Yixin [1]

Zhang, Yichuan [1]

Zeng, Yujun [1]

Tang, Tao [1]

Affiliations:

[1] National University of Defense Technology, College of Intelligence Science and Technology, Changsha 410073, People's Republic of China

Source:

INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION | 2021 / Vol. 36

Funding:

National Natural Science Foundation of China;

Keywords:

Deep reinforcement learning; actor-critic learning; learning control; online learning; auto-encoder; REPRESENTATION;

DOI:

10.2316/J.2021.206-0763

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

Owing to its ability to handle high-dimensional, end-to-end learning control problems, deep reinforcement learning (DRL) has attracted considerable research interest in recent years. However, existing DRL approaches still face the challenge of data efficiency, and the online learning control performance of DRL algorithms still needs to be improved. In this paper, we propose an online DRL approach with convolutional encoder networks. The proposed approach uses a cascaded learning control architecture that performs system state extraction and dimension reduction in the first stage and executes online reinforcement learning in the second stage. A convolutional network encodes features from the raw image data so that the algorithm can operate on the encoded low-dimensional features, which can significantly improve learning efficiency. Experimental results on two benchmark learning control tasks show that the proposed approach outperforms previous end-to-end DRL approaches, demonstrating its effectiveness and efficiency.
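
The two-stage cascade described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the record gives no network sizes, training procedure, or task details, so everything below is an assumption made for illustration. In particular, the filters are fixed random 3x3 kernels rather than a trained encoder, the image is a random 16x16 array, only a linear critic with a TD(0) update is shown (no actor), and the names `conv2d_relu`, `encode`, and `td_critic_update` are hypothetical.

```python
import random

def conv2d_relu(image, kernel, stride=2):
    """Valid 2-D cross-correlation of one kernel over the image, then ReLU."""
    k = len(kernel)
    out = []
    for i in range(0, len(image) - k + 1, stride):
        row = []
        for j in range(0, len(image[0]) - k + 1, stride):
            s = sum(kernel[a][b] * image[i + a][j + b]
                    for a in range(k) for b in range(k))
            row.append(max(0.0, s))
        out.append(row)
    return out

def encode(image, kernels):
    """Stage 1 (assumed form): one feature per kernel via conv, ReLU, and
    global average pooling, turning a raw image into a short feature vector."""
    features = []
    for kernel in kernels:
        fmap = conv2d_relu(image, kernel)
        total = sum(sum(row) for row in fmap)
        features.append(total / (len(fmap) * len(fmap[0])))
    return features

def td_critic_update(weights, feats, reward, next_feats, alpha=0.1, gamma=0.95):
    """Stage 2 (assumed form): one online TD(0) step for a linear critic
    that sees only the encoded low-dimensional features."""
    value = sum(w * f for w, f in zip(weights, feats))
    next_value = sum(w * f for w, f in zip(weights, next_feats))
    td_error = reward + gamma * next_value - value
    new_weights = [w + alpha * td_error * f for w, f in zip(weights, feats)]
    return new_weights, td_error

# Hypothetical data: 4 random 3x3 filters and two random 16x16 "frames".
rng = random.Random(0)
kernels = [[[rng.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
           for _ in range(4)]
image = [[rng.random() for _ in range(16)] for _ in range(16)]
next_image = [[rng.random() for _ in range(16)] for _ in range(16)]

feats = encode(image, kernels)            # 256 pixels -> 4 features
next_feats = encode(next_image, kernels)
weights, td_error = td_critic_update([0.0] * len(feats), feats, 1.0, next_feats)
```

The point of the sketch is the division of labour: the reinforcement learning update touches only the 4-element feature vector, never the 256 raw pixels, which is the kind of dimension reduction the abstract credits for the improved learning efficiency.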


Pages: 10
