Model Predictive Control-Based Value Estimation for Efficient Reinforcement Learning

Citations: 0

Authors:

Wu, Qizhen [1]

Liu, Kexin [1]

Chen, Lei [2]

Affiliations:

[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China

[2] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China

Source:

IEEE INTELLIGENT SYSTEMS | 2024 / Volume 39 / Issue 03

Funding:

National Science Foundation (United States); National Key Research and Development Program;

Keywords:

Predictive models; Neural networks; Trajectory; Data models; Computational modeling; Training; Optimization; Reinforcement learning; Predictive control;

DOI:

10.1109/MIS.2024.3386204

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Reinforcement learning (RL) is limited in practice primarily by the number of interactions it requires with virtual environments. This poses a challenging problem because, for many learning methods, a locally optimal strategy cannot plausibly be obtained from only a few attempts. We therefore design an improved RL method based on model predictive control that models the environment through a data-driven approach. Using the learned environment model, the method performs multistep prediction to estimate the value function and optimize the policy. It demonstrates higher learning efficiency, faster convergence of strategies toward the local optimal value, and a smaller capacity requirement for experience replay buffers. Experimental results, both in classic databases and in a dynamic obstacle-avoidance scenario for an unmanned aerial vehicle, validate the proposed approach.


Pages: 63-72

Page count: 10

50 items in total

[1] Model Predictive Control-Based Reinforcement Learning
Han, Qiang
Boussaid, Farid
Bennamoun, Mohammed
2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024
[2] Model Predictive Control-Based Reinforcement Learning Using Expected Sarsa
Moradimaryamnegari, Hoomaan
Frego, Marco
Peer, Angelika
IEEE ACCESS, 2022, 10 : 81177 - 81191
[3] Energy management in residential microgrid using model predictive control-based reinforcement learning and Shapley value
Cai, Wenqi
Kordabad, Arash Bahari
Gros, Sebastien
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
[4] Filtered Probabilistic Model Predictive Control-Based Reinforcement Learning for Unmanned Surface Vehicles
Cui, Yunduan
Peng, Lei
Li, Huiyun
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (10) : 6950 - 6961
[5] Autonomous boat driving system using sample-efficient model predictive control-based reinforcement learning approach
Cui, Yunduan
Osaki, Shigeki
Matsubara, Takamitsu
JOURNAL OF FIELD ROBOTICS, 2021, 38 (03) : 331 - 354
[6] An efficient Model Predictive Control-based motion cueing algorithm for the driving simulator
Fang, Zhou
Kemeny, Andras
SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2016, 92 (11) : 1025 - 1033
[7] Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control
Kamthe, Sanket
Deisenroth, Marc Peter
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
[8] Reinforcement Learning Boat Autopilot: A Sample-efficient and Model Predictive Control based Approach
Cui, Yunduan
Osaki, Shigeki
Matsubara, Takamitsu
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 2868 - 2875
[9] Model Predictive Control of Quadruped Robot Based on Reinforcement Learning
Zhang, Zhitong
Chang, Xu
Ma, Hongxu
An, Honglei
Lang, Lin
APPLIED SCIENCES-BASEL, 2023, 13 (01)
[10] Markov Decision Process Framework for Control-Based Reinforcement Learning
Lu Y.
Squillante M.S.
Wah Wu C.
Performance Evaluation Review, 2023, 51 (02) : 39 - 41
