Model-based deep reinforcement learning with heuristic search for satellite attitude control

被引：3

作者：

Xu, Ke ^{[1
]}

Wu, Fengge ^{[1
]}

Zhao, Junsuo ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Software, Beijing, Peoples R China

来源：

INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION | 2019年 / 46卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Control; Artificial Intelligence; Deep reinforcement learning; Satellite attitude; TRACKING CONTROL;

D O I：

10.1108/IR-05-2018-0086

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Purpose Recently, deep reinforcement learning is developing rapidly and shows its power to solve difficult problems such as robotics and game of GO. Meanwhile, satellite attitude control systems are still using classical control technics such as proportional - integral - derivative and slide mode control as major solutions, facing problems with adaptability and automation. Design/methodology/approach In this paper, an approach based on deep reinforcement learning is proposed to increase adaptability and autonomy of satellite control system. It is a model-based algorithm which could find solutions with fewer episodes of learning than model-free algorithms. Findings Simulation experiment shows that when classical control crashed, this approach could find solution and reach the target with hundreds times of explorations and learning. Originality/value This approach is a non-gradient method using heuristic search to optimize policy to avoid local optima. Compared with classical control technics, this approach does not need prior knowledge of satellite or its orbit, has the ability to adapt different kinds of situations with data learning and has the ability to adapt different kinds of satellite and different tasks through transfer learning.

引用

下载

页码：415 / 420

页数：6

共 50 条

[1] Satellite attitude control method based on deep reinforcement learning
Wang Yuejiao
Ma Zhong
Yang Yidai
Wang Zhuping
Tang Lei
CHINESE SPACE SCIENCE AND TECHNOLOGY, 2019, 39 (04) : 36 - 42
[2] Satellite Attitude Control with Deep Reinforcement Learning
Gao, Duozhi
Zhang, Haibo
Li, Chuanjiang
Gao, Xinzhou
2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4095 - 4101
[3] RETRACTED: A Novel Model-Based Reinforcement Learning Attitude Control Method for Virtual Reality Satellite (Retracted Article)
Zhang, Jian
Wu, Fengge
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
[4] Learning to Paint With Model-based Deep Reinforcement Learning
Huang, Zhewei
Heng, Wen
Zhou, Shuchang
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8708 - 8717
[5] Calibrated Model-Based Deep Reinforcement Learning
Malik, Ali
Kuleshov, Volodymyr
Song, Jiaming
Nemer, Danny
Seymour, Harlan
Ermon, Stefano
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[6] Missile Attitude Control Based on Deep Reinforcement Learning
Li, Bohao
Ma, Fei
Wu, Yunjie
2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, : 931 - 936
[7] Deep reinforcement learning method based on DDPG with simulated annealing for satellite attitude control system
Su, Ruipeng
Wu, Fengge
Zhao, Junsuo
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 390 - 395
[8] Adaptive satellite attitude control for varying masses using deep reinforcement learning
Retagne, Wiebke
Dauer, Jonas
Waxenegger-Wilfing, Guenther
FRONTIERS IN ROBOTICS AND AI, 2024, 11
[9] Model-Based Reinforcement Learning For Robot Control
Li, Xiang
Shang, Weiwei
Cong, Shuang
2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 300 - 305
[10] Low-Level Control of a Quadrotor With Deep Model-Based Reinforcement Learning
Lambert, Nathan O.
Drewe, Daniel S.
Yaconelli, Joseph
Levine, Sergey
Calandra, Roberto
Pister, Kristofer S. J.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4224 - 4230

← 1 2 3 4 5 →