Model-based deep reinforcement learning with heuristic search for satellite attitude control

被引:3
|
作者
Xu, Ke [1 ]
Wu, Fengge [1 ]
Zhao, Junsuo [1 ]
机构
[1] Chinese Acad Sci, Inst Software, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Control; Artificial Intelligence; Deep reinforcement learning; Satellite attitude; TRACKING CONTROL;
D O I
10.1108/IR-05-2018-0086
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Purpose Recently, deep reinforcement learning is developing rapidly and shows its power to solve difficult problems such as robotics and game of GO. Meanwhile, satellite attitude control systems are still using classical control technics such as proportional - integral - derivative and slide mode control as major solutions, facing problems with adaptability and automation. Design/methodology/approach In this paper, an approach based on deep reinforcement learning is proposed to increase adaptability and autonomy of satellite control system. It is a model-based algorithm which could find solutions with fewer episodes of learning than model-free algorithms. Findings Simulation experiment shows that when classical control crashed, this approach could find solution and reach the target with hundreds times of explorations and learning. Originality/value This approach is a non-gradient method using heuristic search to optimize policy to avoid local optima. Compared with classical control technics, this approach does not need prior knowledge of satellite or its orbit, has the ability to adapt different kinds of situations with data learning and has the ability to adapt different kinds of satellite and different tasks through transfer learning.
引用
下载
收藏
页码:415 / 420
页数:6
相关论文
共 50 条
  • [1] Satellite attitude control method based on deep reinforcement learning
    Wang Yuejiao
    Ma Zhong
    Yang Yidai
    Wang Zhuping
    Tang Lei
    CHINESE SPACE SCIENCE AND TECHNOLOGY, 2019, 39 (04) : 36 - 42
  • [2] Satellite Attitude Control with Deep Reinforcement Learning
    Gao, Duozhi
    Zhang, Haibo
    Li, Chuanjiang
    Gao, Xinzhou
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4095 - 4101
  • [3] RETRACTED: A Novel Model-Based Reinforcement Learning Attitude Control Method for Virtual Reality Satellite (Retracted Article)
    Zhang, Jian
    Wu, Fengge
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [4] Learning to Paint With Model-based Deep Reinforcement Learning
    Huang, Zhewei
    Heng, Wen
    Zhou, Shuchang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8708 - 8717
  • [5] Calibrated Model-Based Deep Reinforcement Learning
    Malik, Ali
    Kuleshov, Volodymyr
    Song, Jiaming
    Nemer, Danny
    Seymour, Harlan
    Ermon, Stefano
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [6] Missile Attitude Control Based on Deep Reinforcement Learning
    Li, Bohao
    Ma, Fei
    Wu, Yunjie
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2020, : 931 - 936
  • [7] Deep reinforcement learning method based on DDPG with simulated annealing for satellite attitude control system
    Su, Ruipeng
    Wu, Fengge
    Zhao, Junsuo
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 390 - 395
  • [8] Adaptive satellite attitude control for varying masses using deep reinforcement learning
    Retagne, Wiebke
    Dauer, Jonas
    Waxenegger-Wilfing, Guenther
    FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [9] Model-Based Reinforcement Learning For Robot Control
    Li, Xiang
    Shang, Weiwei
    Cong, Shuang
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 300 - 305
  • [10] Low-Level Control of a Quadrotor With Deep Model-Based Reinforcement Learning
    Lambert, Nathan O.
    Drewe, Daniel S.
    Yaconelli, Joseph
    Levine, Sergey
    Calandra, Roberto
    Pister, Kristofer S. J.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (04) : 4224 - 4230