Fuzzy neural control of satellite attitude by TD based reinforcement learning

被引:0
|
作者
Cui, Xiao-ting [1 ]
Liu, Xiang-dong [1 ]
机构
[1] Beijing Inst Technol, Dept Automat Control, Beijing 100081, Peoples R China
关键词
satellite attitude control; fuzzy neural network; reinforcement learning; temporal difference learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With recent development of the space science and technology, higher requirements such as accuracy, robustness and disturbance rejection ability in satellite attitude control system have leaded to the more promising intelligent control methods. In this paper, a fuzzy neural control approach applied to the three-axis stabilized satellite is presented. In order to solve the problems of online learning and tuning of the fuzzy neural network parameters, the reinforcement learning based on temporal difference (TD) is also proposed and studied so that the training samples for the self-learning controller are not needed. Since the vibration of the solar swing cannot be ignored, a flexible mathematic model of the satellite is studied, employing Quaternion and Euler-Angles representations. The simulation results showed that the proposed control method with reinforcement learning architecture could not only improve the accuracy and robustness of the system, but also could deal with the uncertainties and external disturbance efficiently.
引用
收藏
页码:3983 / +
页数:2
相关论文
共 6 条
  • [1] LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS
    BERENJI, HR
    KHEDKAR, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05): : 724 - 740
  • [2] BERENJI HR, 1994, P 2 INT C FUZZ SYST, P1396
  • [3] Lin CT, 1994, NEURAL FUZZY CONTROL
  • [4] Sutton R. S., 1988, Machine Learning, V3, P9, DOI 10.1023/A:1022633531479
  • [5] Adaptive fuzzy control of satellite attitude by reinforcement learning
    van Buijtenen, WM
    Schram, G
    Babuska, R
    Verbruggen, HB
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1998, 6 (02) : 185 - 194
  • [6] WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698