A visual servo intelligent control method for rotor UAV based on Dyna-Q learning

被引:0
|
作者
Shi, Hao-Bin [1 ]
Xu, Meng [1 ]
Liu, Jia-Yu [1 ]
Li, Ji-Chao [1 ]
机构
[1] School of Computer Science, Northwestern Polytechnical University, Xi'an,710072, China
来源
Kongzhi yu Juece/Control and Decision | 2019年 / 34卷 / 12期
关键词
Reinforcement learning;
D O I
10.13195/j.kzyjc.2018.0342
中图分类号
学科分类号
摘要
The image-based visual servo control method of robots obtains the image information through the robot's vision and then forms the closed-loop feedback based on the image information to control the robot's reasonable movement. However, due to the problem of poor robustness and slow convergence, the selection of servo gain for classical visual servoing is artificial assignment under most conditions. Therefore, an intelligent servo control method based on Dyna-Q learning is proposed to adjust the servo gain to improve its adaptability. Firstly, this method uses the image feature extraction algorithm based on Felman chain code to extract the target feature point, then uses the image-based visual servoing to form the closed-loop control of the characteristic error. Then, this paper presents a decoupling visual servoing control model for the dynamic characteristics of rotor UAV's strong coupling underactuated. Finally, a reinforcement learning model using Dyna-Q learning to adjust the servo gain is established, through which the rotor UAV can choose the servo gain independently. The Dyna-Q learning method learns to store experience on the basis of classical Q-Learning by setting up an environment model, and the virtual samples generated by the environment model can be used as learning samples to iterate the value function. The experimental results show that the proposed method is faster and more stable than the classical PID control and classical image based visual servo methods. © 2019, Editorial Office of Control and Decision. All right reserved.
引用
收藏
页码:2517 / 2526
相关论文
共 50 条
  • [1] Adaptive Model Learning Based on Dyna-Q Learning
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    [J]. CYBERNETICS AND SYSTEMS, 2013, 44 (08) : 641 - 662
  • [2] Model-based Indirect Learning method based on Dyna-Q architecture
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    Wang, Wei-Han
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 2540 - 2544
  • [3] Intelligent Ramp Control for Incident Response Using Dyna-Q Architecture
    Lu, Chao
    Zhao, Yanan
    Gong, Jianwei
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [4] Pheromone-Based Planning Strategies in Dyna-Q Learning
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2017, 13 (02) : 424 - 435
  • [5] An Improved Dyna-Q Algorithm Based in Reverse Model Learning
    Tseng, Yi-Jia
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Huang, Tsung-Chuan
    Chen, Song-Shyong
    [J]. NEW TRENDS ON SYSTEM SCIENCES AND ENGINEERING, 2015, 276 : 200 - 212
  • [6] Tree-Based Dyna-Q Agent
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    [J]. 2012 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2012, : 1077 - 1080
  • [7] Model Learning for Multistep Backward Prediction in Dyna-Q Learning
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    Hwang, Iris
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (09): : 1470 - 1481
  • [8] An Intelligent Tracking Method of Rotor UAV Based on Reinforcement Learning
    Shi, Hao-Bin
    Xu, Meng
    [J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2019, 48 (04): : 553 - 559
  • [9] Model Learning and Knowledge Sharing for a Multiagent System With Dyna-Q Learning
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Chen, Yu-Jen
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (05) : 964 - 976
  • [10] Gaussian Process based Deep Dyna-Q Approach for Dialogue Policy Learning
    Wu, Guanlin
    Fang, Wenqi
    Wang, Ji
    Cao, Jiang
    Bao, Weidong
    Ping, Yang
    Zhu, Xiaomin
    Wang, Zheng
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1786 - 1795