A visual servo intelligent control method for rotor UAV based on Dyna-Q learning

Cited by: 0

Authors:

Shi, Hao-Bin [1]

Xu, Meng [1]

Liu, Jia-Yu [1]

Li, Ji-Chao [1]

Affiliation:

[1] School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China

Source:

Kongzhi yu Juece/Control and Decision | 2019, Vol. 34, No. 12

Keywords:

Reinforcement learning

DOI:

10.13195/j.kzyjc.2018.0342

Abstract:

Image-based visual servoing (IBVS) acquires image information through the robot's vision system and uses it as closed-loop feedback to control the robot's motion. In classical visual servoing, however, the servo gain is in most cases assigned manually, which leads to poor robustness and slow convergence. An intelligent servo control method based on Dyna-Q learning is therefore proposed to adjust the servo gain and improve its adaptability. First, the method extracts the target feature points with an image feature extraction algorithm based on the Freeman chain code, and uses image-based visual servoing to form closed-loop control of the feature error. Second, a decoupled visual servoing control model is presented to handle the strongly coupled, underactuated dynamics of the rotor UAV. Finally, a reinforcement learning model that uses Dyna-Q learning to adjust the servo gain is established, through which the rotor UAV can select the servo gain autonomously. Dyna-Q learning extends classical Q-learning by building an environment model that stores experience; the virtual samples generated by this model serve as additional learning samples for iterating the value function. The experimental results show that the proposed method is faster and more stable than classical PID control and classical image-based visual servoing. © 2019, Editorial Office of Control and Decision. All rights reserved.
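The Dyna-Q idea described in the abstract (Q-learning on real samples, plus extra value-function updates on virtual samples replayed from a learned environment model) can be illustrated with a minimal tabular sketch. This is not the authors' implementation: the error levels, the two candidate gains in `GAINS`, the toy dynamics in `step`, the reward of -1 per step, and all hyperparameters are hypothetical choices made only for illustration.

```python
import random
from collections import defaultdict

# Hypothetical toy setup (not from the paper): the state is a discretized
# feature-error level, the action is an index into candidate servo gains.
N_LEVELS = 5          # error levels 0..4, level 0 means "converged"
GAINS = (0.2, 0.8)    # assumed low/high candidate gains


def step(state, action):
    """Toy dynamics: low gain reduces the error by one level; high gain
    reduces it by two when the error is large, but overshoots otherwise."""
    if action == 0:
        nxt = state - 1
    elif state >= 3:
        nxt = state - 2
    else:
        nxt = min(state + 1, N_LEVELS - 1)
    return -1.0, nxt, nxt == 0   # reward, next state, done


class DynaQ:
    def __init__(self, n_actions, alpha=0.5, gamma=0.9, epsilon=0.1,
                 planning_steps=10, seed=0):
        self.n_actions = n_actions
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.planning_steps = planning_steps
        self.q = defaultdict(float)   # Q[(state, action)]
        self.model = {}               # (state, action) -> (reward, next, done)
        self.rng = random.Random(seed)

    def greedy(self, state):
        return max(range(self.n_actions), key=lambda a: self.q[(state, a)])

    def act(self, state):
        # Epsilon-greedy exploration over the candidate gains.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        return self.greedy(state)

    def _q_update(self, s, a, r, s2, done):
        best_next = 0.0 if done else max(
            self.q[(s2, b)] for b in range(self.n_actions))
        target = r + self.gamma * best_next
        self.q[(s, a)] += self.alpha * (target - self.q[(s, a)])

    def learn(self, s, a, r, s2, done):
        self._q_update(s, a, r, s2, done)      # direct update from a real sample
        self.model[(s, a)] = (r, s2, done)     # store experience in the model
        for _ in range(self.planning_steps):   # planning on virtual samples
            (ps, pa), (pr, ps2, pdone) = self.rng.choice(
                list(self.model.items()))
            self._q_update(ps, pa, pr, ps2, pdone)


def train(episodes=200, max_steps=50):
    agent = DynaQ(n_actions=len(GAINS))
    for _ in range(episodes):
        state = N_LEVELS - 1                   # start from the largest error
        for _ in range(max_steps):
            action = agent.act(state)
            reward, nxt, done = step(state, action)
            agent.learn(state, action, reward, nxt, done)
            state = nxt
            if done:
                break
    return agent
```

Calling `train()` returns an agent whose `greedy(state)` method gives the index of the preferred gain in `GAINS` for each error level. Setting `planning_steps=0` reduces the sketch to plain Q-learning, which makes the contribution of the virtual samples easy to compare.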

Pages: 2517-2526
