A Path Planning Algorithm for Space Manipulator Based on Q-Learning

Cited by: 0

Authors:

Li, Taiguo [1]

Li, Quanhong [2]

Li, Wenxi [1]

Xia, Jiagao [1]

Tang, Wenhua [1]

Wang, Weiwen [1]

Affiliations:

[1] Lanzhou Inst Phys, Lanzhou, Gansu, Peoples R China

[2] Gansu Agr Univ, Coll Resources & Environm, Lanzhou, Gansu, Peoples R China

Source:

PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019) | 2019

Keywords:

Space Manipulator; Grid Model; Q-Learning; Reinforcement Learning; Path Planning

DOI:

10.1109/itaic.2019.8785427

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

An improved Q-Learning autonomous learning algorithm is proposed to solve the adaptive path planning problem for a space manipulator in an unknown environment. After the manipulator and obstacle models are simplified, a grid model of the environment is established, and the positions of the manipulator and the obstacles are randomly deployed in the grid map. Based on an analysis of the basic principles of reinforcement learning and the state generalization method, the improved Q-Learning algorithm is used to carry out the path planning. In this algorithm, reward and punishment strategies for the manipulator's path planning are designed, and an approximately greedy, continuous micro Boltzmann distribution behavior selection strategy is adopted. Through autonomous learning of the Q-table, the manipulator can guide its subsequent action selection and path planning, which reduces the number of manipulator movements and the blindness of the learning process. The results show that the algorithm is computationally simple, has strong self-learning ability, and can successfully complete adaptive path planning in an unknown environment.
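For readers unfamiliar with the approach, the following is a minimal illustrative sketch of tabular Q-learning on a grid map with Boltzmann (softmax) action selection. It is not the authors' implementation: the grid size, obstacle layout, reward and penalty values, learning rate, discount factor, and temperature schedule are all assumptions chosen for illustration, and the paper's specific improvements are not reproduced here.

```python
import math
import random

# Four grid moves: up, down, left, right (an assumed action set).
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def boltzmann_choice(q_values, temperature, rng):
    """Sample an action index with probability proportional to exp(Q / T)."""
    q_max = max(q_values)  # subtract the max for numerical stability
    weights = [math.exp((q - q_max) / temperature) for q in q_values]
    return rng.choices(range(len(q_values)), weights=weights)[0]


def step(state, action, size, obstacles, goal):
    """Apply a move; return (next_state, reward, done). Reward values are assumed."""
    row, col = state
    d_row, d_col = MOVES[action]
    nxt = (row + d_row, col + d_col)
    off_grid = not (0 <= nxt[0] < size and 0 <= nxt[1] < size)
    if off_grid or nxt in obstacles:
        return state, -10.0, False  # punishment: collision, stay in place
    if nxt == goal:
        return nxt, 100.0, True     # reward: goal reached
    return nxt, -1.0, False         # small step cost favors short paths


def train(size, obstacles, start, goal, episodes=500, alpha=0.5, gamma=0.9,
          t_start=5.0, t_end=0.05, max_steps=200, seed=0):
    """Learn a Q-table with a temperature that decays geometrically per episode."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(MOVES) for r in range(size) for c in range(size)}
    for episode in range(episodes):
        frac = episode / max(1, episodes - 1)
        temperature = t_start * (t_end / t_start) ** frac
        state = start
        for _ in range(max_steps):
            action = boltzmann_choice(q[state], temperature, rng)
            nxt, reward, done = step(state, action, size, obstacles, goal)
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, size, obstacles, start, goal, max_steps=100):
    """Follow the highest-valued action from each state to extract a path."""
    path, state = [start], start
    for _ in range(max_steps):
        action = max(range(len(MOVES)), key=lambda a: q[state][a])
        state, _, done = step(state, action, size, obstacles, goal)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    obstacles = {(1, 1), (2, 2), (3, 1)}
    q_table = train(5, obstacles, start=(0, 0), goal=(4, 4))
    print(greedy_path(q_table, 5, obstacles, (0, 0), (4, 4)))
```

A high temperature early in training makes the action choice close to uniform, which encourages exploration; as the temperature decays, the choice approaches the greedy action, so the learned Q-table increasingly guides subsequent moves.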


Pages: 1566-1571

Page count: 6
