A Path Planning Algorithm for Space Manipulator Based on Q-Learning

被引:0
|
作者
Li, Taiguo [1 ]
Li, Quanhong [2 ]
Li, Wenxi [1 ]
Xia, Jiagao [1 ]
Tang, Wenhua [1 ]
Wang, Weiwen [1 ]
机构
[1] Lanzhou Inst Phys, Lanzhou, Gansu, Peoples R China
[2] Gansu Agr Univ, Coll Resources & Environm, Lanzhou, Gansu, Peoples R China
关键词
Space Manipulato; Grid Model; Q-Learning; Reinforcement Learning; Path Planning;
D O I
10.1109/itaic.2019.8785427
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
an improved Q-Learning autonomous learning algorithm is proposed to solve the problem of the adaptive path planning of the space manipulator in the unknown environment. After simplification of the manipulator and obstacle model, the grid model of the environment is established, and the position of the manipulator and obstacles are randomly deployed in the grid map. Based on the analysis of the basic principle of reinforcement learning and the state generalization method, the improved Q-Learning algorithm is used to carry out the path planning. In this algorithm, the reward and punishment strategies in the path planning of the manipulator are designed, and the approximate greedy and continuous micro Botlzmann distribution behavior selection strategy is adopted. According to the autonomous learning of Q-table, the manipulator can guide its follow-up action selection and path planning, reduce the number of manipulator movement, and reduce the blindness of the learning process. The results show that the algorithm has the advantages of simple calculation, strong self-learning ability, and can successfully complete the adaptive path planning in unknown environment.
引用
下载
收藏
页码:1566 / 1571
页数:6
相关论文
共 50 条
  • [21] UAV path planning algorithm based on Deep Q-Learning to search for a lost in the ocean
    Boulares, Mehrez
    Fehri, Afef
    Jemni, Mohamed
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2024, 179
  • [22] Path planning for mobile robot based on improved ant colony Q-learning algorithm
    Mengru Cui
    Maowei He
    Hanning Chen
    Kunpeng Liu
    Yabao Hu
    Chen Zheng
    Xuliang Wang
    International Journal on Interactive Design and Manufacturing (IJIDeM), 2025, 19 (4): : 3069 - 3087
  • [23] Optimal path planning method based on epsilon-greedy Q-learning algorithm
    Vahide Bulut
    Journal of the Brazilian Society of Mechanical Sciences and Engineering, 2022, 44
  • [24] Path planning of UAV using guided enhancement Q-learning algorithm
    Zhou B.
    Guo Y.
    Li N.
    Zhong X.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (09):
  • [25] Dynamic Path Planning of a Mobile Robot with Improved Q-Learning algorithm
    Li, Siding
    Xu, Xin
    Zuo, Lei
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 409 - 414
  • [26] Extended Q-Learning Algorithm for Path-Planning of a Mobile Robot
    Goswami , Indrani
    Das, Pradipta Kumar
    Konar, Amit
    Janarthanan, R.
    SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 379 - +
  • [27] An optimized Q-Learning algorithm for mobile robot local path planning
    Zhou, Qian
    Lian, Yang
    Wu, Jiayang
    Zhu, Mengyue
    Wang, Haiyong
    Cao, Jinli
    KNOWLEDGE-BASED SYSTEMS, 2024, 286
  • [28] Ant colony pheromone aided Q-learning path planning algorithm
    Tian X.-H.
    Huo X.
    Zhou D.-L.
    Zhao H.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (12): : 3345 - 3353
  • [29] Synergism of Firefly Algorithm and Q-Learning for Robot Arm Path Planning
    Sadhu, Arup Kumar
    Konar, Amit
    Bhattacharjee, Tanuka
    Das, Swagatam
    SWARM AND EVOLUTIONARY COMPUTATION, 2018, 43 : 50 - 68
  • [30] The Method Based on Q-Learning Path Planning in Migrating Workflow
    Xiao, Song
    Wang, Xiao-lin
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 2204 - 2208