Grid Path Planning with Deep Reinforcement Learning: Preliminary Results

被引:88
|
作者
Panov, Aleksandr, I [1 ,2 ]
Yakovlev, Konstantin S. [1 ,2 ]
Suvorov, Roman [1 ]
机构
[1] Russian Acad Sci, Fed Res Ctr Comp Sci & Control, Moscow, Russia
[2] Natl Res Univ, Higher Sch Econ, Moscow, Russia
基金
俄罗斯科学基金会;
关键词
path planning; reinforcement learning; neural networks; Q-learning; convolution networks; Q-network; NEURAL-NETWORKS;
D O I
10.1016/j.procs.2018.01.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Single-shot grid-based path finding is an important problem with the applications in robotics, video games etc. Typically in AI community heuristic search methods (based on A* and its variations) are used to solve it. In this work we present the results of preliminary studies on how neural networks can be utilized to path planning on square grids, e.g. how well they can cope with path finding tasks by themselves within the well-known reinforcement problem statement. Conducted experiments show that the agent using neural Q-learning algorithm robustly learns to achieve the goal on small maps and demonstrate promising results on the maps have ben never seen by him before. (C) 2018 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/) Peer-review under responsibility of the scientific committee of the 8th Annual International Conference on Biologically Inspired Cognitive Architectures
引用
收藏
页码:347 / 353
页数:7
相关论文
共 50 条
  • [1] Deep Reinforcement Learning Algorithms for Path Planning Domain in Grid-like Environment
    Grzelczak, Maciej
    Duch, Piotr
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (23):
  • [2] Efficient Deep Reinforcement Learning for Optimal Path Planning
    Ren, Jing
    Huang, Xishi
    Huang, Raymond N.
    [J]. ELECTRONICS, 2022, 11 (21)
  • [3] Robot path planning based on deep reinforcement learning
    Long, Yinxin
    He, Huajin
    [J]. 2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS), 2020, : 151 - 154
  • [4] Dynamic Path Planning for Mobile Robots with Deep Reinforcement Learning
    Yang, Laiyi
    Bi, Jing
    Yuan, Haitao
    [J]. IFAC PAPERSONLINE, 2022, 55 (11): : 19 - 24
  • [5] Explainable Deep Reinforcement Learning for UAV autonomous path planning
    He, Lei
    Aouf, Nabil
    Song, Bifeng
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118
  • [6] UCAV Path Planning Algorithm Based on Deep Reinforcement Learning
    Zheng, Kaiyuan
    Gao, Jingpeng
    Shen, Liangxi
    [J]. IMAGE AND GRAPHICS, ICIG 2019, PT II, 2019, 11902 : 702 - 714
  • [7] A decentralized path planning model based on deep reinforcement learning
    Guo, Dong
    Ji, Shouwen
    Yao, Yanke
    Chen, Cheng
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2024, 117
  • [8] Application of Deep Reinforcement Learning in Mobile Robot Path Planning
    Xin, Jing
    Zhao, Huan
    Liu, Ding
    Li, Minqi
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 7112 - 7116
  • [9] A Deep Reinforcement Learning Based Approach for AGVs Path Planning
    Guo, Xinde
    Ren, Zhigang
    Wu, Zongze
    Lai, Jialun
    Zeng, Deyu
    Xie, Shengli
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 6833 - 6838
  • [10] Research on path planning of robot based on deep reinforcement learning
    Liu, Feng
    Chen, Chang
    Li, Zhihua
    Guan, Zhi-Hong
    Wang, Hua O.
    [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 3730 - 3734