Grid Path Planning with Deep Reinforcement Learning: Preliminary Results

被引:88
|
作者
Panov, Aleksandr, I [1 ,2 ]
Yakovlev, Konstantin S. [1 ,2 ]
Suvorov, Roman [1 ]
机构
[1] Russian Acad Sci, Fed Res Ctr Comp Sci & Control, Moscow, Russia
[2] Natl Res Univ, Higher Sch Econ, Moscow, Russia
基金
俄罗斯科学基金会;
关键词
path planning; reinforcement learning; neural networks; Q-learning; convolution networks; Q-network; NEURAL-NETWORKS;
D O I
10.1016/j.procs.2018.01.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Single-shot grid-based path finding is an important problem with the applications in robotics, video games etc. Typically in AI community heuristic search methods (based on A* and its variations) are used to solve it. In this work we present the results of preliminary studies on how neural networks can be utilized to path planning on square grids, e.g. how well they can cope with path finding tasks by themselves within the well-known reinforcement problem statement. Conducted experiments show that the agent using neural Q-learning algorithm robustly learns to achieve the goal on small maps and demonstrate promising results on the maps have ben never seen by him before. (C) 2018 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/) Peer-review under responsibility of the scientific committee of the 8th Annual International Conference on Biologically Inspired Cognitive Architectures
引用
收藏
页码:347 / 353
页数:7
相关论文
共 50 条
  • [21] Path planning of robotic arm based on deep reinforcement learning algorithm
    Al-Gabalawy M.
    [J]. Advanced Control for Applications: Engineering and Industrial Systems, 2022, 4 (01):
  • [22] A Review of Deep Reinforcement Learning Algorithms for Mobile Robot Path Planning
    Singh, Ramanjeet
    Ren, Jing
    Lin, Xianke
    [J]. VEHICLES, 2023, 5 (04): : 1423 - 1451
  • [23] Deep Reinforcement Learning With Dynamic Graphs for Adaptive Informative Path Planning
    Vashisth, Apoorva
    Rueckin, Julius
    Magistri, Federico
    Stachniss, Cyrill
    Popovic, Marija
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7747 - 7754
  • [24] UAV online path planning technology based on deep reinforcement learning
    Fan, Jiaxuan
    Wang, Zhenya
    Ren, Jinlei
    Lu, Ying
    Liu, Yiheng
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 5382 - 5386
  • [25] Mobile Service Robot Path Planning Using Deep Reinforcement Learning
    Kumaar, A. A. Nippun
    Kochuvila, Sreeja
    [J]. IEEE ACCESS, 2023, 11 : 100083 - 100096
  • [26] A path planning method based on deep reinforcement learning for crowd evacuation
    Meng X.
    Liu H.
    Li W.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (6) : 2925 - 2939
  • [27] Path planning of manipulator based on deep reinforcement learning and screw method
    Wang Y.
    Wang Y.-H.
    Yin Z.-Z.
    Wan P.
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2023, 40 (03): : 516 - 524
  • [28] An Improved Deep Reinforcement Learning Algorithm for Path Planning in Unmanned Driving
    Yang, Kai
    Liu, Li
    [J]. IEEE ACCESS, 2024, 12 : 67935 - 67944
  • [29] Real Time Path Planning of Robot using Deep Reinforcement Learning
    Raajan, Jeevan
    Srihari, P., V
    Satya, Jayadev P.
    Bhikkaji, B.
    Pasumarthy, Ramkrishna
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 15602 - 15607
  • [30] Dynamic Path Planning of Unknown Environment Based on Deep Reinforcement Learning
    Lei, Xiaoyun
    Zhang, Zhian
    Dong, Peifang
    [J]. JOURNAL OF ROBOTICS, 2018, 2018