Reinforcement learning-based complete area coverage path planning for a modified htrihex robot

被引:0
|
作者
Apuroop, Koppaka Ganesh Sai [1 ]
Le, Anh Vu [2 ]
Elara, Mohan Rajesh [1 ]
Sheu, Bing J. [3 ]
机构
[1] ROAR Lab, Engineering Product Development, Singapore University of Technology and Design, Singapore,487372, Singapore
[2] Optoelectronics Research Group, Faculty of Electrical and Electronics Engineering, Ton Duc Thang University, Ho Chi Minh City,700000, Viet Nam
[3] Electronics Engineering and Information Management, Chang Gung University, Taoyuan City,330, Taiwan
来源
Sensors (Switzerland) | 2021年 / 21卷 / 04期
关键词
Learning algorithms - Long short-term memory - Multilayer neural networks - Traveling salesman problem - Energy utilization - Motion planning - Ant colony optimization - Cleaning - Learning systems - Genetic algorithms - Robot programming;
D O I
暂无
中图分类号
学科分类号
摘要
One of the essential attributes of a cleaning robot is to achieve complete area coverage. Current commercial indoor cleaning robots have fixed morphology and are restricted to clean only specific areas in a house. The results of maximum area coverage are sub-optimal in this case. Tiling robots are innovative solutions for such a coverage problem. These new kinds of robots can be deployed in the cases of cleaning, painting, maintenance, and inspection, which require complete area coverage. Tiling robots’ objective is to cover the entire area by reconfiguring to different shapes as per the area requirements. In this context, it is vital to have a framework that enables the robot to maximize the area coverage while minimizing energy consumption. That means it is necessary for the robot to cover the maximum area with the least number of shape reconfigurations possible. The current paper proposes a complete area coverage planning module for the modified hTrihex, a honeycomb-shaped tiling robot, based on the deep reinforcement learning technique. This framework simultaneously generates the tiling shapes and the trajectory with minimum overall cost. In this regard, a convolutional neural network (CNN) with long short term memory (LSTM) layer was trained using the actor-critic experience replay (ACER) reinforcement learning algorithm. The simulation results obtained from the current implementation were compared against the results that were generated through traditional tiling theory models that included zigzag, spiral, and greedy search schemes. The model presented in the current paper was also compared against other methods where this problem was considered as a traveling salesman problem (TSP) solved through genetic algorithm (GA) and ant colony optimization (ACO) approaches. Our proposed scheme generates a path with a minimized cost at a lesser time. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.
引用
下载
收藏
页码:1 / 20
相关论文
共 50 条
  • [41] Research on Full Coverage Path Planning Based on Reinforcement Learning in Nuclear Environment
    Wang, Shiqi
    Song, Shuzong
    Liu, Zhenni
    Ma, Lijun
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (04)
  • [42] Path planning of a mobile robot by optimization and reinforcement learning
    Harukazu Igarashi
    Artificial Life and Robotics, 2002, 6 (1-2) : 59 - 65
  • [43] Reinforcement Learning-Based Approach to Robot Path Tracking in Nonlinear Dynamic Environments
    Chen, Wei
    Zhou, Zebin
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2024, 21 (04)
  • [44] Field Complete Coverage Path Planning Based on Improved Genetic Algorithm for Transplanting Robot
    Wu, Xizhi
    Bai, Jinqiang
    Hao, Fengqi
    Cheng, Guanghe
    Tang, Yongwei
    Li, Xiuhua
    MACHINES, 2023, 11 (06)
  • [45] Sensor-based complete coverage path planning in dynamic environment for cleaning robot
    Liu, Hong
    Ma, Jiayao
    Huang, Weibo
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2018, 3 (01) : 65 - 72
  • [46] Design and Simulation of Complete Coverage Indoor Robot Path Planning Algorithm Based on BIM
    Hu, Jiqiang
    Huang, Rongjiang
    Zeng, Qiang
    Luo, Shipeng
    Wang, Xianhe
    CARBON PEAK AND NEUTRALITY STRATEGIES OF THE CONSTRUCTION INDUSTRY (ICCREM 2022), 2022, : 305 - 314
  • [47] Reinforcement learning-based radar-evasive path planning: a comparative analysis
    Hameed, R. U.
    Maqsood, A.
    Hashmi, A. J.
    Saeed, M. T.
    Riaz, R.
    AERONAUTICAL JOURNAL, 2022, 126 (1297): : 547 - 564
  • [48] Rescue path planning for urban flood: A deep reinforcement learning-based approach
    Li, Xiao-Yan
    Wang, Xia
    RISK ANALYSIS, 2024,
  • [49] Deep Reinforcement Learning-Based Robotic Puncturing Path Planning of Flexible Needle
    Lin, Jun
    Huang, Zhiqiang
    Zhu, Tengliang
    Leng, Jiewu
    Huang, Kai
    Processes, 12 (12):
  • [50] Improved Path Planning for Indoor Patrol Robot Based on Deep Reinforcement Learning
    Zheng, Jianfeng
    Mao, Shuren
    Wu, Zhenyu
    Kong, Pengcheng
    Qiang, Hao
    SYMMETRY-BASEL, 2022, 14 (01):