Reinforcement learning-based complete area coverage path planning for a modified htrihex robot

被引:0
|
作者
Apuroop, Koppaka Ganesh Sai [1 ]
Le, Anh Vu [2 ]
Elara, Mohan Rajesh [1 ]
Sheu, Bing J. [3 ]
机构
[1] ROAR Lab, Engineering Product Development, Singapore University of Technology and Design, Singapore,487372, Singapore
[2] Optoelectronics Research Group, Faculty of Electrical and Electronics Engineering, Ton Duc Thang University, Ho Chi Minh City,700000, Viet Nam
[3] Electronics Engineering and Information Management, Chang Gung University, Taoyuan City,330, Taiwan
来源
Sensors (Switzerland) | 2021年 / 21卷 / 04期
关键词
Learning algorithms - Long short-term memory - Multilayer neural networks - Traveling salesman problem - Energy utilization - Motion planning - Ant colony optimization - Cleaning - Learning systems - Genetic algorithms - Robot programming;
D O I
暂无
中图分类号
学科分类号
摘要
One of the essential attributes of a cleaning robot is to achieve complete area coverage. Current commercial indoor cleaning robots have fixed morphology and are restricted to clean only specific areas in a house. The results of maximum area coverage are sub-optimal in this case. Tiling robots are innovative solutions for such a coverage problem. These new kinds of robots can be deployed in the cases of cleaning, painting, maintenance, and inspection, which require complete area coverage. Tiling robots’ objective is to cover the entire area by reconfiguring to different shapes as per the area requirements. In this context, it is vital to have a framework that enables the robot to maximize the area coverage while minimizing energy consumption. That means it is necessary for the robot to cover the maximum area with the least number of shape reconfigurations possible. The current paper proposes a complete area coverage planning module for the modified hTrihex, a honeycomb-shaped tiling robot, based on the deep reinforcement learning technique. This framework simultaneously generates the tiling shapes and the trajectory with minimum overall cost. In this regard, a convolutional neural network (CNN) with long short term memory (LSTM) layer was trained using the actor-critic experience replay (ACER) reinforcement learning algorithm. The simulation results obtained from the current implementation were compared against the results that were generated through traditional tiling theory models that included zigzag, spiral, and greedy search schemes. The model presented in the current paper was also compared against other methods where this problem was considered as a traveling salesman problem (TSP) solved through genetic algorithm (GA) and ant colony optimization (ACO) approaches. Our proposed scheme generates a path with a minimized cost at a lesser time. © 2021 by the authors. Licensee MDPI, Basel, Switzerland.
引用
下载
收藏
页码:1 / 20
相关论文
共 50 条
  • [31] Reinforcement learning-based dynamic obstacle avoidance and integration of path planning
    Jaewan Choi
    Geonhee Lee
    Chibum Lee
    Intelligent Service Robotics, 2021, 14 : 663 - 677
  • [32] Research on path planning algorithm of mobile robot based on reinforcement learning
    Guoqian Pan
    Yong Xiang
    Xiaorui Wang
    Zhongquan Yu
    Xinzhi Zhou
    Soft Computing, 2022, 26 : 8961 - 8970
  • [33] Robot Patrol Path Planning Based on Combined Deep Reinforcement Learning
    Li, Wenqi
    Chen, Dehua
    Le, Jiajin
    2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 659 - 666
  • [34] Path planning for mobile robot based on improved reinforcement learning algorithm
    Xu X.
    Yuan J.
    Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2019, 27 (03): : 314 - 320
  • [35] A case-based reinforcement learning for probe robot path planning
    Yang, L
    Chen, ZH
    Chen, F
    PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, 2002, : 1161 - 1165
  • [36] Research on path planning algorithm of mobile robot based on reinforcement learning
    Pan, Guoqian
    Xiang, Yong
    Wang, Xiaorui
    Yu, Zhongquan
    Zhou, Xinzhi
    SOFT COMPUTING, 2022, 26 (18) : 8961 - 8970
  • [37] Path planning for a robot manipulator based on probabilistic roadmap and reinforcement learning
    Department of Mechanical Engineering, Korea University, Anam-dong, Seongbuk-gu, Seoul 136-713, Korea, Republic of
    Int. J. Control Autom. Syst., 2007, 6 (674-680):
  • [38] Complete Coverage Path Planning and Obstacle Avoidance Strategy of the Robot
    Wu, JunHui
    Qin, TongDi
    Chen, Jie
    Si, HuiPing
    Lin, KaiYan
    Zhou, Qiang
    Zhang, ChiBin
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 346 - 351
  • [39] Coverage path planning for kiwifruit picking robots based on deep reinforcement learning
    Wang, Yinchu
    He, Zhi
    Cao, Dandan
    Ma, Li
    Li, Kai
    Jia, Liangsheng
    Cui, Yongjie
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 205
  • [40] Multi-agent Coverage Path Planning Based on Security Reinforcement Learning
    Li S.
    Ma Z.
    Zhang Y.
    Shao J.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 : 101 - 113