Towards autonomous and optimal excavation of shield machine: a deep reinforcement learning-based approach

被引:12
|
作者
Zhang, Ya-kun [1 ]
Gong, Guo-fang [1 ]
Yang, Hua-yong [1 ]
Chen, Yu-xi [1 ]
Chen, Geng-lin [2 ]
机构
[1] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou 310027, Peoples R China
[2] China Univ Min & Technol, Sch Elect & Power Engn, Xuzhou 221116, Jiangsu, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Shield machine; Slurry shield; Intelligent tunnel boring machine (TBM); Deep reinforcement learning; Optimal control; Dynamic optimization; Deep learning; TUNNEL BORING MACHINE; LAYOUT DESIGN; MODEL; PERFORMANCE; PREDICTION; TORQUE; INTELLIGENCE; SIMULATION; THRUST; SCHEME;
D O I
10.1631/jzus.A2100325
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Autonomous excavation operation is a major trend in the development of a new generation of intelligent tunnel boring machines (TBMs). However, existing technologies are limited to supervised machine learning and static optimization, which cannot outperform human operation and deal with ever changing geological conditions and the long-term performance measure. The aim of this study is to resolve the problem of dynamic optimization of the shield excavation performance, as well as to achieve autonomous optimal excavation. In this study, a novel autonomous optimal excavation approach that integrates deep reinforcement learning and optimal control is proposed for shield machines. Based on a first-principles analysis of the machine-ground interaction dynamics of the excavation process, a deep neural network model is developed using construction field data consisting of 1.1 million samples. The multi-system coupling mechanism is revealed by establishing an overall system model. Based on the overall system analysis, the autonomous optimal excavation problem is decomposed into a multi-objective dynamic optimization problem and an optimal control problem. Subsequently, a dimensionless multi-objective comprehensive excavation performance measure is proposed. A deep reinforcement learning method is used to solve for the optimal action sequence trajectory, and optimal closed-loop feedback controllers are designed to achieve accurate execution. The performance of the proposed approach is compared to that of human operation by using the construction field data. The simulation results show that the proposed approach not only has the potential to replace human operation but also can significantly improve the comprehensive excavation performance.
引用
收藏
页码:458 / 478
页数:21
相关论文
共 50 条
  • [1] Towards autonomous and optimal excavation of shield machine: a deep reinforcement learning-based approach题目:迈向盾构机自主最优掘进:一种基于深度强化学习的方法
    Ya-kun Zhang
    Guo-fang Gong
    Hua-yong Yang
    Yu-xi Chen
    Geng-lin Chen
    [J]. Journal of Zhejiang University-SCIENCE A, 2022, 23 : 458 - 478
  • [2] Autonomous intelligent control of earth pressure balance shield machine based on deep reinforcement learning
    Liu, Xuanyu
    Zhang, Wenshuai
    Shao, Cheng
    Wang, Yudong
    Cong, Qiumei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [3] A deep reinforcement learning-based approach for autonomous driving in highway on-ramp merge
    Wang, Huanjie
    Yuan, Shihua
    Guo, Mengyu
    Li, Xueyuan
    Lan, Wei
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2021, 235 (10-11) : 2726 - 2739
  • [4] Reinforcement learning-based optimizer to improve the steering of shield tunneling machine
    Elbaz, Khalid
    Shen, Shui-Long
    Zhou, Annan
    Yoo, Chungsik
    [J]. ACTA GEOTECHNICA, 2024, 19 (06) : 4167 - 4187
  • [5] TOWARDS THE DEEP LEARNING-BASED AUTONOMOUS COLLISION AVOIDANCE
    He, Binxin
    Xiao, Youan
    Wang, Tengfei
    Li, Zhuo
    [J]. PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 9, 2022,
  • [6] Deep reinforcement learning-based collision avoidance for an autonomous ship
    Chun, Do-Hyun
    Roh, Myung-Il
    Lee, Hye-Won
    Ha, Jisang
    Yu, Donghun
    [J]. OCEAN ENGINEERING, 2021, 234
  • [7] A probabilistic deep reinforcement learning approach for optimal monitoring of a building adjacent to deep excavation
    Pan, Yue
    Qin, Jianjun
    Zhang, Limao
    Pan, Weiqiang
    Chen, Jin-Jian
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024, 39 (05) : 656 - 678
  • [8] A Deep Reinforcement Learning Based Approach for Autonomous Overtaking
    Li, Xiaoxiang
    Qiu, Xinyou
    Wang, Jian
    Shen, Yuan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
  • [9] Deep Learning-Based Autonomous Excavation: A Bucket-Trajectory Planning Algorithm
    Huh, Jaemyung
    Bae, Jangho
    Lee, Donghyeon
    Kwak, Jaewon
    Moon, Chanyoung
    Im, Chulwhan
    Ko, Yeonho
    Kang, Tae Koo
    Hong, Daehie
    [J]. IEEE ACCESS, 2023, 11 : 38047 - 38060
  • [10] Deep Reinforcement Learning-Based Vehicle Energy Efficiency Autonomous Learning System
    Qi, Xuewei
    Luo, Yadan
    Wu, Guoyuan
    Boriboonsomsin, Kanok
    Barth, Matthew J.
    [J]. 2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1228 - 1233