A maintenance planning framework using online and offline deep reinforcement learning

Cited by: 5
Authors
Bukhsh, Zaharah A. [1]
Molegraaf, Hajo [2]
Jansen, Nils [3]
Affiliations
[1] Eindhoven Univ Technol, Eindhoven, Netherlands
[2] Rolsch Assetmanagement, Enschede, Netherlands
[3] Radboud Univ Nijmegen, Nijmegen, Netherlands
Funding
European Research Council
Keywords
Deep reinforcement learning; Maintenance planning; Water distribution systems; Conservative Q-learning; Deep Q-networks; Offline DRL; Budget allocation
DOI
10.1007/s00521-023-08560-7
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cost-effective asset management is an area of interest across several industries. Specifically, this paper develops a deep reinforcement learning (DRL) solution to automatically determine an optimal rehabilitation policy for continuously deteriorating water pipes. We approach the problem of rehabilitation planning in both an online and an offline DRL setting. In online DRL, the agent interacts with a simulated environment of multiple pipes with distinct lengths, materials, and failure rate characteristics. We train the agent using deep Q-learning (DQN) to learn an optimal policy with minimal average costs and reduced failure probability. In offline learning, the agent uses static data, e.g., DQN replay data, to learn an optimal policy via a conservative Q-learning algorithm without further interactions with the environment. We demonstrate that DRL-based policies improve over standard preventive, corrective, and greedy planning alternatives. Additionally, learning from the fixed DQN replay dataset in an offline setting further improves the performance. The results indicate that existing deterioration profiles of water pipes, which consist of large and diverse state and action trajectories, provide a valuable avenue for learning rehabilitation policies in the offline setting; these policies can be further fine-tuned using the simulator.
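The abstract contrasts an online DQN objective with an offline conservative Q-learning (CQL) objective. As a rough illustration only, the sketch below shows one common discrete-action form of that difference: the usual squared temporal-difference error, plus a penalty that pushes down Q-values of actions not supported by the logged data. It is not the paper's implementation. The function names, the values of `gamma` and `alpha`, the three-action toy batch, and the use of negative cost as reward are all assumptions made for this example.

```python
import numpy as np


def logsumexp(x, axis=-1):
    # Numerically stable log(sum(exp(x))) along the given axis.
    m = np.max(x, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(x - m), axis=axis))


def dqn_td_loss(q, q_next_target, actions, rewards, dones, gamma=0.99):
    # Mean squared TD error for a batch of transitions.
    # q, q_next_target: arrays of shape (batch, n_actions).
    idx = np.arange(len(actions))
    q_taken = q[idx, actions]
    target = rewards + gamma * (1.0 - dones) * q_next_target.max(axis=1)
    return np.mean((q_taken - target) ** 2)


def cql_penalty(q, actions):
    # Conservative term: logsumexp over all actions minus the Q-value of the
    # action actually logged in the dataset. It is always non-negative.
    idx = np.arange(len(actions))
    return np.mean(logsumexp(q, axis=1) - q[idx, actions])


def cql_loss(q, q_next_target, actions, rewards, dones, gamma=0.99, alpha=1.0):
    # Offline objective in this sketch: TD loss plus alpha times the penalty.
    td = dqn_td_loss(q, q_next_target, actions, rewards, dones, gamma)
    return td + alpha * cql_penalty(q, actions)


# Toy batch (hypothetical): 2 logged transitions, 3 actions per pipe,
# e.g. do nothing / repair / replace. Rewards are negative costs.
q_values = np.array([[0.5, -1.0, -2.0], [-0.2, 0.1, -3.0]])
q_next = np.array([[0.0, -0.5, -1.0], [0.0, 0.0, 0.0]])
logged_actions = np.array([1, 2])
logged_rewards = np.array([-1.0, -4.0])
episode_done = np.array([0.0, 1.0])

print("TD loss: ", dqn_td_loss(q_values, q_next, logged_actions, logged_rewards, episode_done))
print("CQL loss:", cql_loss(q_values, q_next, logged_actions, logged_rewards, episode_done))
```

With `alpha=0` the offline objective in this sketch reduces to the plain TD loss; larger values of `alpha` make the learned Q-function more pessimistic about actions that rarely appear in the replay data.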
Pages: 12