A maintenance planning framework using online and offline deep reinforcement learning

Cited by: 5

Authors:

Bukhsh, Zaharah A. [1]

Molegraaf, Hajo [2]

Jansen, Nils [3]

Affiliations:

[1] Eindhoven Univ Technol, Eindhoven, Netherlands

[2] Rolsch Assetmanagement, Enschede, Netherlands

[3] Radboud Univ Nijmegen, Nijmegen, Netherlands

Source:

NEURAL COMPUTING & APPLICATIONS | 2023

Funding:

European Research Council;

Keywords:

Deep reinforcement learning; Maintenance planning; Water distribution systems; Conservative Q-learning; Deep Q-networks; Offline DRL; BUDGET ALLOCATION;

DOI:

10.1007/s00521-023-08560-7

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Cost-effective asset management is an area of interest across several industries. Specifically, this paper develops a deep reinforcement learning (DRL) solution to automatically determine an optimal rehabilitation policy for continuously deteriorating water pipes. We approach the problem of rehabilitation planning in an online and offline DRL setting. In online DRL, the agent interacts with a simulated environment of multiple pipes with distinct lengths, materials, and failure rate characteristics. We train the agent using deep Q-learning (DQN) to learn an optimal policy with minimal average costs and reduced failure probability. In offline learning, the agent uses static data, e.g., DQN replay data, to learn an optimal policy via a conservative Q-learning algorithm without further interactions with the environment. We demonstrate that DRL-based policies improve over standard preventive, corrective, and greedy planning alternatives. Additionally, learning from the fixed DQN replay dataset in an offline setting further improves the performance. The results warrant that the existing deterioration profiles of water pipes consisting of large and diverse states and action trajectories provide a valuable avenue to learn rehabilitation policies in the offline setting, which can be further fine-tuned using the simulator.
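The abstract names conservative Q-learning as the offline method but does not state its loss. As a rough illustration only, the sketch below computes a per-transition loss in the commonly used discrete-action form: a squared temporal-difference error plus a penalty that pushes down Q-values of actions not seen in the data. The function names, the three-action layout (do nothing, repair, replace), the reward as a negative cost, and the values of `gamma` and `alpha` are all assumptions for this example, not details taken from the paper.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def cql_loss(q_values, action, reward, next_q_values,
             gamma=0.99, alpha=1.0, done=False):
    """Per-transition conservative Q-learning loss (discrete actions).

    q_values      -- Q(s, a) for every action a in the current state
    action        -- index of the action stored in the offline dataset
    reward        -- observed reward (assumed here to be a negative cost)
    next_q_values -- Q(s', a') for every action in the next state
    alpha         -- weight of the conservative penalty (hypothetical value)
    """
    # Standard one-step Q-learning target.
    target = reward if done else reward + gamma * max(next_q_values)
    td_error = q_values[action] - target

    # Conservative term: log-sum-exp over all actions minus the Q-value
    # of the logged action. It is never negative.
    penalty = logsumexp(q_values) - q_values[action]

    return 0.5 * td_error ** 2 + alpha * penalty


# Hypothetical transition for one pipe: actions are
# 0 = do nothing, 1 = repair, 2 = replace.
loss = cql_loss(q_values=[-1.0, -2.0, -5.0], action=0,
                reward=-0.5, next_q_values=[-1.2, -2.5, -5.0])
print(f"loss = {loss:.4f}")
```

Setting `alpha=0` reduces this to an ordinary squared TD loss, which is one way to see what the conservative term adds when learning from a fixed replay dataset.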


Pages: 12

50 items in total

[1] Online and Offline Reinforcement Learning by Planning with a Learned Model
Schrittwieser, Julian
Hubert, Thomas
Mandhane, Amol
Barekatain, Mohammadamin
Antonoglou, Ioannis
Silver, David
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[2] Online Multimodal Transportation Planning using Deep Reinforcement Learning
Farahani, Amirreza
Genga, Laura
Dijkman, Remco
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 1691 - 1698
[3] An Effective Negotiating Agent Framework based on Deep Offline Reinforcement Learning
Chen, Siqi
Zhao, Jianing
Weiss, Gerhard
Su, Ran
Lei, Kaiyou
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 324 - 335
[4] Tackling Uncertainty in Online Multimodal Transportation Planning Using Deep Reinforcement Learning
Farahani, Amirreza
Genga, Laura
Dijkman, Remco
COMPUTATIONAL LOGISTICS (ICCL 2021), 2021, 13004 : 578 - 593
[5] A deep reinforcement learning framework for life-cycle maintenance planning of regional deteriorating bridges using inspection data
Lei, Xiaoming
Xia, Ye
Deng, Lu
Sun, Limin
STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2022, 65 (05)
[6] A deep reinforcement learning framework for life-cycle maintenance planning of regional deteriorating bridges using inspection data
Xiaoming Lei
Ye Xia
Lu Deng
Limin Sun
Structural and Multidisciplinary Optimization, 2022, 65
[7] A Mapless Local Path Planning Approach Using Deep Reinforcement Learning Framework
Yin, Yan
Chen, Zhiyu
Liu, Gang
Guo, Jianwei
SENSORS, 2023, 23 (04)
[8] A deep reinforcement learning approach for rail renewal and maintenance planning
Mohammadi, Reza
He, Qing
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 225
[9] Deep reinforcement learning for optimal planning of assembly line maintenance
Geurtsen, M.
Adan, I.
Atan, Z.
JOURNAL OF MANUFACTURING SYSTEMS, 2023, 69 : 170 - 188
[10] Warfarin Dose Management Using Offline Deep Reinforcement Learning
Ji, Hannah
Gill, Matthew F.
Draper, Evan W.
Liedl, David A.
Hodge, David O.
Houghton, Damon E.
Casanegra, Ana I.
CIRCULATION, 2023, 148
