A Reinforcement Learning Algorithm for Optimal Dynamic Policies of Joint Condition-based Maintenance and Condition-based Production

被引:0
|
作者
Rasay, Hasas [1 ]
Azizi, Fariba [2 ]
Salmani, Mehrnaz [3 ]
Naderkhani, Farnoosh [3 ]
机构
[1] Kermanshah Univ Technol, Kermanshah, Iran
[2] Alzahra Univ, Fac Math Sci, Dept Stat, Tehran, Iran
[3] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
关键词
condition-based maintenance; condition-based production; reinforcement learning; Markov decision process;
D O I
10.1109/ICPHM57936.2023.10194057
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper focuses on development of joint optimal maintenance and production policy for a specific type of production system that allows for adjustable production rates. The rate of deterioration of the system is directly related to the production rate, with higher production rates resulting in greater expected deterioration. The system's deterioration can be controlled through two main actions: (1) scheduling and conducting maintenance actions referred to as maintenance policy; and (2) adjusting the production rate referred to as production policy. To determine the optimal actions given the system's state, a Markov decision process (MDP) is developed and a reinforcement learning algorithm, specifically a Q-learning algorithm, is utilized. The algorithm's hyper parameters are tuned using a value-iteration algorithm of dynamic programming. The goal is to minimize expected costs for the system over a finite planning horizon.
引用
收藏
页码:200 / 204
页数:5
相关论文
共 50 条
  • [1] A Reinforcement Learning Algorithm for Optimal Dynamic Policies of Joint Condition-based Maintenance and Condition-based Production
    Rasay, Hasas
    Azizi, Fariba
    Salmani, Mehrnaz
    Naderkhani, Farnoosh
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT, ICPHM, 2023, : 134 - 138
  • [2] Joint condition-based maintenance and condition-based production optimization
    Broek, Michiel A. J. Uit Het
    Teunter, Ruud H.
    de Jonge, Bram
    Veldman, Jasper
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2021, 214
  • [3] Reinforcement learning for optimal policy learning in condition-based maintenance
    Adsule, Aniket
    Kulkarni, Makarand
    Tewari, Asim
    [J]. IET COLLABORATIVE INTELLIGENT MANUFACTURING, 2020, 2 (04) : 182 - 188
  • [4] Reinforcement learning with Gaussian processes for condition-based maintenance
    Peng, Shenglin
    Feng, Qianmei
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 158
  • [5] Optimum condition-based maintenance policy with dynamic inspections based on reinforcement learning
    Cheng, Jianda
    Liu, Yan
    Cheng, Minghui
    Li, Wei
    Li, Tianyun
    [J]. OCEAN ENGINEERING, 2022, 261
  • [6] Condition-based inspection scheme for condition-based maintenance
    Golmakani, Hamid Reza
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2012, 50 (14) : 3920 - 3935
  • [7] Reinforcement Learning based on Stochastic Dynamic Programming for Condition-based Maintenance of Deteriorating Production Processes
    Rasay, Hasan
    Naderkhani, Farnoosh
    Golmohammadi, Amir Mohammad
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2022, : 17 - 24
  • [8] Condition-Based Production for Stochastically Deteriorating Systems: Optimal Policies and Learning
    Drent, Collin
    Drent, Melvin
    Arts, Joachim
    [J]. M&SOM-MANUFACTURING & SERVICE OPERATIONS MANAGEMENT, 2024, 26 (03) : 1137 - 1156
  • [9] Joint optimization of condition-based maintenance and condition-based production of a single equipment considering random yield and maintenance delay
    Zhang, Nan
    Cai, Kaiquan
    Deng, Yingjun
    Zhang, Jun
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2024, 241
  • [10] Condition-based maintenance
    不详
    [J]. ZKG INTERNATIONAL, 2006, 59 (12): : 30 - 30