Kernel Reinforcement Learning for sampling-efficient risk management of large-scale engineering systems

被引:0
|
作者
Zhang, Dingyang [1 ,2 ]
Zhang, Yiming [1 ,2 ]
Li, Pei [3 ]
Zhang, Shuyou [3 ]
机构
[1] Zhejiang Univ, Sch Mech Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou 310027, Peoples R China
[3] Zhejiang Univ, Engn Res Ctr Design Engn & Digital Twin Zhejiang P, Hangzhou 310027, Peoples R China
关键词
Large-scale engineering systems; Kernel Reinforcement Learning (KRL); Maintenance strategy optimization; Uncertainty management; Sample-efficient learning; CONDITION-BASED MAINTENANCE; PROGNOSTICS; UNCERTAINTY; MODEL;
D O I
10.1016/j.ress.2025.111022
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Mainstream methods for maintenance scheduling of multi-state systems (e.g. aircraft engines) often encounter challenges such as uncertainty accumulation, the need for extensive training data, and instability in the training process, particularly in life-cycle cost management. This paper introduces an innovative Kernel Reinforcement Learning (KRL) approach designed to enhance the reliability and safety of multi-state systems while significantly increasing decision-making efficiency. The policy and value functions are formulated non- parametrically to capture high-value episodes and datasets. KRL integrates probabilistic setups to imbue reinforcement learning with uncertainty, enhancing exploration of state-action spaces. Prior knowledge can be seamlessly integrated with the probabilistic framework to accelerate convergence. To address the memory issues associated with kernel methods when handling large datasets, the kernel matrix is dynamically updated with screened high-value datasets. Numerical evaluations on a k-out-of-n system, a coal mining transportation system, and an aircraft engine simulation demonstrate that the proposed KRL approach achieves faster convergence and reduced life-cycle costs compared to alternative methods. Specifically, KRL reduces the number of training episodes by 2-3 orders of magnitude, with a maximum cost reduction of 92%.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Large-scale Online Kernel Learning with Random Feature Reparameterization
    Tu Dinh Nguyen
    Le, Trung
    Bui, Hung
    Phung, Dinh
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2543 - 2549
  • [42] An efficient evolutionary algorithm based on deep reinforcement learning for large-scale sparse multiobjective optimization
    Gao, Mengqi
    Feng, Xiang
    Yu, Huiqun
    Li, Xiuquan
    APPLIED INTELLIGENCE, 2023, 53 (18) : 21116 - 21139
  • [43] Algorithm architectures to support large-scale process systems engineering applications involving combinatorics, uncertainty, and risk management
    Pekny, JF
    COMPUTERS & CHEMICAL ENGINEERING, 2002, 26 (02) : 239 - 267
  • [44] An efficient evolutionary algorithm based on deep reinforcement learning for large-scale sparse multiobjective optimization
    Mengqi Gao
    Xiang Feng
    Huiqun Yu
    Xiuquan Li
    Applied Intelligence, 2023, 53 : 21116 - 21139
  • [45] Kernel-Based Autoencoders for Large-Scale Representation Learning
    Bao, Jinzhou
    Zhao, Bo
    Guo, Ping
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 112 - 117
  • [46] Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning
    Ghasemipour, Seyed Kamyar Seyed
    Freeman, Daniel
    David, Byron
    Gu, Shixiang Shane
    Kataoka, Satoshi
    Mordatch, Igor
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [47] Efficient Load Balancing In Large-Scale Systems
    Mukherjee, D.
    Borst, S. C.
    van Leeuwaarden, J. S. H.
    Whiting, P. A.
    2016 ANNUAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (CISS), 2016,
  • [48] Efficient Objective Functions for Coordinated Learning in Large-Scale Distributed OSA Systems
    NoroozOliaee, MohammadJavad
    Hamdaoui, Bechir
    Tumer, Kagan
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2013, 12 (05) : 931 - 944
  • [49] Toward Optimally Efficient Search With Deep Learning for Large-Scale MIMO Systems
    He, Le
    He, Ke
    Fan, Lisheng
    Lei, Xianfu
    Nallanathan, Arumugam
    Karagiannidis, George K.
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (05) : 3157 - 3168
  • [50] An operations based systems engineering approach for large-scale systems
    Merida, Staci N.
    Saha, Rahul A.
    2005 IEEE Aerospace Conference, Vols 1-4, 2005, : 4239 - 4250