Kernel Reinforcement Learning for sampling-efficient risk management of large-scale engineering systems

被引:0
|
作者
Zhang, Dingyang [1 ,2 ]
Zhang, Yiming [1 ,2 ]
Li, Pei [3 ]
Zhang, Shuyou [3 ]
机构
[1] Zhejiang Univ, Sch Mech Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou 310027, Peoples R China
[3] Zhejiang Univ, Engn Res Ctr Design Engn & Digital Twin Zhejiang P, Hangzhou 310027, Peoples R China
关键词
Large-scale engineering systems; Kernel Reinforcement Learning (KRL); Maintenance strategy optimization; Uncertainty management; Sample-efficient learning; CONDITION-BASED MAINTENANCE; PROGNOSTICS; UNCERTAINTY; MODEL;
D O I
10.1016/j.ress.2025.111022
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Mainstream methods for maintenance scheduling of multi-state systems (e.g. aircraft engines) often encounter challenges such as uncertainty accumulation, the need for extensive training data, and instability in the training process, particularly in life-cycle cost management. This paper introduces an innovative Kernel Reinforcement Learning (KRL) approach designed to enhance the reliability and safety of multi-state systems while significantly increasing decision-making efficiency. The policy and value functions are formulated non- parametrically to capture high-value episodes and datasets. KRL integrates probabilistic setups to imbue reinforcement learning with uncertainty, enhancing exploration of state-action spaces. Prior knowledge can be seamlessly integrated with the probabilistic framework to accelerate convergence. To address the memory issues associated with kernel methods when handling large datasets, the kernel matrix is dynamically updated with screened high-value datasets. Numerical evaluations on a k-out-of-n system, a coal mining transportation system, and an aircraft engine simulation demonstrate that the proposed KRL approach achieves faster convergence and reduced life-cycle costs compared to alternative methods. Specifically, KRL reduces the number of training episodes by 2-3 orders of magnitude, with a maximum cost reduction of 92%.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] RoSE: Robust Analog Circuit Parameter Optimization with Sampling-Efficient Reinforcement Learning
    Gao, Jian
    Cao, Weidong
    Zhang, Xuan
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [2] Efficient and scalable reinforcement learning for large-scale network control
    Ma, Chengdong
    Li, Aming
    Du, Yali
    Dong, Hao
    Yang, Yaodong
    NATURE MACHINE INTELLIGENCE, 2024, 6 (09) : 1006 - 1020
  • [3] Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning
    Lin, Kaixiang
    Zhao, Renyu
    Xu, Zhe
    Zhou, Jiayu
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1774 - 1783
  • [4] Large-scale Image Annotation by Efficient and Robust Kernel Metric Learning
    Feng, Zheyun
    Jin, Rong
    Jain, Anil
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1609 - 1616
  • [5] Guided probabilistic reinforcement learning for sampling-efficient maintenance scheduling of multi-component system
    Zhang, Yiming
    Zhang, Dingyang
    Zhang, Xiaoge
    Qiu, Lemiao
    Chan, Felix T. S.
    Wang, Zili
    Zhang, Shuyou
    APPLIED MATHEMATICAL MODELLING, 2023, 119 : 677 - 697
  • [6] Large-Scale Retrieval for Reinforcement Learning
    Humphreys, Peter C.
    Guez, Arthur
    Tieleman, Olivier
    Sifre, Laurent
    Weber, Theophane
    Lillicrap, Timothy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [7] Automatic Hierarchical Reinforcement Learning for Efficient Large-scale Service Composition
    Wang, Hongbing
    Huang, Guicheng
    Yu, Qi
    2016 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS), 2016, : 57 - 64
  • [8] Communication-Efficient Federated Learning for Large-Scale Multiagent Systems in ISAC: Data Augmentation With Reinforcement Learning
    Ouyang, Wenjiang
    Liu, Qian
    Mu, Junsheng
    AI-Dulaimi, Anwer
    Jing, Xiaojun
    Liu, Qilie
    IEEE SYSTEMS JOURNAL, 2024, : 1893 - 1904
  • [9] A Large-Scale Systems Engineering Perspective of Water Management in Singapore
    Seng, Eng
    Gilbert, Foo Kong Pin
    INCOSE International Symposium, 2008, 18 (01) : 2055 - 2070
  • [10] Deep reinforcement learning for scheduling in large-scale networked control systems
    Redder, Adrian
    Ramaswamy, Arunselvan
    Quevedo, Daniel E.
    IFAC PAPERSONLINE, 2019, 52 (20): : 333 - 338