A differential evolution with reinforcement learning for multi-objective assembly line feeding problem

被引:2
|
作者
Tao, Lue [1 ,2 ]
Dong, Yun [2 ]
Chen, Weihua [4 ]
Yang, Yang [3 ]
Su, Lijie [5 ]
Guo, Qingxin [1 ]
Wang, Gongshu [1 ]
机构
[1] Northeastern Univ, Natl Frontiers Sci Ctr Ind Intelligence & Syst Opt, Shenyang, Peoples R China
[2] Northeastern Univ, Key Lab Data Analyt & Optimizat Smart Ind, Minist Educ, Shenyang, Peoples R China
[3] Northeastern Univ, Liaoning Engn Lab Operat Analyt & Optimizat Smart, Shenyang, Peoples R China
[4] BMW Brilliance Automobile Ind, Shenyang, Peoples R China
[5] Northeastern Univ, Liaoning Key Lab Mfg Syst & Logist Optimizat, Shenyang, Peoples R China
基金
中国国家自然科学基金; 国家自然科学基金重大项目;
关键词
Assembly line feeding problem; Multi -objective optimization; Differential evolution algorithm; Reinforcement learning; OPTIMIZATION ALGORITHM; GENETIC ALGORITHM; PARTS; STOCKING; POLICIES; MODEL;
D O I
10.1016/j.cie.2022.108714
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper studies a multi-objective assembly line feeding problem (MALFP), which is a new variant of the as-sembly line feeding problem in automobile manufacturers. In this problem, part families are delivered through five feeding policies to minimize three objectives simultaneously. To describe the problem, a novel multi -objective mathematical model is formulated. It not only overcomes the difficulty of determining perfect weights for objectives without prior knowledge, but also complements the traditional model by considering extended decisions on receiving warehouses, an extra cost item for policy switching, and a hybrid inventory strategy. To solve the problem, an innovative multi-objective differential evolution with a reinforcement learning (RL) based operator selection mechanism (MODE-RLOSM) is proposed. By solving MALFP with MODE-RLOSM, near-optimal candidate solutions that are suitable for different working conditions are provided to managers for making trade-offs and implementations. Compared with state-of-the-art optimization algorithms as well as a practical decision tree approach, the proposed algorithm shows superiority in cost saving, solution quality, and convergence efficiency. Through ablation study, sensitivity analysis, and RL behavior analysis, we investigate components in MODE-RLOSM and verify their effectiveness and robustness. In addition to bringing significant cost savings, the obtained solution also gives us production enlightenment and thus improves the decision -making efficiency of the enterprise. In our research, we illustrate the influence of part diversity on policy se-lection, give managers suggestions under different objective preferences, and find it uneconomical to pursue a specific objective excessively.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Reinforcement Learning for Dynamic Mutation Process Control in Multi-Objective Differential Evolution
    Tao, Lue
    Wang, Gongshu
    Yang, Yang
    Dong, Yun
    Su, Lijie
    IFAC PAPERSONLINE, 2022, 55 (15): : 117 - 122
  • [2] Multi-objective optimization-assisted single-objective differential evolution by reinforcement learning
    Zhang, Haotian
    Guan, Xiaohong
    Wang, Yixin
    Nan, Nan
    SWARM AND EVOLUTIONARY COMPUTATION, 2025, 94
  • [3] Assembly line balancing by a new multi-objective differential evolution algorithm based on TOPSIS
    Nourmohammadi, A.
    Zandieh, M.
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2011, 49 (10) : 2833 - 2855
  • [4] Deep Reinforcement Learning for Adaptive Parameter Control in Differential Evolution for Multi-Objective Optimization
    Reijnen, Robbert
    Zhang, Yingqian
    Bukhsh, Zaharah
    Guzek, Mateusz
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 804 - 811
  • [5] Reinforcement learning-based multi-objective differential evolution algorithm for feature selection
    Yu, Xiaobing
    Hu, Zhengpeng
    Luo, Wenguan
    Xue, Yu
    INFORMATION SCIENCES, 2024, 661
  • [6] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Horie, Naoto
    Matsui, Tohgoroh
    Moriyama, Koichi
    Mutoh, Atsuko
    Inuzuka, Nobuhiro
    ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (03) : 352 - 359
  • [7] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Naoto Horie
    Tohgoroh Matsui
    Koichi Moriyama
    Atsuko Mutoh
    Nobuhiro Inuzuka
    Artificial Life and Robotics, 2019, 24 : 352 - 359
  • [8] Reinforcement learning-based multi-objective differential evolution for wind farm layout optimization
    Yu, Xiaobing
    Lu, Yangchen
    ENERGY, 2023, 284
  • [9] Reinforcement learning-based differential evolution algorithm for constrained multi-objective optimization problems
    Yu, Xiaobing
    Xu, Pingping
    Wang, Feng
    Wang, Xuming
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [10] Scheduling of Continuous Annealing With a Multi-Objective Differential Evolution Algorithm Based on Deep Reinforcement Learning
    Li, Tianyang
    Meng, Ying
    Tang, Lixin
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (02) : 1767 - 1780