A Reinforcement Learning Method for a Hybrid Flow-Shop Scheduling Problem

被引:28
|
作者
Han, Wei [1 ]
Guo, Fang [1 ]
Su, Xichao [1 ]
机构
[1] Naval Aviat Univ, Dept Airborne Vehicle Engn, Yantai 264001, Peoples R China
关键词
reinforcement learning; hybrid flow-shop scheduling problem; Markov decision processes; sortie scheduling of carrier aircraft; SHOP; OPTIMIZATION; ALGORITHM;
D O I
10.3390/a12110222
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The scheduling problems in mass production, manufacturing, assembly, synthesis, and transportation, as well as internet services, can partly be attributed to a hybrid flow-shop scheduling problem (HFSP). To solve the problem, a reinforcement learning (RL) method for HFSP is studied for the first time in this paper. HFSP is described and attributed to the Markov Decision Processes (MDP), for which the special states, actions, and reward function are designed. On this basis, the MDP framework is established. The Boltzmann exploration policy is adopted to trade-off the exploration and exploitation during choosing action in RL. Compared with the first-come-first-serve strategy that is frequently adopted when coding in most of the traditional intelligent algorithms, the rule in the RL method is first-come-first-choice, which is more conducive to achieving the global optimal solution. For validation, the RL method is utilized for scheduling in a metal processing workshop of an automobile engine factory. Then, the method is applied to the sortie scheduling of carrier aircraft in continuous dispatch. The results demonstrate that the machining and support scheduling obtained by this RL method are reasonable in result quality, real-time performance and complexity, indicating that this RL method is practical for HFSP.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Solving flow-shop scheduling problem with a reinforcement learning algorithm that generalizes the value function with neural network
    Ren, Jianfeng
    Ye, Chunming
    Yang, Feng
    ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (03) : 2787 - 2800
  • [22] Solving the flow-shop scheduling problem with human factors and two competing agents with deep reinforcement learning
    Ren, Tao
    Dong, Zhuoran
    Qi, Fang
    Weng, Jiacheng
    Xue, Hanyu
    ENGINEERING OPTIMIZATION, 2025, 57 (03) : 671 - 687
  • [23] Surgical Scheduling based on Hybrid Flow-shop Scheduling
    Huang, Guoxun
    Xiang, Wei
    Li, Chong
    Zheng, Qian
    Zhou, Shan
    Shen, Bingqian
    Chen, Saifeng
    ADVANCES IN ENGINEERING DESIGN AND OPTIMIZATION III, PTS 1 AND 2, 2012, 201-202 : 1004 - +
  • [24] Deep Reinforcement Learning Based Optimization Algorithm for Permutation Flow-Shop Scheduling
    Pan, Zixiao
    Wang, Ling
    Wang, Jingjing
    Lu, Jiawen
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (04): : 983 - 994
  • [25] Hybrid flow-shop scheduling with assembly operations
    Yokoyama, M
    INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2001, 73 (02) : 103 - 116
  • [26] A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem
    Di, Yuanzhu
    Deng, Libao
    Zhang, Lili
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 91
  • [27] A Decomposition and Coordination Scheduling Method for Flow-shop Problem Based on TOC
    ZHANG HongYuan XI YuGeng GU HanYu Institute of Automation Shanghai Jiaotong University Shanghai
    自动化学报, 2005, (02) : 182 - 187
  • [28] Flow-Shop Scheduling Problem With Batch Processing Machines via Deep Reinforcement Learning for Industrial Internet of Things
    Luo, Zihui
    Jiang, Chengling
    Liu, Liang
    Zheng, Xiaolong
    Ma, Huadong
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 1 - 16
  • [29] Multi-line hybrid flow-shop scheduling problem with energy considerations
    Taguemount, Sara
    Lamy, Damien
    Delorme, Xavier
    Casoetto, Nicolas
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2024,
  • [30] Solving flow-shop scheduling problem by hybrid particle swarm optimization algorithm
    Gao Shang
    Yang Jing-yu
    Proceedings of 2006 Chinese Control and Decision Conference, 2006, : 1006 - +