A Reinforcement Learning Method for a Hybrid Flow-Shop Scheduling Problem

被引:28
|
作者
Han, Wei [1 ]
Guo, Fang [1 ]
Su, Xichao [1 ]
机构
[1] Naval Aviat Univ, Dept Airborne Vehicle Engn, Yantai 264001, Peoples R China
关键词
reinforcement learning; hybrid flow-shop scheduling problem; Markov decision processes; sortie scheduling of carrier aircraft; SHOP; OPTIMIZATION; ALGORITHM;
D O I
10.3390/a12110222
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The scheduling problems in mass production, manufacturing, assembly, synthesis, and transportation, as well as internet services, can partly be attributed to a hybrid flow-shop scheduling problem (HFSP). To solve the problem, a reinforcement learning (RL) method for HFSP is studied for the first time in this paper. HFSP is described and attributed to the Markov Decision Processes (MDP), for which the special states, actions, and reward function are designed. On this basis, the MDP framework is established. The Boltzmann exploration policy is adopted to trade-off the exploration and exploitation during choosing action in RL. Compared with the first-come-first-serve strategy that is frequently adopted when coding in most of the traditional intelligent algorithms, the rule in the RL method is first-come-first-choice, which is more conducive to achieving the global optimal solution. For validation, the RL method is utilized for scheduling in a metal processing workshop of an automobile engine factory. Then, the method is applied to the sortie scheduling of carrier aircraft in continuous dispatch. The results demonstrate that the machining and support scheduling obtained by this RL method are reasonable in result quality, real-time performance and complexity, indicating that this RL method is practical for HFSP.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Improvement heuristic for the flow-shop scheduling problem: An adaptive-learning approach
    Agarwal, A
    Colak, S
    Eryarsoy, E
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, 169 (03) : 801 - 815
  • [42] Algorithm performance and problem structure for flow-shop scheduling
    Watson, JP
    Barbulescu, L
    Howe, AE
    Whitley, LD
    SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 688 - 695
  • [43] Flow-shop scheduling problem with fuzzy due window
    Li, Haiyan
    Wang, Li
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13 : 370 - 374
  • [44] An Improved Heuristic Algorithm for a Hybrid Flow-shop Scheduling
    Dai, Min
    Tang, Dunbing
    Zheng, Kun
    Cai, Qixiang
    MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 : 1414 - 1417
  • [45] Solving the continuous flow-shop scheduling problem by metaheuristics
    Fink, A
    Voss, S
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2003, 151 (02) : 400 - 414
  • [46] Scheduling a flow-shop problem with fuzzy processing time
    Huang, Chieh-Hung
    Lin, Feng-Tse
    ICIC Express Letters, 2015, 9 (02): : 517 - 524
  • [47] A Hybrid Evolutionary Algorithm Using Two Solution Representations for Hybrid Flow-Shop Scheduling Problem
    Fan, Jiaxin
    Li, Yingli
    Xie, Jin
    Zhang, Chunjiang
    Shen, Weiming
    Gao, Liang
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) : 1752 - 1764
  • [48] Adaptive Genetic Algorithm for Hybrid Flow-shop Scheduling
    Zhu, Xiao Chun
    Zhao, Jian Feng
    Wang, Mu Lan
    MATERIALS PROCESSING AND MANUFACTURING III, PTS 1-4, 2013, 753-755 : 2925 - +
  • [49] AN EXTENSION OF PALMER HEURISTIC FOR THE FLOW-SHOP SCHEDULING PROBLEM
    HUNDAL, TS
    RAJGOPAL, J
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 1988, 26 (06) : 1119 - 1124
  • [50] FLOW SHOP SCHEDULING WITH REINFORCEMENT LEARNING
    Zhang, Zhicong
    Wang, Weiping
    Zhong, Shouyan
    Hu, Kaishun
    ASIA-PACIFIC JOURNAL OF OPERATIONAL RESEARCH, 2013, 30 (05)