A Q-learning-based algorithm for the 2D-rectangular packing problem

被引:0
|
作者
Xusheng Zhao
Yunqing Rao
Ronghua Meng
Jie Fang
机构
[1] Huazhong University of Science and Technology,School of Mechanical Science & Engineering
[2] China Three Gorges University,College of Mechanical & Power Engineering
来源
Soft Computing | 2023年 / 27卷
关键词
Rectangular packing problem; Combinatorial optimization; Fitness degree; Reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents a Q-learning-based algorithm for sequence and orientation optimization toward the 2D rectangular strip packing problem. The width-filled skyline is used to represent the interior packing state, and a constructive rectangular packing algorithm with the commonly adopted fitness evaluation for placement is designed. Then, the consecutive item packing is simulated as Markov Decision Process, where the state is defined as the set of already packed items, and the action is defined as the rectangle selected to be packed along with its orientation. We propose the reverse updating of Q-value in the paradigm of Q-learning and use the algorithm to optimize the sequence and orientation of the rectangles. The decreasing-size-choice mechanism in Q-learning is studied on randomly generated problems to optimize the setting of ε-greedy policy. We also test the Q-learning-based algorithm on the benchmark instances of C21, N13, N-series from NT, Cgcut and Beng. Compared with a few state-of-the-art algorithms, the computational results show that the proposed algorithm can produce good packing quality when adequate execution time allowed.
引用
收藏
页码:12057 / 12070
页数:13
相关论文
共 50 条
  • [21] Q-learning-based simulated annealing algorithm for constrained engineering design problems
    Hussein Samma
    Junita Mohamad-Saleh
    Shahrel Azmin Suandi
    Badr Lahasan
    Neural Computing and Applications, 2020, 32 : 5147 - 5161
  • [22] The container stowage planning problem in a barge convoy system via Q-learning-based NSGA-II algorithm
    Ling, Yu
    Pan, Lin
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4492 - 4497
  • [23] A Q-Learning-Based Hyper-Heuristic Evolutionary Algorithm for the Distributed Flexible Job-Shop Scheduling Problem
    Wu, Fang-Chun
    Qian, Bin
    Hu, Rong
    Zhang, Zi-Qi
    Wang, Bin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 251 - 261
  • [24] An efficient deterministic heuristic algorithm for the rectangular packing problem
    Chen, Mao
    Wu, Chao
    Tang, Xiangyang
    Peng, Xicheng
    Zeng, Zhizhong
    Liu, Sanya
    COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 137
  • [25] Decimal wolf pack algorithm for rectangular packing problem
    Luo Q.
    Rao Y.
    Liu Q.
    Li S.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (05): : 1169 - 1179
  • [26] A recursive algorithm for the rectangular guillotine strip packing problem
    Cui, Y.
    Gu, T.
    Zhong, Y.
    ENGINEERING OPTIMIZATION, 2008, 40 (04) : 347 - 360
  • [27] Bioinspired Algorithm for 2D Packing Problem
    Kureichik, Vladimir
    Kureichik, Liliya
    Kureichik, Vladimir, Jr.
    Zaruba, Daria
    ARTIFICIAL INTELLIGENCE AND ALGORITHMS IN INTELLIGENT SYSTEMS, 2019, 764 : 39 - 46
  • [28] Hybrid Heuristic Algorithm Based On Improved Rules & Reinforcement Learning for 2D Strip Packing Problem
    Zhu, Kai
    Ji, Naihua
    Li, Xiang Dong
    IEEE ACCESS, 2020, 8 : 226784 - 226796
  • [29] Q-Thermal: A Q-Learning-Based Thermal-Aware Routing Algorithm for 3-D Network On-Chips
    Shahabinejad, Narges
    Beitollahi, Hakem
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2020, 10 (09): : 1482 - 1490
  • [30] QLGR: A Q-learning-based Geographic FANET Routing Algorithm Based on Multi- agent Reinforcement Learning
    Qiu, Xiulin
    Xie, Yongsheng
    Wang, Yinyin
    Ye, Lei
    Yang, Yuwang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (11): : 4244 - 4274