A Q-learning-based algorithm for the 2D-rectangular packing problem

Cited by: 3
Authors
Zhao, Xusheng [1]
Rao, Yunqing [1]
Meng, Ronghua [2]
Fang, Jie [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, Wuhan, Hubei, Peoples R China
[2] China Three Gorges Univ, Coll Mech & Power Engn, Yichang, Hubei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Rectangular packing problem; Combinatorial optimization; Fitness degree; Reinforcement learning; RECTANGLE; MODELS; HEURISTICS; PLACEMENT;
DOI
10.1007/s00500-023-08381-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a Q-learning-based algorithm for sequence and orientation optimization in the 2D rectangular strip packing problem. The width-filled skyline is used to represent the interior packing state, and a constructive rectangular packing algorithm is designed that uses the commonly adopted fitness evaluation for placement. The consecutive packing of items is then modeled as a Markov decision process, where the state is defined as the set of already packed items and the action is defined as the rectangle selected to be packed next, along with its orientation. We propose reverse updating of Q-values within the Q-learning paradigm and use the algorithm to optimize the sequence and orientation of the rectangles. The decreasing-size-choice mechanism in Q-learning is studied on randomly generated problems to tune the settings of the epsilon-greedy policy. We also test the Q-learning-based algorithm on the benchmark instances C21, N13, the N-series from NT, Cgcut and Beng. Compared with a few state-of-the-art algorithms, the computational results show that the proposed algorithm can produce good packing quality when adequate execution time is allowed.
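The formulation in the abstract (state = set of packed items, action = next rectangle plus orientation, epsilon-greedy selection, Q-values updated in reverse) can be illustrated with a minimal tabular sketch. This is not the authors' implementation. The function names (`place`, `actions`, `run_episode`, `solve`) are hypothetical, and several assumptions are made: a simple lowest-then-leftmost skyline rule stands in for the paper's fitness evaluation, dimensions are integers, every item fits the strip in at least one orientation, the only reward is the negative final strip height, "reverse updating" is read as a backward sweep over the episode, and the linear epsilon decay is a placeholder rather than the paper's decreasing-size-choice mechanism.

```python
import random
from collections import defaultdict

def place(heights, w, h):
    """Drop a w x h rectangle at the lowest, then leftmost, skyline position (stand-in rule)."""
    best_x, best_y = 0, None
    for x in range(len(heights) - w + 1):
        y = max(heights[x:x + w])
        if best_y is None or y < best_y:
            best_x, best_y = x, y
    for i in range(best_x, best_x + w):
        heights[i] = best_y + h

def actions(items, packed, strip_width):
    """All (item index, rotated) choices that are unpacked and fit the strip width."""
    acts = []
    for i, (w, h) in enumerate(items):
        if i in packed:
            continue
        if w <= strip_width:
            acts.append((i, False))
        if h <= strip_width and h != w:
            acts.append((i, True))
    return acts

def run_episode(items, strip_width, q, eps, alpha=0.5, gamma=1.0, rng=random):
    """Pack all items once, then update Q-values from the last step back to the first."""
    heights = [0] * strip_width
    state = frozenset()
    trajectory = []
    while len(state) < len(items):
        acts = actions(items, state, strip_width)
        if rng.random() < eps:
            act = rng.choice(acts)                        # explore
        else:
            act = max(acts, key=lambda a: q[(state, a)])  # exploit
        i, rotated = act
        w, h = items[i]
        if rotated:
            w, h = h, w
        place(heights, w, h)
        trajectory.append((state, act))
        state = state | {i}
    height = max(heights)
    next_state = state
    for s, a in reversed(trajectory):
        if len(next_state) == len(items):
            target = -height                              # assumed terminal reward
        else:
            nxt = actions(items, next_state, strip_width)
            target = gamma * max(q[(next_state, b)] for b in nxt)
        q[(s, a)] += alpha * (target - q[(s, a)])
        next_state = s
    return height, [a for _, a in trajectory]

def solve(items, strip_width, episodes=500, seed=0):
    """Run many episodes with a decaying epsilon and keep the best packing found."""
    rng = random.Random(seed)
    q = defaultdict(float)
    best = None
    for ep in range(episodes):
        eps = max(0.05, 1.0 - ep / episodes)              # placeholder decay schedule
        height, seq = run_episode(items, strip_width, q, eps, rng=rng)
        if best is None or height < best[0]:
            best = (height, seq)
    return best
```

Calling `solve([(2, 2), (2, 2), (4, 2)], strip_width=4)` returns the lowest strip height found together with the sequence of `(item index, rotated)` actions that produced it. Sweeping backward lets the terminal height reach the earliest decisions within a single episode, which is one plausible motivation for a reverse update.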
Pages: 12057 - 12070
Page count: 14
Related papers
50 in total
  • [31] Q-Learning-Based High Credibility and Stability Routing Algorithm for Internet of Medical Things
    Wei, Kefeng
    Zhang, Lincong
    Jiang, Xin
    Guo, Yi
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
  • [32] A Q-learning-based Downlink Power Control Algorithm for Energy Efficiency in LTE Femtocells
    Huang, Lianfen
    Wen, Bin
    Gao, Zhibin
    Cai, Hongxiang
    Li, Yujie
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 1766 - +
  • [33] QTSRA: A Q-learning-based Trusted Routing Algorithm in SDN Wireless Sensor Networks
    Zhang, Yujie
    Li, Peng
    Fan, Weibei
    Wang, Ruchuan
    PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1881 - 1886
  • [34] Q-Learning-Based Adjustable Fixed-Phase Quantum Grover Search Algorithm
    Guo, Ying
    Shi, Wensha
    Wang, Yijun
    Hu, Jiankun
    JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2017, 86 (02)
  • [35] Improved heuristic recursive strategy based on genetic algorithm for the strip rectangular packing problem
    Zhang, De-Fu
    Chen, Sheng-Da
    Liu, Yan-Juan
    Zidonghua Xuebao/Acta Automatica Sinica, 2007, 33 (09): 911 - 916
  • [36] A Modified Particle Swarm Optimization for the 2D Rectangular Packing Problem
    Shao, Libing
    Wang, Shuzong
    Li, Biruo
    Song, Huanhuan
    2010 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS 1-3, 2010, : 195 - 198
  • [37] A Q-learning-based network content caching method
    Chen, Haijun
    Tan, Guanzheng
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018,
  • [38] Q-learning-based H∞ control for LPV systems
    Wang, Hongye
    Wen, Jiwei
    Wan, Haiying
    Xue, Huiwen
    ASIAN JOURNAL OF CONTROL, 2024,
  • [39] A Q-learning-based network content caching method
    Haijun Chen
    Guanzheng Tan
    EURASIP Journal on Wireless Communications and Networking, 2018
  • [40] A meta-heuristic algorithm for the strip rectangular packing problem
    Zhang, DF
    Liu, YJ
    Chen, SD
    Xie, XG
    ADVANCES IN NATURAL COMPUTATION, PT 3, PROCEEDINGS, 2005, 3612 : 1235 - 1241