A Q-learning-based algorithm for the 2D-rectangular packing problem

Cited by: 3

Authors:

Zhao, Xusheng [1]

Rao, Yunqing [1]

Meng, Ronghua [2]

Fang, Jie [1]

Affiliations:

[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, Wuhan, Hubei, Peoples R China

[2] China Three Gorges Univ, Coll Mech & Power Engn, Yichang, Hubei, Peoples R China

Source:

SOFT COMPUTING | 2023 / Vol. 27 / Issue 17

Funding:

National Natural Science Foundation of China;

Keywords:

Rectangular packing problem; Combinatorial optimization; Fitness degree; Reinforcement learning; RECTANGLE; MODELS; HEURISTICS; PLACEMENT;

DOI:

10.1007/s00500-023-08381-9

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper presents a Q-learning-based algorithm that optimizes the sequence and orientation of items for the 2D rectangular strip packing problem. A width-filled skyline represents the interior packing state, and a constructive rectangular packing algorithm is designed around the commonly adopted fitness evaluation for placement. Consecutive item packing is then modeled as a Markov decision process, in which the state is the set of already packed items and the action is the rectangle selected to be packed next, together with its orientation. We propose reverse updating of the Q-value within the Q-learning paradigm and use the resulting algorithm to optimize the sequence and orientation of the rectangles. The decreasing-size-choice mechanism in Q-learning is studied on randomly generated problems to tune the epsilon-greedy policy. We also test the Q-learning-based algorithm on the benchmark instances C21, N13, the N-series from NT, Cgcut, and Beng. Compared with a few state-of-the-art algorithms, the computational results show that the proposed algorithm can produce good packing quality when adequate execution time is allowed.
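
The formulation in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several parts are assumptions made only for illustration: a per-column height array stands in for the width-filled skyline, a lowest-then-leftmost placement rule replaces the paper's fitness evaluation, the reward is taken to be the negative final strip height, "reverse updating" is read as a backward sweep over the finished episode, and epsilon is fixed rather than governed by the decreasing-size-choice mechanism. The names `pack` and `q_learning_pack`, the hyperparameter values, and the restriction to integer dimensions are likewise hypothetical.

```python
import random
from collections import defaultdict


def pack(sequence, strip_width):
    """Place (w, h) rectangles in the given order; return (height, placements)."""
    skyline = [0] * strip_width  # current top of the packing above each unit column
    placements = []
    for w, h in sequence:
        # Lowest resting height over every feasible x; ties go to the leftmost x.
        y, x = min((max(skyline[x:x + w]), x) for x in range(strip_width - w + 1))
        skyline[x:x + w] = [y + h] * w
        placements.append((x, y, w, h))
    return max(skyline), placements


def q_learning_pack(items, strip_width, episodes=500, alpha=0.5, gamma=1.0,
                    epsilon=0.2, seed=0):
    """Search item order and orientation with tabular Q-learning (illustrative)."""
    rng = random.Random(seed)
    # Q[(state, action)]; unvisited pairs default to 0, which is optimistic
    # here because the assumed reward (negative height) is never positive.
    q = defaultdict(float)
    best_height, best_sequence = None, None

    def actions(state):
        # Action = (item index, oriented rectangle). Assumes every item fits
        # the strip width in at least one orientation.
        acts = []
        for i, (w, h) in enumerate(items):
            if i not in state:
                acts += [(i, r) for r in sorted({(w, h), (h, w)}) if r[0] <= strip_width]
        return acts

    for _ in range(episodes):
        state, trajectory, sequence = frozenset(), [], []
        while len(state) < len(items):
            acts = actions(state)
            if rng.random() < epsilon:          # explore
                action = rng.choice(acts)
            else:                               # exploit
                action = max(acts, key=lambda a: q[(state, a)])
            trajectory.append((state, action))
            sequence.append(action[1])
            state = state | {action[0]}         # state = set of packed items
        height, _ = pack(sequence, strip_width)
        if best_height is None or height < best_height:
            best_height, best_sequence = height, list(sequence)
        # Backward sweep: the terminal reward reaches early decisions
        # within the same episode.
        target = -height
        for s, a in reversed(trajectory):
            q[(s, a)] += alpha * (target - q[(s, a)])
            target = gamma * max(q[(s, b)] for b in actions(s))
    return best_height, best_sequence
```

Calling `q_learning_pack([(2, 2), (2, 2), (4, 1), (1, 3), (3, 1)], strip_width=4)` returns the lowest strip height seen across the episodes together with the oriented rectangle sequence that produced it. Because the state is a set of packed items, the table can grow exponentially with the number of items, so this sketch is only practical for small instances.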


Pages: 12057-12070

Number of pages: 14

50 items in total

[31] Q-Learning-Based High Credibility and Stability Routing Algorithm for Internet of Medical Things
Wei, Kefeng
Zhang, Lincong
Jiang, Xin
Guo, Yi
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2020, 2020
[32] A Q-learning-based Downlink Power Control Algorithm for Energy Efficiency in LTE Femtocells
Huang, Lianfen
Wen, Bin
Gao, Zhibin
Cai, Hongxiang
Li, Yujie
MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 1766 - +
[33] QTSRA: A Q-learning-based Trusted Routing Algorithm in SDN Wireless Sensor Networks
Zhang, Yujie
Li, Peng
Fan, Weibei
Wang, Ruchuan
PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024: 1881 - 1886
[34] Q-Learning-Based Adjustable Fixed-Phase Quantum Grover Search Algorithm
Guo, Ying
Shi, Wensha
Wang, Yijun
Hu, Jiankun
JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2017, 86 (02)
[35] Improved heuristic recursive strategy based on genetic algorithm for the strip rectangular packing problem
Zhang, De-Fu
Chen, Sheng-Da
Liu, Yan-Juan
Zidonghua Xuebao/Acta Automatica Sinica, 2007, 33 (09): 911 - 916
[36] A Modified Particle Swarm Optimization for the 2D Rectangular Packing Problem
Shao, Libing
Wang, Shuzong
Li, Biruo
Song, Huanhuan
2010 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS 1-3, 2010: 195 - 198
[37] A Q-learning-based network content caching method
Chen, Haijun
Tan, Guanzheng
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018
[38] Q-learning-based H∞ control for LPV systems
Wang, Hongye
Wen, Jiwei
Wan, Haiying
Xue, Huiwen
ASIAN JOURNAL OF CONTROL, 2024
[40] A meta-heuristic algorithm for the strip rectangular packing problem
Zhang, DF
Liu, YJ
Chen, SD
Xie, XG
ADVANCES IN NATURAL COMPUTATION, PT 3, PROCEEDINGS, 2005, 3612 : 1235 - 1241
