A Q-learning-based algorithm for the 2D-rectangular packing problem

被引：0

作者：

Xusheng Zhao

Yunqing Rao

Ronghua Meng

Jie Fang

机构：

[1] Huazhong University of Science and Technology,School of Mechanical Science & Engineering

[2] China Three Gorges University,College of Mechanical & Power Engineering

来源：

Soft Computing | 2023年 / 27卷

关键词：

Rectangular packing problem; Combinatorial optimization; Fitness degree; Reinforcement learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper presents a Q-learning-based algorithm for sequence and orientation optimization toward the 2D rectangular strip packing problem. The width-filled skyline is used to represent the interior packing state, and a constructive rectangular packing algorithm with the commonly adopted fitness evaluation for placement is designed. Then, the consecutive item packing is simulated as Markov Decision Process, where the state is defined as the set of already packed items, and the action is defined as the rectangle selected to be packed along with its orientation. We propose the reverse updating of Q-value in the paradigm of Q-learning and use the algorithm to optimize the sequence and orientation of the rectangles. The decreasing-size-choice mechanism in Q-learning is studied on randomly generated problems to optimize the setting of ε-greedy policy. We also test the Q-learning-based algorithm on the benchmark instances of C21, N13, N-series from NT, Cgcut and Beng. Compared with a few state-of-the-art algorithms, the computational results show that the proposed algorithm can produce good packing quality when adequate execution time allowed.

引用

页码：12057 / 12070

页数：13

共 50 条

[21] Q-learning-based simulated annealing algorithm for constrained engineering design problems
Hussein Samma
Junita Mohamad-Saleh
Shahrel Azmin Suandi
Badr Lahasan
Neural Computing and Applications, 2020, 32 : 5147 - 5161
[22] The container stowage planning problem in a barge convoy system via Q-learning-based NSGA-II algorithm
Ling, Yu
Pan, Lin
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4492 - 4497
[23] A Q-Learning-Based Hyper-Heuristic Evolutionary Algorithm for the Distributed Flexible Job-Shop Scheduling Problem
Wu, Fang-Chun
Qian, Bin
Hu, Rong
Zhang, Zi-Qi
Wang, Bin
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 251 - 261
[24] An efficient deterministic heuristic algorithm for the rectangular packing problem
Chen, Mao
Wu, Chao
Tang, Xiangyang
Peng, Xicheng
Zeng, Zhizhong
Liu, Sanya
COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 137
[25] Decimal wolf pack algorithm for rectangular packing problem
Luo Q.
Rao Y.
Liu Q.
Li S.
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (05): : 1169 - 1179
[26] A recursive algorithm for the rectangular guillotine strip packing problem
Cui, Y.
Gu, T.
Zhong, Y.
ENGINEERING OPTIMIZATION, 2008, 40 (04) : 347 - 360
[27] Bioinspired Algorithm for 2D Packing Problem
Kureichik, Vladimir
Kureichik, Liliya
Kureichik, Vladimir, Jr.
Zaruba, Daria
ARTIFICIAL INTELLIGENCE AND ALGORITHMS IN INTELLIGENT SYSTEMS, 2019, 764 : 39 - 46
[28] Hybrid Heuristic Algorithm Based On Improved Rules & Reinforcement Learning for 2D Strip Packing Problem
Zhu, Kai
Ji, Naihua
Li, Xiang Dong
IEEE ACCESS, 2020, 8 : 226784 - 226796
[29] Q-Thermal: A Q-Learning-Based Thermal-Aware Routing Algorithm for 3-D Network On-Chips
Shahabinejad, Narges
Beitollahi, Hakem
IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2020, 10 (09): : 1482 - 1490
[30] QLGR: A Q-learning-based Geographic FANET Routing Algorithm Based on Multi- agent Reinforcement Learning
Qiu, Xiulin
Xie, Yongsheng
Wang, Yinyin
Ye, Lei
Yang, Yuwang
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (11): : 4244 - 4274

← 1 2 3 4 5 →