Multi-Objective Neural Evolutionary Algorithm for Combinatorial Optimization Problems

被引:59
|
作者
Shao, Yinan [1 ]
Lin, Jerry Chun-Wei [2 ]
Srivastava, Gautam [3 ,4 ,5 ]
Guo, Dongdong [1 ]
Zhang, Hongchun [1 ]
Yi, Hu [1 ]
Jolfaei, Alireza [6 ]
机构
[1] Alibaba Inc, Hangzhou 310052, Peoples R China
[2] Western Norway Univ Appl Sci, N-5063 Bergen, Norway
[3] Brandon Univ, Dept Math & Comp Sci, Brandon, MB R7A 6A9, Canada
[4] China Med Univ, Res Ctr Interneural Comp, Taichung 40402, Taiwan
[5] Asia Univ, Coll Informat & Elect Engn, Taichung 413, Taiwan
[6] Macquarie Univ, Dept Comp, Sydney, NSW 2109, Australia
关键词
Optimization; Evolutionary computation; Heuristic algorithms; Search problems; Neural networks; Genetics; Urban areas; Attention mechanism; deep reinforcement learning (DRL); multi-objective learning; neural combinatorial optimization; neural evolutionary algorithm;
D O I
10.1109/TNNLS.2021.3105937
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been a recent surge of success in optimizing deep reinforcement learning (DRL) models with neural evolutionary algorithms. This type of method is inspired by biological evolution and uses different genetic operations to evolve neural networks. Previous neural evolutionary algorithms mainly focused on single-objective optimization problems (SOPs). In this article, we present an end-to-end multi-objective neural evolutionary algorithm based on decomposition and dominance (MONEADD) for combinatorial optimization problems. The proposed MONEADD is an end-to-end algorithm that utilizes genetic operations and rewards signals to evolve neural networks for different combinatorial optimization problems without further engineering. To accelerate convergence, a set of nondominated neural networks is maintained based on the notion of dominance and decomposition in each generation. In inference time, the trained model can be directly utilized to solve similar problems efficiently, while the conventional heuristic methods need to learn from scratch for every given test problem. To further enhance the model performance in inference time, three multi-objective search strategies are introduced in this work. Our experimental results clearly show that the proposed MONEADD has a competitive and robust performance on a bi-objective of the classic travel salesman problem (TSP), as well as Knapsack problem up to 200 instances. We also empirically show that the designed MONEADD has good scalability when distributed on multiple graphics processing units (GPUs).
引用
收藏
页码:2133 / 2143
页数:11
相关论文
共 50 条