Generative Adversarial Imitation Learning to Search in Branch-and-Bound Algorithms

被引：1

作者：

Wang, Qi ^{[1
]}

Blackley, Suzanne, V ^{[2
]}

Tang, Chunlei ^{[2
]}

机构：

[1] Fudan Univ, Shanghai 200438, Peoples R China

[2] Harvard Med Sch, Boston, MA 02120 USA

来源：

DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT II | 2022年

关键词：

Combinatorial optimization; Reinforcement learning; Branch-and-bound; Generative adversarial imitation learning;

D O I：

10.1007/978-3-031-00126-0_51

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recent studies have shown that reinforcement learning (RL) can provide state-of-the-art performance at learning sophisticated heuristics by exploiting the shared internal structure combinatorial optimization instances in the data. However, existing RL-based methods require too much trial-and-error reliant on sophisticated reward engineering, which is laborious and inefficient for practical applications. This paper proposes a novel framework (RAIL) that combines RL and generative adversarial imitation learning (GAIL) to meet the challenge by searching in branch-and-bound algorithms. RAIL has a policy architecture with dual decoders, corresponding to the sequence decoding of RL and the edge decoding of GAIL, respectively. The two complement each other and restrict each other to improve the learned policy and reward function iteratively.

引用

页码：673 / 680

页数：8

共 50 条

[31] PARALLEL BRANCH-AND-BOUND ALGORITHMS - SURVEY AND SYNTHESIS
GENDRON, B
CRAINIC, TG
OPERATIONS RESEARCH, 1994, 42 (06) : 1042 - 1066
[32] On the complexity of branch-and-bound search for random trees
Devroye, L
Zamora-Cura, C
RANDOM STRUCTURES & ALGORITHMS, 1999, 14 (04) : 309 - 327
[33] Branch-and-Bound Search for Training Cascades of Classifiers
Sychel, Dariusz
Klesk, Przemyslaw
Bera, Aneta
COMPUTATIONAL SCIENCE - ICCS 2020, PT IV, 2020, 12140 : 18 - 34
[34] A Bayesian Approach to Generative Adversarial Imitation Learning
Jeon, Wonseok
Seo, Seokin
Kim, Kee-Eung
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[35] A Category Theoretic Approach to Search Algorithms: Towards a Unified Implementation for Branch-and-Bound and Backtracking
Zheng Yujun
Xue Jinyun
Shi Haihe
ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 845 - +
[36] ANALYSIS AND IMPLEMENTATION OF BRANCH-AND-BOUND ALGORITHMS ON A HYPERCUBE MULTICOMPUTER
QUINN, MJ
IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (03) : 384 - 387
[37] BRANCH-AND-BOUND ALGORITHMS FOR SOLUTION OF RANGE ASSIGNMENT PROBLEMS
SCHON, B
ANGEWANDTE INFORMATIK, 1971, 13 (12): : 546 - &
[38] AN UPPER BOUND FOR THE SPEEDUP OF PARALLEL BEST-BOUND BRANCH-AND-BOUND ALGORITHMS
QUINN, MJ
DEO, N
BIT, 1986, 26 (01): : 35 - 43
[39] AND/OR Branch-and-Bound search for combinatorial optimization in graphical models
Marinescu, Radu
Dechter, Rina
ARTIFICIAL INTELLIGENCE, 2009, 173 (16-17) : 1457 - 1491
[40] Robot Manipulation Learning Using Generative Adversarial Imitation Learning
Jabri, Mohamed Khalil
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4893 - 4894

← 1 2 3 4 5 →