Generative Adversarial Imitation Learning to Search in Branch-and-Bound Algorithms

Cited by: 1
Authors
Wang, Qi [1]
Blackley, Suzanne V. [2]
Tang, Chunlei [2]
Affiliations
[1] Fudan Univ, Shanghai 200438, Peoples R China
[2] Harvard Med Sch, Boston, MA 02120 USA
Keywords
Combinatorial optimization; Reinforcement learning; Branch-and-bound; Generative adversarial imitation learning
DOI
10.1007/978-3-031-00126-0_51
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recent studies have shown that reinforcement learning (RL) can provide state-of-the-art performance at learning sophisticated heuristics by exploiting the internal structure shared by the combinatorial optimization instances in the data. However, existing RL-based methods require extensive trial and error and rely on sophisticated reward engineering, which is laborious and inefficient for practical applications. This paper proposes a novel framework (RAIL) that combines RL and generative adversarial imitation learning (GAIL) to meet this challenge by learning to search in branch-and-bound algorithms. RAIL has a policy architecture with dual decoders, corresponding to the sequence decoding of RL and the edge decoding of GAIL, respectively. The two decoders complement and constrain each other, iteratively improving the learned policy and reward function.
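For orientation, the standard GAIL objective that the abstract builds on can be written as below. This is the general saddle-point form from the original GAIL formulation (Ho and Ermon, 2016), not RAIL's own loss, which the abstract does not state. The symbols are the usual ones and are not taken from this record: \pi is the learned policy, \pi_E the expert policy, D a discriminator over state-action pairs (s, a), H(\pi) the causal entropy of the policy, and \lambda \ge 0 an entropy weight.

```latex
\min_{\pi} \max_{D} \;
  \mathbb{E}_{(s,a) \sim \pi}\bigl[\log D(s,a)\bigr]
  + \mathbb{E}_{(s,a) \sim \pi_E}\bigl[\log\bigl(1 - D(s,a)\bigr)\bigr]
  - \lambda H(\pi)
```

In this convention the discriminator is trained to output values near 1 on policy samples and near 0 on expert samples, so the policy is pushed toward state-action pairs the discriminator cannot tell apart from the expert's. How RAIL couples this term with its RL sequence decoder is described in the paper itself and is not reproduced here.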
Pages: 673-680
Page count: 8