Algorithm portfolio selection as a bandit problem with unbounded losses

被引:0
|
作者
Matteo Gagliolo
Jürgen Schmidhuber
机构
[1] Vrije Universiteit Brussel,CoMo
[2] IDSIA,Faculty of Informatics
[3] University of Lugano,undefined
关键词
Algorithm selection; Algorithm portfolios; Meta learning; Online learning; Multi-armed bandit problem; Survival analysis; Las Vegas algorithms; Computational complexity; Combinatorial optimization; Constraint programming; Satisfiability; 68T05; 68T20; 68W27; 68Q25; 62N99; 62G99;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a method that learns to allocate computation time to a given set of algorithms, of unknown performance, with the aim of solving a given sequence of problem instances in a minimum time. Analogous meta-learning techniques are typically based on models of algorithm performance, learned during a separate offline training sequence, which can be prohibitively expensive. We adopt instead an online approach, named GAMBLETA, in which algorithm performance models are iteratively updated, and used to guide allocation on a sequence of problem instances. GAMBLETA is a general method for selecting among two or more alternative algorithm portfolios. Each portfolio has its own way of allocating computation time to the available algorithms, possibly based on performance models, in which case its performance is expected to improve over time, as more runtime data becomes available. The resulting exploration-exploitation trade-off is represented as a bandit problem. In our previous work, the algorithms corresponded to the arms of the bandit, and allocations evaluated by the different portfolios were mixed, using a solver for the bandit problem with expert advice, but this required the setting of an arbitrary bound on algorithm runtimes, invalidating the optimal regret of the solver. In this paper, we propose a simpler version of GAMBLETA, in which the allocators correspond to the arms, such that a single portfolio is selected for each instance. The selection is represented as a bandit problem with partial information, and an unknown bound on losses. We devise a solver for this game, proving a bound on its expected regret. We present experiments based on results from several solver competitions, in various domains, comparing GAMBLETA with another online method.
引用
收藏
页码:49 / 86
页数:37
相关论文
共 50 条
  • [1] Algorithm portfolio selection as a bandit problem with unbounded losses
    Gagliolo, Matteo
    Schmidhuber, Jurgen
    [J]. ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2011, 61 (02) : 49 - 86
  • [2] Algorithm Selection as a Bandit Problem with Unbounded Losses
    Gagliolo, Matteo
    Schmidhuber, Juergen
    [J]. LEARNING AND INTELLIGENT OPTIMIZATION, 2010, 6073 : 82 - 96
  • [3] A SIMPLE ALGORITHM FOR THE PORTFOLIO SELECTION PROBLEM
    LEWIS, AL
    [J]. JOURNAL OF FINANCE, 1988, 43 (01): : 71 - 82
  • [4] Algorithm for the bandit problem
    Abdel, Hamid, A.A.
    Rakha, E.E.A.I.
    [J]. Advances in Modelling and Analysis A: General Mathematical and Computer Tools, 25 (1-3): : 17 - 36
  • [5] Risk-aware multi-armed bandit problem with application to portfolio selection
    Huo, Xiaoguang
    Fu, Feng
    [J]. ROYAL SOCIETY OPEN SCIENCE, 2017, 4 (11):
  • [6] Portfolio Optimization by Means of a χ-Armed Bandit Algorithm
    Moeini, Mahdi
    Wendt, Oliver
    Krumrey, Linus
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT II, 2016, 9622 : 620 - 629
  • [7] SIMPLE ALGORITHM FOR STONES VERSION OF PORTFOLIO SELECTION PROBLEM
    JUCKER, JV
    FARO, CD
    [J]. JOURNAL OF FINANCIAL AND QUANTITATIVE ANALYSIS, 1975, 10 (05) : 859 - 870
  • [8] A hybrid intelligent algorithm for portfolio selection problem with fuzzy returns
    Li, Xiang
    Zhang, Yang
    Wong, Hau-San
    Qin, Zhongfeng
    [J]. JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2009, 233 (02) : 264 - 278
  • [9] Genetic Algorithm for the Portfolio Selection Problem on the Romanian Capital Market
    Stancu, Stelian
    Predescu, Oana Madalina
    [J]. ICEME 2010: INTERNATIONAL CONFERENCE ON ENGINEERING AND META-ENGINEERING (POST-CONFERENCE EDITION), 2010, : 57 - 60
  • [10] Portfolio selection problem
    Senhaji, Kaoutar
    El Moutaouakil, Karim
    Ettaouil, Mohamed
    [J]. PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON LOGISTICS OPERATIONS MANAGEMENT (GOL'16), 2016,