Model-Based Relative Entropy Stochastic Search

被引:0
|
作者
Abdolmaleki, Abbas [1 ,2 ,3 ]
Lioutikov, Rudolf [4 ]
Lau, Nuno [1 ]
Reis, Luis Paulo [2 ,3 ]
Peters, Jan [4 ,6 ]
Neumann, Gerhard [5 ]
机构
[1] Univ Aveiro, IEETA, Aveiro, Portugal
[2] Univ Minho, DSI, Braga, Portugal
[3] Univ Porto, LIACC, Porto, Portugal
[4] Tech Univ Darmstadt, IAS, Darmstadt, Germany
[5] Tech Univ Darmstadt, CLAS, Darmstadt, Germany
[6] Max Planck Inst Intelligent Syst, Stuttgart, Germany
基金
欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stochastic search algorithms are general black-box optimizers. Due to their ease of use and their generality, they have recently also gained a lot of attention in operations research, machine learning and policy search. Yet, these algorithms require a lot of evaluations of the objective, scale poorly with the problem dimension, are affected by highly noisy objective functions and may converge prematurely. To alleviate these problems, we introduce a new surrogate-based stochastic search approach. We learn simple, quadratic surrogate models of the objective function. As the quality of such a quadratic approximation is limited, we do not greedily exploit the learned models. The algorithm can be misled by an inaccurate optimum introduced by the surrogate. Instead, we use information theoretic constraints to bound the 'distance' between the new and old data distribution while maximizing the objective function. Additionally the new method is able to sustain the exploration of the search distribution to avoid premature convergence. We compare our method with state of art black-box optimization methods on standard uni-modal and multi-modal optimization functions, on simulated planar robot tasks and a complex robot ball throwing task. The proposed method considerably outperforms the existing approaches.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Model-Based Relative Entropy Stochastic Search
    Abdolmaleki, Abbas
    Lioutikov, Rudolf
    Lau, Nuno
    Reis, Luis Paulo
    Peters, Jan
    Neumann, Gerhard
    [J]. PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'16 COMPANION), 2016, : 153 - 154
  • [2] Stochastic Approximation Trackers for Model-Based Search
    Joseph, Ajin George
    Bhatnagar, Shalabh
    [J]. 2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 741 - 748
  • [3] Model-Based Annealing Random Search with Stochastic Averaging
    Hu, Jiaqiao
    Zhou, Enlu
    Fan, Qi
    [J]. ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2014, 24 (04):
  • [4] Approximate model-based diagnosis using greedy stochastic search
    Feldman, Alexander
    Provan, Gregory
    van Gemund, Arjan
    [J]. ABSTRACTION, REFORMULATION, AND APPROXIMATION, PROCEEDINGS, 2007, 4612 : 139 - +
  • [5] Approximate Model-Based Diagnosis Using Greedy Stochastic Search
    Feldman, Alexander
    Provan, Gregory
    van Gemund, Arjan
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2010, 38 : 371 - 413
  • [6] Model-based search
    Ruml, W
    [J]. ABSTRACTION, REFORMULATION AND APPROXIMATION, PROCEEDINGS, 2005, 3607 : 365 - 366
  • [7] Model-based clustering of multivariate ordinal data relying on a stochastic binary search algorithm
    Christophe Biernacki
    Julien Jacques
    [J]. Statistics and Computing, 2016, 26 : 929 - 943
  • [8] Model-based clustering of multivariate ordinal data relying on a stochastic binary search algorithm
    Biernacki, Christophe
    Jacques, Julien
    [J]. STATISTICS AND COMPUTING, 2016, 26 (05) : 929 - 943
  • [9] Stochastic Model-Based Source Identification
    Calkins, Luke
    Khodayi-mehr, Reza
    Aquino, Wilkins
    Zavlanos, Michael
    [J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [10] Relative Entropy Policy Search
    Peters, Jan
    Muelling, Katharina
    Altuen, Yasemin
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1607 - 1612