Model-Based Relative Entropy Stochastic Search

被引:0
|
作者
Abdolmaleki, Abbas [1 ,2 ,3 ]
Lioutikov, Rudolf [4 ]
Lau, Nuno [1 ]
Reis, Luis Paulo [2 ,3 ]
Peters, Jan [4 ,6 ]
Neumann, Gerhard [5 ]
机构
[1] Univ Aveiro, IEETA, Aveiro, Portugal
[2] Univ Minho, DSI, Braga, Portugal
[3] Univ Porto, LIACC, Porto, Portugal
[4] Tech Univ Darmstadt, IAS, Darmstadt, Germany
[5] Tech Univ Darmstadt, CLAS, Darmstadt, Germany
[6] Max Planck Inst Intelligent Syst, Stuttgart, Germany
基金
欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stochastic search algorithms are general black-box optimizers. Due to their ease of use and their generality, they have recently also gained a lot of attention in operations research, machine learning and policy search. Yet, these algorithms require a lot of evaluations of the objective, scale poorly with the problem dimension, are affected by highly noisy objective functions and may converge prematurely. To alleviate these problems, we introduce a new surrogate-based stochastic search approach. We learn simple, quadratic surrogate models of the objective function. As the quality of such a quadratic approximation is limited, we do not greedily exploit the learned models. The algorithm can be misled by an inaccurate optimum introduced by the surrogate. Instead, we use information theoretic constraints to bound the 'distance' between the new and old data distribution while maximizing the objective function. Additionally the new method is able to sustain the exploration of the search distribution to avoid premature convergence. We compare our method with state of art black-box optimization methods on standard uni-modal and multi-modal optimization functions, on simulated planar robot tasks and a complex robot ball throwing task. The proposed method considerably outperforms the existing approaches.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Gradient-Aware Model-Based Policy Search
    D'Oro, Pierluca
    Metelli, Alberto Maria
    Tirinzoni, Andrea
    Papini, Matteo
    Restelli, Marcello
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3801 - 3808
  • [42] Multi-objective Search for Model-based Testing
    Wang, Rui
    Artho, Cyrille
    Kristensen, Lars Michael
    Stolz, Volker
    [J]. 2020 IEEE 20TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY (QRS 2020), 2020, : 130 - 141
  • [43] Improving protein structure prediction with model-based search
    Brunette, TJ
    Brock, O
    [J]. BIOINFORMATICS, 2005, 21 : I66 - I74
  • [44] Interactive model-based search with reactive resource allocation
    Yue Sun
    Alfredo Garcia
    [J]. Journal of Global Optimization, 2017, 67 : 135 - 149
  • [45] Interactive model-based search with reactive resource allocation
    Sun, Yue
    Garcia, Alfredo
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2017, 67 (1-2) : 135 - 149
  • [46] Facilitating Exploratory Search by Model-Based Navigational Cues
    Fu, Wai-Tat
    Kannampallil, Thomas G.
    Kang, Ruogu
    [J]. IUI 2010, 2010, : 199 - 208
  • [47] Model-Based Search for Combinatorial Optimization: A Critical Survey
    Mark Zlochin
    Mauro Birattari
    Nicolas Meuleau
    Marco Dorigo
    [J]. Annals of Operations Research, 2004, 131 : 373 - 395
  • [48] Absorption in Model-based Search Algorithms for Combinatorial Optimization
    Wu, Zijun
    Kolonko, Michael
    [J]. 2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 1744 - 1751
  • [49] Model-based search for combinatorial optimization: A critical survey
    Zlochin, M
    Birattari, M
    Meuleau, N
    Dorigo, M
    [J]. ANNALS OF OPERATIONS RESEARCH, 2004, 131 (1-4) : 373 - 395
  • [50] An Improved Stochastic Fractal Search Based on Entropy Multitrust Fusion Model for IoT-Enabled WSNs Clustering
    Xu, Chaojie
    Su, Shengchao
    Wang, Yiwang
    [J]. IEEE SENSORS JOURNAL, 2023, 23 (23) : 29694 - 29704