The Differentiable Cross-Entropy Method

Authors
Amos, Brandon [1]
Yarats, Denis [1, 2]
Affiliations
[1] Facebook AI Res, Menlo Pk, CA 94025 USA
[2] NYU, New York, NY 10003 USA
Keywords
OPTIMIZATION; EQUATIONS
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We study the cross-entropy method (CEM) for the non-convex optimization of a continuous and parameterized objective function and introduce a differentiable variant (DCEM) that enables us to differentiate the output of CEM with respect to the objective function's parameters. In the machine learning setting, this brings CEM inside the end-to-end learning pipeline, where this has otherwise been impossible. We show applications in a synthetic energy-based structured prediction task and in non-convex continuous control. In the control setting, we show how to embed optimal action sequences into a lower-dimensional space. DCEM enables us to fine-tune CEM-based controllers with policy optimization.
Pages: 12
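
As a rough illustration of the baseline method the abstract refers to, the following is a minimal sketch of vanilla CEM applied to a parameterized objective. It assumes a diagonal Gaussian sampling distribution and a hard top-k elite selection. The function names, hyperparameters, and the quadratic objective are hypothetical choices for illustration and are not taken from the paper. The differentiable variant itself is not shown here.

```python
import numpy as np


def cem_minimize(f, dim, n_iter=30, n_samples=100, n_elite=10,
                 init_sigma=1.0, seed=0):
    """Vanilla CEM sketch: minimize f over R^dim with a diagonal Gaussian.

    All names and default values are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    mu = np.zeros(dim)
    sigma = np.full(dim, init_sigma)
    for _ in range(n_iter):
        # Sample candidate solutions from the current Gaussian.
        x = mu + sigma * rng.standard_normal((n_samples, dim))
        # Evaluate the objective on every candidate.
        values = np.array([f(xi) for xi in x])
        # Keep the n_elite candidates with the lowest objective value.
        elite = x[np.argsort(values)[:n_elite]]
        # Refit the Gaussian to the elite set (small floor keeps sigma > 0).
        mu = elite.mean(axis=0)
        sigma = elite.std(axis=0) + 1e-6
    return mu


def objective(x, theta):
    # Hypothetical parameterized objective, minimized at x = theta.
    return float(np.sum((x - theta) ** 2))


theta = np.array([1.5, -0.5])
x_hat = cem_minimize(lambda x: objective(x, theta), dim=2)
print(x_hat)
```

In this sketch the output `x_hat` depends on `theta` only through a sort-and-select step, so it is not set up for differentiation with respect to `theta`; the abstract describes the differentiable variant as the piece that makes such differentiation possible.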
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (03) : 4677 - 4688