Natural Reweighted Wake-Sleep

被引:0
|
作者
Varady, Csongor [1 ]
Volpi, Riccardo [2 ,3 ]
Malago, Luigi [2 ,3 ]
Ay, Nihat [1 ,4 ,5 ]
机构
[1] Hamburg Univ Technol, Inst Data Sci Fdn, Hamburg, Germany
[2] Transylvanian Inst Neurosci, Cluj Napoca, Romania
[3] Quaesta AI, Cluj Napoca, Romania
[4] Santa Fe Inst, Santa Fe, NM 87501 USA
[5] Univ Leipzig, Leipzig, Germany
关键词
Natural gradient; Helmholtz machine; Wake-Sleep; Information geometry; INFORMATION GEOMETRY; ALGORITHMS;
D O I
10.1016/j.neunet.2022.09.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Helmholtz Machines (HMs) are a class of generative models composed of two Sigmoid Belief Networks (SBNs), acting respectively as an encoder and a decoder. These models are commonly trained using a two-step optimization algorithm called Wake-Sleep (WS) and more recently by improved versions, such as Reweighted Wake-Sleep (RWS) and Bidirectional Helmholtz Machines (BiHM). The locality of the connections in an SBN induces sparsity in the Fisher Information Matrices associated to the probabilistic models, in the form of a finely-grained block-diagonal structure. In this paper we exploit this property to efficiently train SBNs and HMs using the natural gradient. We present a novel algorithm, called Natural Reweighted Wake-Sleep (NRWS), that corresponds to the geometric adaptation of its standard version. In a similar manner, we also introduce Natural Bidirectional Helmholtz Machine (NBiHM). Differently from previous work, we will show how for HMs the natural gradient can be efficiently computed without the need of introducing any approximation in the structure of the Fisher information matrix. The experiments performed on standard datasets from the literature show a consistent improvement of NRWS and NBiHM not only with respect to their non-geometric baselines but also with respect to state-of-the-art training algorithms for HMs. The improvement is quantified both in terms of speed of convergence as well as value of the log-likelihood reached after training.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:574 / 591
页数:18
相关论文
共 50 条
  • [1] Massively Parallel Reweighted Wake-Sleep
    Heap, Thomas
    Leech, Gavin
    Aitchison, Laurence
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 870 - 878
  • [2] Revisiting Reweighted Wake-Sleep for Models with Stochastic Control Flow
    Tuan Anh Le
    Kosiorek, Adam R.
    Siddharth, N.
    Teh, Yee Whye
    Wood, Frank
    [J]. 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 1039 - 1049
  • [3] Wake-sleep PCA
    Choi, Seungjin
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2431 - 2434
  • [4] Convergence of the Wake-Sleep algorithm
    Ikeda, S
    Amari, S
    Nakahara, H
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 239 - 245
  • [5] Wake-sleep circuitry: an overview
    Saper, Clifford B.
    Fuller, Patrick M.
    [J]. CURRENT OPINION IN NEUROBIOLOGY, 2017, 44 : 186 - 192
  • [6] Affect Across the Wake-Sleep Cycle
    Sikka, Pilleriin
    Gross, James J.
    [J]. AFFECTIVE SCIENCE, 2023, 4 (03) : 563 - 569
  • [7] CURE FOR CHILDREN BY CONTROLLED WAKE-SLEEP
    FABRE, N
    [J]. NEUROPSYCHIATRIE DE L ENFANCE ET DE L ADOLESCENCE, 1979, 27 (09) : 423 - 423
  • [8] VASOPRESSIN AND THE WAKE-SLEEP CYCLE IN THE RAT
    BIBENE, V
    PESTRE, M
    RODRIGUEZ, F
    ARNAULD, E
    PONCET, C
    VINCENT, JD
    [J]. ANNALES D ENDOCRINOLOGIE, 1985, 46 (06) : N13 - N13
  • [9] Dynamical analysis of the wake-sleep algorithm
    Shimasaki, S
    Kabashima, Y
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (01): : 41 - 49
  • [10] Wake-sleep transition as a noisy bifurcation
    Yang, Dong-Ping
    McKenzie-Sell, Lauren
    Karanjai, Angela
    Robinson, P. A.
    [J]. PHYSICAL REVIEW E, 2016, 94 (02)