A Cyclic Contrastive Divergence Learning Algorithm for High-order RBMs

被引:3
|
作者
Luo, Dingsheng [1 ,2 ]
Wang, Yi [1 ,2 ]
Han, Xiaoqiang [1 ,2 ]
Wu, Xihong [1 ,2 ]
机构
[1] Peking Univ, Minist Educ, Key Lab Machine Percept, Beijing 100871, Peoples R China
[2] Peking Univ, Speech & Hearing Res Ctr, Beijing 100871, Peoples R China
关键词
High-order RBMs; Cyclic Contrastive Divergence Learning; Gradient Approximation; Convergence; Upper Bound; CONVERGENCE;
D O I
10.1109/ICMLA.2014.18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Restricted Boltzmann Machine (RBM), a special case of general Boltzmann Machines and a typical Probabilistic Graphical Models, has attracted much attention in recent years due to its powerful ability in extracting features and representing the distribution underlying the training data. A most commonly used algorithm in learning RBMs is called Contrastive Divergence (CD) proposed by Hinton, which starts a Markov chain at a data point and runs the chain for only a few iterations to get a low variance estimator. However, when referring to a high-order RBM, since there are interactions among its visible layers, the gradient approximation via CD learning usually becomes far from the log-likelihood gradient and even may cause CD learning to fall into an infinite loop with high reconstruction error. In this paper, a new algorithm named Cyclic Contrastive Divergence (CCD) is introduced for learning high-order RBMs. Unlike the standard CD algorithm, CCD updates the parameters according to each visible layer in turn, by borrowing the idea of Cyclic Block Coordinate Descent method. To evaluate the performance of the proposed CCD algorithm, regarding to high-order RBMs learning, both algorithms CCD and standard CD are theoretically analyzed, including convergence, estimate upper bound and both biases comparison, from which the superiority of CCD learning is revealed. Experiments on MNIST dataset for the handwritten digit classification task are performed. The experimental results show that CCD is more applicable and consistently outperforms the standard CD in both convergent speed and performance.
引用
收藏
页码:80 / 86
页数:7
相关论文
共 50 条
  • [21] Contrastive Divergence Learning with Chained Belief Propagation
    Ding, Fan
    Xue, Yexiang
    [J]. INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138, 2020, 138 : 161 - 172
  • [22] High-Order Sliding Modes Based On-Line Training Algorithm for Recurrent High-Order Neural Networks
    Alanis, Alma Y.
    Rios-Huerta, Daniel
    Rios, Jorge D.
    Arana-Daniel, Nancy
    Lopez-Franco, Carlos
    Sanchez, Edgar N.
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 8187 - 8192
  • [23] High-order approximations for noncyclic and cyclic adsorption in a biporous adsorbent
    Kim, DH
    Lee, JT
    [J]. KOREAN JOURNAL OF CHEMICAL ENGINEERING, 1999, 16 (01) : 69 - 74
  • [24] High-order approximations for noncyclic and cyclic adsorption in a biporous adsorbent
    Dong Hyun Kim
    Jietae Lee
    [J]. Korean Journal of Chemical Engineering, 1999, 16 : 69 - 74
  • [25] An improved control algorithm of high-order nonlinear systems
    Duan, Na
    Xie, Xue-Jun
    Liu, Hai-Kuan
    [J]. Zidonghua Xuebao/ Acta Automatica Sinica, 2008, 34 (10): : 1262 - 1267
  • [26] Kalman carrier recovery algorithm for high-order QAM
    Chang, Dah-Chung
    Lin, Wei-Tsen
    Chen, Yung-Fang
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2006, E89B (11) : 3117 - 3119
  • [27] A fast high-order algorithm for the multiple cavity scattering
    Zhao, Meiling
    An, Sheng
    Nie, Yufeng
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2019, 96 (01) : 135 - 157
  • [28] A parallel high-order accuracy algorithm for the Helmholtz equations
    Bao, Tiantian
    Feng, Xiufang
    [J]. INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2024, 101 (01) : 56 - 94
  • [29] Implicit high-order compact algorithm for computational acoustics
    Fung, KY
    Man, RSO
    Davis, S
    [J]. AIAA JOURNAL, 1996, 34 (10) : 2029 - 2037
  • [30] A high-order algorithm for obstacle scattering in three dimensions
    Ganesh, M
    Graham, IG
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2004, 198 (01) : 211 - 242