Initializing Deep Learning Based on Latent Dirichlet Allocation for Document Classification

Authors
Jeon, Hyung-Bae [1, 2]
Lee, Soo-Young [3]
Affiliations
[1] Korea Adv Inst Sci & Technol, Dept Bio & Brain Engn, Daejeon, South Korea
[2] Elect & Telecommun Res Inst, Daejeon, South Korea
[3] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon, South Korea
Keywords
Document classification; Deep learning; Latent Dirichlet allocation; Good initialization; Algorithm
DOI
10.1007/978-3-319-46675-0_70
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Gradient-descent learning of deep neural networks is prone to local minima, and a good initialization may depend on the task. For document classification, latent Dirichlet allocation (LDA) has been quite successful at extracting topic representations, but its performance is limited by its shallow architecture. In this study, LDA was adopted for efficient layer-by-layer pre-training of deep neural networks for a document classification task. Two-layer feedforward networks were added at the end of the process and trained using a supervised learning algorithm. With 10 different random initializations, the LDA-based initialization produced a much lower mean and standard deviation of false recognition rates than other state-of-the-art initialization methods. This suggests that the multi-layer expansion of the probabilistic generative LDA model is capable of extracting efficient hierarchical topic representations for document classification.
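The comparison reported in the abstract (mean and standard deviation of false recognition rates over 10 random initializations) can be sketched as follows. This is only an illustration of the evaluation arithmetic, not the authors' code: `summarize_over_seeds` and `run_trial` are hypothetical names, and the use of the sample standard deviation is an assumption, since the record does not say which estimator was used.

```python
import statistics
from typing import Callable, Iterable, Tuple


def summarize_over_seeds(
    run_trial: Callable[[int], float],
    seeds: Iterable[int] = range(10),
) -> Tuple[float, float]:
    """Run one trial per seed and return (mean, sample std) of the error rates.

    `run_trial` is a hypothetical callable: given a random seed, it trains a
    network from that initialization and returns its false recognition rate
    as a fraction in [0, 1].
    """
    rates = [run_trial(seed) for seed in seeds]
    if len(rates) < 2:
        raise ValueError("need at least two trials to estimate a standard deviation")
    # Assumption: sample standard deviation (n - 1 in the denominator).
    return statistics.mean(rates), statistics.stdev(rates)
```

An initialization method would then be judged better when both returned values are lower, matching the criterion stated in the abstract.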
Pages: 634-641
Page count: 8