Initializing Deep Learning Based on Latent Dirichlet Allocation for Document Classification

Cited by: 0

Authors:

Jeon, Hyung-Bae [1,2]

Lee, Soo-Young [3]

Affiliations:

[1] Korea Adv Inst Sci & Technol, Dept Bio & Brain Engn, Daejeon, South Korea

[2] Elect & Telecommun Res Inst, Daejeon, South Korea

[3] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon, South Korea

Source:

NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III | 2016 / Vol. 9949

Keywords:

Document classification; Deep learning; Latent Dirichlet allocation; Good initialization; ALGORITHM

DOI:

10.1007/978-3-319-46675-0_70

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The gradient-descent learning of deep neural networks is subject to local minima, and good initialization may depend on the task. In contrast, for document classification tasks, latent Dirichlet allocation (LDA) was quite successful in extracting topic representations, but its performance was limited by its shallow architecture. In this study, LDA was adopted for efficient layer-by-layer pre-training of deep neural networks for a document classification task. Two-layer feedforward networks were added at the end of the process and trained using a supervised learning algorithm. With 10 different random initializations, the LDA-based initialization generated a much lower mean and standard deviation for false recognition rates than other state-of-the-art initialization methods. This might demonstrate that the multi-layer expansion of the probabilistic generative LDA model is capable of extracting efficient hierarchical topic representations for document classification.


Pages: 634-641

Number of pages: 8

50 items in total

[1] A text classification model constructed by Latent Dirichlet Allocation and Deep Learning
Liu, Yu
Jin, Zhengping
PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING 2015 (ICMMCCE 2015), 2015, 39 : 2501 - 2504
[2] Latent Dirichlet Allocation Based Multilevel Classification
Bhutada, Sunil
Balaram, V. V. S. S. S.
Bulusu, Vishnu Vardhan
2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2014, : 1020 - 1024
[3] The microblog sentiment analysis based on latent dirichlet allocation and deep learning approaches
Ma, Xiaowen
JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (4-5) : 3113 - 3135
[4] A Machine Learning Framework for Document Classification by Topic Recognition Using Latent Dirichlet Allocation and Domain Knowledge
Lavanya, B.
Vageeswari, U.
INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 509 - 520
[5] Semi-Supervised Latent Dirichlet Allocation and its Application for Document Classification
Wang, Di
Thint, Marcus
Al-Rubaie, Ahmad
2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, : 306 - 310
[6] Latent Dirichlet Allocation for Automatic Document Categorization
Biro, Istvan
Szabo, Jacint
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 430 - 441
[7] LATENT DIRICHLET LEARNING FOR DOCUMENT SUMMARIZATION
Chang, Ying-Lang
Chien, Jen-Tzung
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1689 - 1692
[8] Classification of Indonesian News Articles based on Latent Dirichlet Allocation
Kusumaningrum, Retno
Adhy, Satriyo
Wiedjayanto, M. Ihsan Aji
Suryono
PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2016,
[9] Improving the Latent Dirichlet Allocation Document Model With WordNet
Isaly, Laura
Trias, Eric
Peterson, Gilbert
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INFORMATION WARFARE AND SECURITY, 2010, : 163 - 170
[10] Supervised labeled latent Dirichlet allocation for document categorization
Li, Ximing
Ouyang, Jihong
Zhou, Xiaotang
Lu, You
Liu, Yanhui
APPLIED INTELLIGENCE, 2015, 42 (03) : 581 - 593
